Chain rule for functions of operators?

Click For Summary

Discussion Overview

The discussion revolves around the differentiation of operator-valued functions, particularly in the context of quantum mechanics. Participants explore the implications of applying the chain rule to functions of operators, addressing both theoretical aspects and specific examples involving time-dependent Hamiltonians and spectral decompositions.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants question how to define the derivative df/dA in the context of operator-valued functions, suggesting it may not be a useful definition.
  • One participant proposes a Taylor series representation for functions of operators, specifically mentioning the function f(A) = e^A, while noting potential convergence issues.
  • Another participant raises a specific problem involving a time-dependent Hamiltonian H(t) and the need for an operator Q(t) such that dQ/dt = H, questioning the implications of non-commutativity between H and Q.
  • A participant suggests that the directional derivative in the direction of an operator A could be represented as a limit expression, indicating that standard derivative notations may be inadequate.
  • Some participants discuss the conditions under which a function f(A) can be defined using spectral decomposition, emphasizing the need for self-adjoint or unitary operators.
  • Concerns are raised about the complexity of differentiating functions of a continuous family of operators A(t), particularly regarding the time-dependent eigenvalues and eigenstates.

Areas of Agreement / Disagreement

Participants express differing views on the definitions and implications of derivatives of operator-valued functions. There is no consensus on how to approach the differentiation of these functions, particularly in the context of non-commuting operators and time-dependent scenarios.

Contextual Notes

Limitations include unresolved definitions of derivatives in the context of operators, potential convergence issues with Taylor series, and the complexities introduced by non-commutativity in operator differentiation.

pellman
Messages
683
Reaction score
6
This is strictly a math question but I figured that since it is something which would show up in QM, the quantum folks might be already familiar with it.

Suppose we have an operator valued function A(x) of a real parameter x and another function f, both of which have well defined derivatives.

consider [tex]\frac{d}{dx}f(A(x))[/tex]

Does this equal

[tex]\frac{df}{dA}\frac{dA}{dx}[/tex]

or

[tex]\frac{dA}{dx}\frac{df}{dA}[/tex]

or something else? Of course, if A and dA/dx commute, then either expression is good. But it is not clear to me that A and dA/dx would necessarily commute.
 
Physics news on Phys.org
How do you define df/dA? (I don't think it's something we even want to define).
 
Fredrik said:
How do you define df/dA? (I don't think it's something we even want to define).

If f(u) is R --> R and has a Taylor series representation

[tex]f(u)=\Sigma \frac{1}{n!}f_n u^n[/tex]

where the f_n are just coefficients. Then

[tex]f'(u)=\Sigma \frac{1}{n!}f_{n+1} u^n[/tex]


We can similarly put

[tex]f(A)=\Sigma \frac{1}{n!}f_n A^n[/tex]

[tex]f'(A)=\Sigma \frac{1}{n!}f_{n+1} A^n[/tex]

For some f this may not work, it may not converge, blah, blah, blah. Let's just assume f is a function for which this works. The actual function I am interested in is [tex]f(A)=e^A[/tex], so f(A) = df/dA anyway.
 
Maybe I shouldn't be so general. My problem is this:

Suppose we have time-dependent Hamiltonian H(t). Then we can no longer write

[tex]|\Psi(t)\rangle = e^{-iHt}|\Psi(0)\rangle[/tex]

because dH/dt != 0 . What we need is an operator Q(t) such that dQ/dt=H .

Then we would have

[tex]i\frac{d}{dt}|\Psi(t)\rangle =i\frac{d}{dt}e^{-iQ(t)}|\Psi(0)\rangle[/tex]

[tex]=He^{-iQ(t)}|\Psi(0)\rangle[/tex]

or would it be

[tex]=e^{-iQ(t)}H|\Psi(0)\rangle[/tex]?

If H does not commute with Q, then the latter means we are not dealing with a solution to the Schrödinger equation. So what is

[tex]\frac{d}{dt}e^{-iQ(t)}=?[/tex]
 
(I wrote this before I saw your last post).

I think that's a directional derivative in the direction of A

[tex]\lim_{t\rightarrow 0}\frac{f(A+t\frac{A}{\|A\|})-f(A)}{t}[/tex]

Both the df/dA notation and the f'(A) notation seem very inadequate for directional derivatives. You could use something like [itex]D_X f(A)[/itex] for the directional derivative in direction X, at A. Your df/dA would then be [itex]D_A f(A)[/itex]. However, when we take the derivate of exponentials, don't we always do it with respect to a parameter? For example, when we prove that A is self-adjoint if U=exp(itA) is unitary:

[tex]U^\dagger U=I[/tex]

[tex]U^\dagger=U^{-1}[/tex]

[tex]e^{-itA^\dagger}=e^{-itA}[/tex]

Now apply [tex]\frac{d}{dt}\bigg|_0[/tex] to both sides, and we're done.

Added after I read your post #4: If Q(t) commutes with Q(s) for all t and s, then Q'(t) commutes with Q(t) and therefore with exp(iQ(t)), so the two options are equivalent. I need to think about the possibility that Q(t) doesn't commute with Q(s).
 
Last edited:
If your A is either self-adjoint or unitary in a (rigged) Hilbert space, then you can easily define a function f(A) by the means of the spectral decomposition of A. Then you can compute a derivative, but, of course, under tight conditions of convergence.
 
Fredrik said:
I need to think about the possibility that Q(t) doesn't commute with Q(s).
I don't think there are any simple formulas in this case. Note e.g. that d/dt Q(t)2=Q'(t)Q(t)+Q(t)Q'(t). So if we try to apply d/dt to each term of the exponential, things are already weird in the second order term.
 
I see what you mean. Darn. I was hoping this would have a simple answer.
 
bigubau said:
If your A is either self-adjoint or unitary in a (rigged) Hilbert space, then you can easily define a function f(A) by the means of the spectral decomposition of A. Then you can compute a derivative, but, of course, under tight conditions of convergence.

Umm,... how does this work when one is dealing is a continuous family of operators
such as A(t) ?

E.g., for a given time, we have an operator [itex]A_0 = A(t=0)[/itex], (assumed to self-adjoint, say),
then we can spectral-decompose in terms of its eigenvalues and eigenstates:

[tex] f(A_0) ~=~ \int da_0 f(a_0) |a_0\rangle \langle a_0| ~~.[/tex]

But each A(t) will have a different set of eigenvalues and eigenstates in general,

[tex] f(A_t) ~=~ \int da_t f(a_t) |a_t\rangle \langle a_t| ~~.[/tex]

so how does one take the t derivative of the LHS without first computing
the time-dependent eigenvalues and eigenstates explicitly?

(Or did I misunderstand you?)
 

Similar threads

  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
18K
  • · Replies 59 ·
2
Replies
59
Views
6K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 12 ·
Replies
12
Views
6K