Momentum operator in curvilinear coordinates

Click For Summary
The discussion centers on the validity of the momentum operator in curvilinear coordinates, challenging the claim that \vec p=\frac{\hbar}{i} \vec \nabla applies universally. The author argues that the paper's foundation is flawed, particularly regarding the interpretation of coordinates in the context of the uncertainty principle. Participants highlight that angular variables do not maintain a consistent spread in length from the origin, complicating the uncertainty relations. The conversation also touches on the existence of an angle operator and its implications for commutation relations, with some asserting that the paper lacks rigor and relies on elementary sources. Overall, the thread critiques the paper's approach and seeks clarity on the established principles of quantum mechanics regarding momentum in different coordinate systems.
  • #31
That's of course always possible, but it's well known, that "canonical quantization" only works for position and momentum in total ##\mathbb{R}^3##. The operator algebra has to be derived from group theoretical considerations, and still then it's a hypothesis whether this procedure really describes the system at hand correctly. The only way to figure this out is to do experiments to validate of invalidate a model in question. Physics is an empirical science!
 
Physics news on Phys.org
  • #32
DrDu said:
The paper is completely scrap. Already the starting point Eq. 4 is wrong as q is not a well defined operator in the case of angular momentum, i.e. an angle operator does not exist.

I think that conclusion is a little strong, but I don't want to argue about it. However, I remember from studying quantum mechanics with periodic boundary conditions that if x and x+L represent the same physical location, then we replaced the usual commutation relation:

[p, x] = -i \hbar

by one that's appropriate for the ambiguity in x:

[p, e^{\frac{2 \pi i x}{L}}] = \frac{2 \pi \hbar}{L} e^{\frac{2 \pi i x}{L}}

The function e^{\frac{2 \pi i x}{L}} is single-valued, even though x is not.
 
  • #33
vanhees71 said:
I haven't read the paper nor the postings so far, but from the abstract of the paper, I can only conclude that it's utter nonsense. Momentum is defined as the generator for translations in Euclidean space (which is the space for inertial observers both in non-relativistic as well as special-relativistic physics, including quantum theory). Even in position representation, where
$$\hat{\vec{p}}=-\mathrm{i} \vec{\nabla},$$
you have an coordinate-independent description. You can express the nabla operator in any curvilinear coordinates you like. It's a vector operator and thus independent of the choice of coordinates!

I'm not qualified to argue, but I had heard the claim years ago, in studying quantum mechanics in spherical coordinates, that it was convenient to define:

p_r = -i \hbar (\frac{\partial}{\partial r} + \frac{1}{r})

rather than simply

p_r = -i \hbar \frac{\partial}{\partial r}

I do not remember the reason for this choice, but it agrees with the result in the paper.
 
  • Like
Likes dextercioby
  • #34
stevendaryl said:
I'm not qualified to argue, but I had heard the claim years ago, in studying quantum mechanics in spherical coordinates, that it was convenient to define:

p_r = -i \hbar (\frac{\partial}{\partial r} + \frac{1}{r})

rather than simply

p_r = -i \hbar \frac{\partial}{\partial r}

I do not remember the reason for this choice, but it agrees with the result in the paper.

For example, in this paper, equation 3.2.4:
http://users.ece.gatech.edu/~alan/ECE6451/Lectures/StudentLectures/Brown_3p2_HydrogenAtom.pdf
 
  • #35
stevendaryl said:
I'm not qualified to argue, but I had heard the claim years ago, in studying quantum mechanics in spherical coordinates, that it was convenient to define:

p_r = -i \hbar (\frac{\partial}{\partial r} + \frac{1}{r})

rather than simply

p_r = -i \hbar \frac{\partial}{\partial r}

I do not remember the reason for this choice, but it agrees with the result in the paper.
You can do so, and it may be useful, especially in compound expressions, however, ##p_r## itself is not self-adjoint.
 
  • Like
Likes dextercioby and vanhees71
  • #36
stevendaryl said:
I think that conclusion is a little strong, but I don't want to argue about it. However, I remember from studying quantum mechanics with periodic boundary conditions that if x and x+L represent the same physical location, then we replaced the usual commutation relation:

[p, x] = -i \hbar

by one that's appropriate for the ambiguity in x:

[p, e^{\frac{2 \pi i x}{L}}] = \frac{2 \pi \hbar}{L} e^{\frac{2 \pi i x}{L}}

The function e^{\frac{2 \pi i x}{L}} is single-valued, even though x is not.
We have used this (see e.g. Shyans post #13). The interpretation is however slightly different: x can be made single valued, e.g. taking mod_L. However this way it get's discontinuous and the commutation relations will only hold for exponentials ##\exp(ikx)## for ##k=2\pi/L##. So this is at variance with eq. 4 which forms the starting point of the paper cited.
 
  • Like
Likes vanhees71
  • #37
stevendaryl said:
This source clearly shows, how this is meant. Of course, this operator is NOT the operator representing the radial component of momentum in spherical coordinates. That's of course
##\hat{p}_r=-\mathrm{i} \vec{e}_r \cdot \vec{\nabla} =-\mathrm{i} \partial_r.##
Of course it is also true that
##\hat{p}_r^2 \psi(r) \neq -\Delta \psi(r),##
because the correct operator for the kinetic energy is given by the Laplace(-Beltrami) operator.

A naive derivation, aka "canonical quantization", is almost always wrong. You are lucky in Cartesian coordinates. The correct way to derive it is to use group-theoretical methods and write everything covariant. In this case that's the usual vector calculus, and the Laplace operator in spherical coordinates (applied to radial-symmetric wave functions) reads
$$\Delta \psi(r)=\frac{1}{r} \partial_r^2[r \psi(r)]=\frac{1}{r} \partial_r [\psi(r)+r \psi'(r)]=\frac{1}{r}[2 \psi'(r)+r \psi''(r)]=\psi''(r)+\frac{2}{r} \psi'(r).$$
The same holds true for the angular piece. It's correct expression can be written has ##\hat{\vec{L}}^2/r^2##, where ##\hat{\vec{L}}=\hat{\vec{x}} \times \hat{\vec{p}}##.

Another nice example, where the canonical quantization fails is the quantization of the spinning top, which can be used as a (rough) model for the rotation bands of molecule spectra.
 
  • #38
vanhees71 said:
This source clearly shows, how this is meant. Of course, this operator is NOT the operator representing the radial component of momentum in spherical coordinates. That's of course
##\hat{p}_r=-\mathrm{i} \vec{e}_r \cdot \vec{\nabla} =-\mathrm{i} \partial_r.##

The point that has been made in other papers about this definition of "radial momentum" is that it is not Hermitian:

\hat{p}_r = \vec{e}_r \cdot \vec{p} = -i \hbar \frac{\partial}{\partial r}
(\hat{p}_r)^\dagger = \vec{p} \cdot \vec{e}_r = -i \hbar (\frac{2}{r} + \frac{\partial}{\partial r})

So \hat{p}_r defined that way isn't a Hermitian operator, and so, under the usual Rules of Quantum Mechanics, can't represent an observable.

The alternative definition of p_r = -i \hbar (\frac{1}{r} + \frac{\partial}{\partial r}) is Hermitian. (It is equal to \frac{1}{2} (\hat{p}_r + (\hat{p}_r)^\dagger)).
 
  • #39
stevendaryl said:
The alternative definition of p_r = -i \hbar (\frac{1}{r} + \frac{\partial}{\partial r}) is Hermitian. (It is equal to \frac{1}{2} (\hat{p}_r + (\hat{p}_r)^\dagger)).
But it isn't self-adjoint, so it can't be an observable, either.
 
  • Like
Likes dextercioby
  • #40
stevendaryl said:
The point that has been made in other papers about this definition of "radial momentum" is that it is not Hermitian:

\hat{p}_r = \vec{e}_r \cdot \vec{p} = -i \hbar \frac{\partial}{\partial r}
(\hat{p}_r)^\dagger = \vec{p} \cdot \vec{e}_r = -i \hbar (\frac{2}{r} + \frac{\partial}{\partial r})

So \hat{p}_r defined that way isn't a Hermitian operator, and so, under the usual Rules of Quantum Mechanics, can't represent an observable.

The alternative definition of p_r = -i \hbar (\frac{1}{r} + \frac{\partial}{\partial r}) is Hermitian. (It is equal to \frac{1}{2} (\hat{p}_r + (\hat{p}_r)^\dagger)).

So that actually is the starting point of the disputed paper. By definition, if \hat{A} is an operator, then \hat{A}^\dagger is defined by:

\langle \psi | \hat{A}^\dagger | \phi \rangle = \langle \phi | \hat{A} | \psi \rangle^*

For the Hilbert space of square-integrable functions in 3-dimensional space, this implies:

\int \psi^* (\hat{A}^\dagger \phi) \sqrt{g} dr d\theta d\phi = \int (\hat{A} \psi)^* \phi \sqrt{g} dr d\theta d\phi

where g is the determinant of the metric, which is just g = r^4 sin^2(\theta) in the usual spherical coordinates in flat 3-space. It follows that:

(-i \hbar \partial_r)^\dagger = -i \hbar (\partial_r + \frac{\partial_r \sqrt{g}}{\sqrt{g}}) = -i \hbar(\partial_r + \frac{2}{r})

So, really, that's the point of the article, which is that unless (\partial_j g) = 0 (which is only true in general for Cartesian coordinates x_j in flat space), then -i \hbar \partial_j will not be a Hermitian operator, so it can't represent an observable.
 
Last edited:
  • #41
DrDu said:
But it isn't self-adjoint, so it can't be an observable, either.

Are you using "self-adjoint" to mean \hat{A}^\dagger = \hat{A}? Then it certainly is self-adjoint.

By definition of adjoint with the usual Hilbert space of square-integrable functions on 3-space,

\int \psi^* ((\partial_r)^\dagger \phi) \ r^2 sin^2(\theta) dr d\theta d\phi = \int (\partial_r \psi)^* \phi \ r^2 sin^2(\theta) dr d\theta d\phi
= \int \partial_r (\psi^* \phi \ r^2 sin^2(\theta)) dr d\theta d\phi - \int \psi^* (\partial_r \phi \ r^2 sin^2(\theta)) dr d\theta d\phi

The first integral goes to zero, and the second one is equal to

- \int \psi^* (\partial_r \phi + \frac{2}{r}) \ r^2 sin^2(\theta) dr d\theta d\phi

(because you have to take a derivative of r^2 sin^2(theta) as well as \phi)

So the conclusion (as I pointed out in another post) is that (\partial_r)^\dagger = - (\partial_r + \frac{2}{r}).
This implies that (-i \hbar(\partial_r + \frac{1}{r}))^\dagger = -i \hbar(\partial_r + \frac{1}{r}). So it's self-adjoint.
 
Last edited:
  • #42
The real point of the article, now that I've thought about it a little more, is that the adjoint of a differential operator involves the metric:

In curvilinear coordinates x^a, with corresponding metric g_{a b} and determinant g,

\partial_a^\dagger = - (\partial_a + \frac{1}{\sqrt{g}} \partial_a \sqrt{g})

So -i \hbar \partial_a is not self-adjoint, except in the special case of Cartesian coordinates in flat space.
 
  • #43
The point is not about ##p_r## being symmetric but about the ranges of definition of ##p_r## and ##p_r^\dagger## not to coincide.
 
  • Like
Likes dextercioby
  • #44
DrDu said:
The point is not about ##p_r## being symmetric but about the ranges of definition of ##p_r## and ##p_r^\dagger## not to coincide.

I'm not sure I understand what that sentence means. What does it mean for p_r to be symmetric?
 
  • #45
Hermitian and symmetric are synonymous.
stevendaryl said:
= \int \partial_r (\psi^* \phi \ r^2 sin^2(\theta)) dr d\theta d\phi - \int \psi^* (\partial_r \phi \ r^2 sin^2(\theta)) dr d\theta d\phi

The first integral goes to zero,

Edit: Ups, that was not what I wanted to write. Take the function ##\phi=r^{-1} \exp(-r)##. It is square integrable and has a well defined first derivative. Hence you could include it into the range of definition of ##p_r##. However, then your can't include it into the range of definition of ##p_r^\dagger## as ##\langle \phi |p_r \phi\rangle ## diverges. But if you don't include it into the range of definition of ##p_r##, you can include it into the range of definition of ##p_r^\dagger##. Hence you can't make the ranges of ##p_r## and ##p_r^\dagger## to coincice. Hence ##p_r## can't be self adjoint.
This is formalized using the deficiency index of the operator which is (1,0) for ##p_r##. For an operator to have a self-adjoint extensions, the two indices have to be equal.
 
Last edited:
  • Like
Likes dextercioby
  • #46
Again, an introduction to quantum theory by Levin, this time the last part of section 10.1.
He says that the operator ## \hat{P}_{rad}(r)=\hat e_r \cdot \hat{ \vec P}(\vec r) ## is not hermitian and can't be the radial momentum operator but its symmetrization ## \hat{P}_{rad}(r)=\frac 1 2[(\frac{\vec r}{r})\cdot \hat{ \vec P}(\vec r) +\hat{ \vec P}(\vec r)\cdot (\frac{\vec r}{r})] ## leads to the correct Hermitian form. Now because we have ## \hat{ \vec P}(\vec r)\cdot (\frac{\vec r}{r})=-\frac{2 i \hbar }{r} ## , we'll get ## \hat{P}_{rad}(r)=-i\hbar (\frac{\partial}{\partial r}+\frac 1 r) ##.

P.S.
Dude, this book by Levin is really nice!

EDIT: oops...stevendaryl has already mentioned this in post #38!
So why the argument?
 
  • Like
Likes strangerep
  • #47
DrDu said:
Hermitian and symmetric are synonymous.

What if ##\phi=r^{-(1.1)} \exp(-r)##and ##\psi=1/\exp(-r)##? Both functions are square integrable and both are in the range of definition of ##p_r##, yet, the first integral won't vanish.

An operator is only Hermitian or not relative to a set of elements of the Hilbert space. That has nothing particularly to do with p_r. With the usual 1-D momentum operator, p_x = -i \hbar \partial_x, the proof that it is Hermitian uses:

\int \partial_x (\psi^* \phi) dx = 0

The proof goes:
  1. \int \partial_x (\psi^* \phi) dx = 0
  2. But also, \int \partial_x (\psi^* \phi) dx = \int (\partial_x \psi^*) \phi dx + \int \psi^* (\partial_x \phi) dx
  3. Therefore, \int (\partial_x \psi^*) \phi dx = - \int \psi^* (\partial_x \phi) dx
  4. By definition of \dagger, it follows that \partial_x^\dagger = - \partial_x
  5. Therefore, (-i \hbar \partial_x)^\dagger = -i \hbar \partial_x
If equation 1 doesn't hold, then the proof that -i \hbar \partial_x is Hermitian isn't valid.
 
  • #48
This is some standard stuff which I am a bit rusty. The reference of coice is the book by Glazman and Akhiezer.
The following article is also very instructive:
http://arxiv.org/abs/quant-ph/0103153
 
  • Like
Likes dextercioby
  • #49
Shyan said:
[...] its symmetrization [...] leads to the correct Hermitian form.
Ha! :oldlaugh:

I'd forgotten that (important) symmetrization trick too -- despite the fact that I've been crunching some quantum Runge-Lenz vector stuff in the hydrogen/kepler problem recently, where it's essential. (Sigh.)

So easy to overlook things like that... :blushing:
 
  • #50
DrDu is right, of course. Let's discuss both operators for the so-called "radial momentum" in spherical coordinates. More precisely, let's set the problem properly from a mathematical viewpoint: Let's take the "complicated" one first:

p_r =: -i\hbar \left(\frac{\partial}{\partial r} + \frac {1}{r} \right)

which we want to study in the Hilbert space \mathcal{H} = \mathcal{L}^2 \left((0,\infty), r^2 dr\right).

A careful analysis shows that this operator in well defined in the Hilbert space under study, it has a maximum domain dense everywhere (in the Hilbert space) made up of absolutely continuous functions on any finite sub-interval of ## (0,\infty)## which go to 0 in the limit r→∞ and moreover go to 0 when r→0 with the additional property that the partial derivative (actually we neglect spherical angles, so we can use safely use d instead of ∂) also belongs to ##\mathcal{H}##, so it has a unique adjoint: ##p_{r}^{\dagger}## (whose domain we needn't calculate, see below).

Now let's use what von Neumann invented as far back as 1929: Assume the adjoint ##p_{r}^{\dagger}## is a (closed) extension of ##p_r##. Thus, for any ##\psi (r) \subset D_{p_{r}^{\dagger}} ## we can consider the deficiency indices equation in the form:

p_{r}^{\dagger} \psi_{\pm} (r) = \pm i \frac{\hbar}{d} \psi_{\pm} (r) , with d having the dimension of length and d>0 (it's a constant to account for the length dimension in the derivative + fraction in the LHS)

This is a genuine differential equation, since we assume that ##p_{r}^{\dagger}## acts in the same way as ##p_r ##.

The ODE:

-i\hbar \frac{d\psi_{\pm} (r)}{dr} - i\hbar \frac{\psi_{\pm} (r)}{r} = \pm i \frac{\hbar}{d} \psi_{\pm} (r) has the solution (family of solutions):

\psi_{+} (r) = \frac{C_{+}}{r} e^{ -\frac{r}{d}} and

\psi_{-} (r) = C_{-} r {} {} e^{ +\frac{r}{d}}

The second solution is not in the Hilbert space ##\mathcal{H}##, while the first solution is. [The first solution is in the domain of ##p_{r}^{\dagger} ##, but not in the domain of ##p_{r}## - thus a hint that ##p_r## can't be self-ajoint]. So the v-N deficiency indices are (1,0), in other words, ##p_r## is a closed symmetric operator with no self-adjoint extension on its maximum domain of definition.
Thus DrDu is right.

For the simpler (so-called 'unsymmetrized' version) ##p_r = -i \hbar \frac{\partial}{\partial r} ## one is again invited to do the simpler maths outlined above (it's already done in a different Hilbert space in the quoted article by Bonneaux and al. which in turn cite the book by Akhiezer & Glazman as their source). He/she will reach the same conclusion bolded above.

So you see, even if you naively symmetrize something, it won't generate self-adjoint operators, hence won't generate observables. In other words, the "radial momentum" is not an observable in QM, it turns out that only the Schrödinger/Dirac Hamiltonian is.
 
Last edited:
  • Like
Likes vanhees71 and ShayanJ
  • #51
Let's hear from Pauli:
Pauli said:
The radial momentum operator defined through ## \vec p_r f=\frac{\hbar}{ir}\frac{\partial}{\partial r}(rf) ## behaves in ordinary space in exactly the same way as the operator ## \frac{\hbar}{i}\frac{\partial}{\partial x}## in the half space. It is Hermitian, but its matrices cannot be diagonalised. However, the operator ## p_r^2 f=-\hbar^2 \frac 1 r \frac{\partial^2}{\partial r^2}(rf) ## appearing in the Hamiltonian, can very well be diagonalized and has the eigenfunctions ## \frac 1 r \sin{kr} ##.
 
Last edited:
  • Like
Likes dextercioby
  • #52
Beyond all mathematical formalism it should be clear on physical grounds that there can't be a radial momentum operator:
It should generate a translation along r: ##\psi(r)\to \psi(r-a)##. If the shift a is negative, part of the function will get shifted outside ##[0,\infty]## into the negative range. Hence this translation can't be unitary and its generator not self-adjoint.
On a closed interval [0, 1] this problem can be avoided as we can "fold back" that part which gets shifted out on one end on the other.
 
  • Like
Likes vanhees71
  • #53
It's of course not ##-\mathrm{i} \partial_r## that represents a physical quantity as an operator, because it's of course not self-adjoint (not even Hermitean). Also the other operator discussed, which is made Hermitean by symmetrization (it's not even self-adjoint as pointed out before in #50) doesn't represent an observable, which one can easily interpret physically. This physical interpretation is possible by the symmetry analysis using the ray representations of the Galilei or proper orthochronous Poincare group underlying the derivation of the observable algebra, represented by the generators of the one-parameter subgroups.

The momentum operator is the generator of translations, and it's a (co-)vector given by
$$\vec{\hat{p}}=-\mathrm{i} \vec{\nabla}.$$
It's square is a scalar and as such invariant under isomorphisms. Thus one has to use the covariant expression
$$\Delta=\mathrm{div} \; \mathrm{grad}$$
in the nonrelativistic kinetic-energy operator
$$\frac{1}{2m} \hat{\vec{p}}^2=-\frac{1}{2m} \Delta$$
to get the correct Hamiltonian for the Schrödinger equation in position representation.

I guess, it was misleading to call the formal component ##-\mathrm{i} \partial_r## of the gradient in spherical coordinates ##\hat{p}_r##, because it has not the meaning of a proper vector component. What's, however correct is
$$\vec{\nabla}=\vec{e}_r \partial_r + \vec{e}_{\vartheta} \frac{1}{r} \partial_{\vartheta} + \vec{e}_{\varphi} \frac{1}{r \sin \vartheta} \partial_{\varphi}.$$
Then you have to take into account the dependence of the basis vectors on position, when squaring the nabla operator.

That's all. It's just vector analysis in terms of (anholonomous!) orthonormal curvilinear coordinates, known already from the electrodynamics lecture! There's nothing specifically quantum mechanical here!
 
Last edited:
  • #54
vanhees71 said:
Also the other operator discussed, which is made self-adjoint by symmetrization doesn't represent an observable, which one can easily interpret physically..
Dextercioby just provided all the mathematical details in post #50 why even this symmetrized operator isn't self-adjoint.
 
  • #55
DrDu said:
Dextercioby just provided all the mathematical details in post #50 why even this symmetrized operator isn't self-adjoint.
Good point; I've corrected my posting accordingly.
 
  • #56
strangerep said:
Ha! :oldlaugh:

I'd forgotten that (important) symmetrization trick too -- despite the fact that I've been crunching some quantum Runge-Lenz vector stuff in the hydrogen/kepler problem recently, where it's essential. (Sigh.)

So easy to overlook things like that... :blushing:
The Runge-Lenz business is a bit cumbersome. I came to the conclusion that in this case it's easier to work in the position representation throughout (and use Mathematica to avoid boring technical calculations with derivatives:-)).
 
  • #57
vanhees71 said:
The Runge-Lenz business is a bit cumbersome.
More than a "bit". My (by hand) computations of ##[A_i, A_j]## and ##{\mathbf A}^2## consume several pages of latex, even with lots of auxiliary utility formulas. This stuff must have been a massive PITA for people before computers were invented.
 
  • #58
Yeah, it's going back to Pauli, who knew the Runge-Lenz vector from classical mechanics and thus was able to solve the hydrogen-atom problem in terms of matrix mechanics even before Schrödinger came up with his much more convenient wave-mechanics treatment. I'm always glad to live in a time where Computer algebra is available that helps us to avoid such cumbersome (but boring) calculations. Of course, it's worse doing it to get some familiarity with the operator algebra and Lie algebras and Lie groups.
 
  • #59
DrDu said:
Beyond all mathematical formalism it should be clear on physical grounds that there can't be a radial momentum operator:
It should generate a translation along r: ##\psi(r)\to \psi(r-a)##. If the shift a is negative, part of the function will get shifted outside ##[0,\infty]## into the negative range. Hence this translation can't be unitary and its generator not self-adjoint.
On a closed interval [0, 1] this problem can be avoided as we can "fold back" that part which gets shifted out on one end on the other.

That's a nice insight. While I put all the gory details and shyan quoted Pauli's textbook (I sense Pauli quoting some original work of von Neumann), the "issue" of the so-called "radial angular momentum" had been solved without resorting to deficiency indices since 1973, see the original (behind a paywall, unfortunately) found here: http://scitation.aip.org/content/aapt/journal/ajp/41/8/10.1119/1.1987445

One can also read this: http://iopscience.iop.org/0143-0807/22/4/308/refs
 
Last edited:

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 14 ·
Replies
14
Views
3K
Replies
2
Views
2K
Replies
37
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
822
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 46 ·
2
Replies
46
Views
11K