# Commuting operators require simultaneous eigenfunctions?

#### kmm

Gold Member
Here is what I understand. The generalized uncertainty principle is: $\sigma^{2}_{A} \sigma^{2}_{B} \geq ( \frac{1}{2i} \langle [ \hat{A}, \hat{B} ] \rangle )^2$

So if $\hat{A}$ and $\hat{B}$ commute, then the commutator $[ \hat{A}, \hat{B} ] = 0$ and the operators are compatible. What I don't understand about this is why two compatible operators must have shared eigenfunctions or that in other words, incompatible operators such as the position and momentum operators of the original Heisenberg uncertainty principle cannot share eigenfunctions. I'm having a hard time completely understanding the proof of this.

Proof: If we have $\hat{A} f_{n} = \lambda_{n} f_{n}$ and $\hat{B}f_{n} = \mu_{n} f_{n}$ so that $f_{n}(x)$ are eigenfunctions of both operators and the set $\{f_{n}\}$ are complete so that any function $f(x)$ can be written as a linear combination of them $f = \Sigma c_{n} f{n}$. Then: $$[ \hat{A}, \hat{B} ] f = (\hat{A} \hat{B} - \hat{B} \hat{A}) \Sigma c_{n} f{n} = \hat{A}( \Sigma c_{n} \mu_{n} f{n} ) - \hat{B} (\Sigma c_{n} \lambda_{n} f{n}) = \Sigma c_{n} \mu_{n} \lambda_{n} f{n} - \Sigma c_{n} \mu_{n} \lambda_{n} f{n} = 0$$ Since this is true for any $f(x)$ then $[ \hat{A}, \hat{B} ] = 0$

I still don't quite see how this shows that incompatible operators don't share any eigenfunctions. I see how this means that if two operators share a complete set of eigenfunctions then they must commute. But I don't see how this rules out the possibility of two incompatible operators sharing some "incomplete" set of eigenfunctions. Maybe there is no such thing as an "incomplete" set of eigenfunctions? The way I understand it, the eigenfunctions of a Hermitian operator are complete, but does this mean that some subset of the eigenfunctions are also complete? If not, then it seems that this proof has not ruled out that two incompatible operators could share some set of eigenfunctions, although incomplete. I'm pretty certain that's wrong, but I'm not sure why or how I'm thinking wrong about this.

Last edited:
Related Quantum Physics News on Phys.org

#### bhobba

Mentor
You made an assumption the eigenfunctions are the same in your proof.

You may find the following helpful:
http://www.indiana.edu/~ssiweb/C561/PDFfiles/Simult-Exp2008.pdf

That only works if its non degenerate.

For the degenerate case remember physically you can relabel the outcomes anyway you like so you can easily turn a degenerate case to a non degenerate one without altering the physics.

If two operators share the same common eigenvectors then its easy to see they must commute. So its really iff.

I will let you think about the degenerate case which really requires using resolutions of the identity and not eigenvectors.

Thanks
Bill

Last edited:

#### kmm

Gold Member
You made an assumption the eigenfunctions are the same in your proof.
Yes, but what throws me off is that we assumed the set of eigenfunctions are complete. So what was proved that if two operators share a complete set of eigenfunctions, then they commute. But can operators share an "incomplete" set of eigenfunctions? So if an operator $\hat{A}$ has the set of eigenfunctions, $f_{1}, f_{2}, f_{3} ... f_{n}$ could there be another operator $\hat{B}$ that shares the eigenfunctions $f_{1}, f_{2}, f_{3} ... f_{n' < n}$?

That helps, but I'm definitely going to have to think about the degenerate case.

Actually, here's a proof that seems to me to show that if two operators share any eigenfunctions, then the operators will commute. Perhaps you can tell me if you think I'm missing something here.

Proof: If operators $\hat{A}$ and $\hat{B}$ share an eigenfunction then, $\hat{A} f = \lambda f$ and $\hat{B} f = \mu f$

From this $\hat{A} \hat{B} f = \hat{A} \mu f = \mu \hat{A} f = \mu \lambda f$ and $\hat{B} \hat{A} f = \hat{B} \lambda f = \lambda \hat{B} f = \lambda \mu f$. Therefore, subtracting the two equations $(\hat{A} \hat{B} - \hat{B} \hat{A})f = 0$ and so $[\hat{A}, \hat{B}]f = 0$ for any function $f$. Therefore, $[\hat{A}, \hat{B}] = 0$ if they share any eigenfunction.

*EDIT* I see that I actually haven't avoided the case of degeneracy here.

Last edited:

Mentor

#### Avodyne

Science Advisor
Proof: If operators $\hat{A}$ and $\hat{B}$ share an eigenfunction then, $\hat{A} f = \lambda f$ and $\hat{B} f = \mu f$

From this $\hat{A} \hat{B} f = \hat{A} \mu f = \mu \hat{A} f = \mu \lambda f$ and $\hat{B} \hat{A} f = \hat{B} \lambda f = \lambda \hat{B} f = \lambda \mu f$. Therefore, subtracting the two equations $(\hat{A} \hat{B} - \hat{B} \hat{A})f = 0$ and so $[\hat{A}, \hat{B}]f = 0$ for any function $f$. Therefore, $[\hat{A}, \hat{B}] = 0$ if they share any eigenfunction.
This is wrong; you started out with $f$ being an eigenfunction of both $A$ and $B$, but then later assumed $f$ was any function.

Noncommuting operators can share eigenstates. Here is a simple example for a 3-dimensional Hilbert space:
$$A=\pmatrix{ 1 & 0 & 0 \cr 0 & -1 & 0 \cr 0 & 0 & 1}, \quad B=\pmatrix{ 0 & 1 & 0 \cr 1 & 0 & 0 \cr 0 & 0 & 1}.$$ $A$ and $B$ do not commute, but have a common eigenvector $\pmatrix{0 \cr 0 \cr 1}$.

An infinite-dimensional example: consider a harmonic oscillator. Then the ground state $|0\rangle$ is an eigenstate of both $a$ and $a^\dagger a$ with eigenvalue zero, but $a$ and $a^\dagger a$ do not commute. The other eigenstates of these two operators are all different.

#### kmm

Gold Member
Like I said look into resolutions of the identity.

At a rigorous level check out Von-Neumans classic:
https://www.amazon.com/dp/0691028931/?tag=pfamazon01-20
Thanks, I took a look at it and I think I'm going to get that.

This is wrong; you started out with $f$ being an eigenfunction of both $A$ and $B$, but then later assumed $f$ was any function.

Noncommuting operators can share eigenstates. Here is a simple example for a 3-dimensional Hilbert space:
$$A=\pmatrix{ 1 & 0 & 0 \cr 0 & -1 & 0 \cr 0 & 0 & 1}, \quad B=\pmatrix{ 0 & 1 & 0 \cr 1 & 0 & 0 \cr 0 & 0 & 1}.$$ $A$ and $B$ do not commute, but have a common eigenvector $\pmatrix{0 \cr 0 \cr 1}$.

An infinite-dimensional example: consider a harmonic oscillator. Then the ground state $|0\rangle$ is an eigenstate of both $a$ and $a^\dagger a$ with eigenvalue zero, but $a$ and $a^\dagger a$ do not commute. The other eigenstates of these two operators are all different.
OK. So this seems like it would create the possibility in quantum mechanics to make a simultaneous measurement of noncommuting operators. That is, I could make a measurement and find the same eigenstate for both observables. I don't think this is actually right, I'm not sure why though. What's also confusing is that I thought it is fundamental to quantum mechanics that the generalized uncertainty principle is a consequence of the fact that two non commuting operators can't share simultaneous eigenfunctions.

Last edited by a moderator:

#### kith

Science Advisor
The crucial thing is that what you call the generalized uncertainty principle depends on the state.

For commuting observables, the right hand side is zero for all states, so uncertainties don't play a role at all.

For canonical conjugate observables, the commutator is equal to iħ1. Since every state is an eigenstate of the identity operator 1, the right hand side is independent of the state. So uncertainties are non-zero for all states and no state is a common eigenstate of both observables (can you prove this?).

Cases where [A,B] is nonzero and not proportional to the identity operator are kind of intermediate to the cases above. If you find a common eigenstate of A and B, <[A,B]> is zero for this state and the uncertainties can be zero. But there are also states which aren't common eigenstates and for which <[A,B]> takes a non-zero value, which restricts the uncertainties.

Last edited:

#### Avodyne

Science Advisor
I agree with kith.

And you can make simultaneous measurements of noncommuting observables if you find, as a result of the first measurement, that the system is in one of the common eigenstates. Let me modify my 3-state example to make this clearer:
$$A=\pmatrix{ 1 & 0 & 0 \cr 0 & -1 & 0 \cr 0 & 0 & 2}, \quad B=\pmatrix{ 0 & 1 & 0 \cr 1 & 0 & 0 \cr 0 & 0 & 3}.$$
Suppose you measure $A$ and get the result 2. Now if you measure $B$, you are guaranteed to get the result 3.

#### kmm

Gold Member
And you can make simultaneous measurements of noncommuting observables if you find, as a result of the first measurement, that the system is in one of the common eigenstates. Let me modify my 3-state example to make this clearer:
$$A=\pmatrix{ 1 & 0 & 0 \cr 0 & -1 & 0 \cr 0 & 0 & 2}, \quad B=\pmatrix{ 0 & 1 & 0 \cr 1 & 0 & 0 \cr 0 & 0 & 3}.$$
Suppose you measure $A$ and get the result 2. Now if you measure $B$, you are guaranteed to get the result 3.
OK I see. I haven't been able to study entanglement much yet, but would this be an example of that?

The crucial thing is that what you call the generalized uncertainty principle depends on the state.

For commuting observables, the right hand side is zero for all states, so uncertainties don't play a role at all.

For canonical conjugate observables, the commutator is equal to iħ1. Since every state is an eigenstate of the identity operator 1, the right hand side is independent of the state. So uncertainties are non-zero for all states and no state is a common eigenstate of both observables (can you prove this?).

Cases where [A,B] is nonzero and not proportional to the identity operator are kind of intermediate to the cases above. If you find a common eigenstate of A and B, <[A,B]> is zero for this state and the uncertainties can be zero. But there are also states which aren't common eigenstates and for which <[A,B]> takes a non-zero value, which restricts the uncertainties.
Thanks, this makes things a bit clearer for me. What's of interest to me now is to prove that no state is a common eigenstate for canonical conjugate observables. I'm going to have to think about this more because I'm not really sure where to begin at the moment. Qualitatively, for the position and momentum observables, I can see they have no common eigenstates since the eigenstates of position are delta functions while the eigenstates of momentum are sinusoidal waves.

#### kith

Science Advisor
What's of interest to me now is to prove that no state is a common eigenstate for canonical conjugate observables.
Assume you have a common eigenstate and act on it with the commutator.

I forgot to mention that the concept of eigenstates of canonical conjugate observables should be taken with a big grain of salt. These states aren't square-integrable (think of a plane wave), so they don't lie in the usual Hilbert space. They should be thought of as idealized states which can only be resembled approximately by physical states.

#### Avodyne

Science Advisor
OK I see. I haven't been able to study entanglement much yet, but would this be an example of that?
No, this has nothing to do with entanglement.

#### kmm

Gold Member
Assume you have a common eigenstate and act on it with the commutator.
Well if I do that for the position and momentum operator and assume they both share an eigenfunction, then for the position operator $xf = af$ and for the momentum operator $\frac{ \hbar}{i} \frac{d}{dx} f = bf$

If I have the commutator act on the eigenfunction then: $[x, \frac{ \hbar}{i} \frac{d}{dx}] f = x \frac{ \hbar}{i} \frac{d}{dx} f - \frac{ \hbar}{i} \frac{d}{dx} x f = xbf - \frac{ \hbar}{i} \frac{d}{dx} a f = baf - abf = 0$

However, if I work it out directly then: $[x, \frac{ \hbar}{i} \frac{d}{dx}] f = x \frac{ \hbar}{i} \frac{d}{dx} f - \frac{ \hbar}{i} \frac{d}{dx} x f = x \frac{ \hbar}{i} \frac{d}{dx} f - \frac{ \hbar}{i} (f + x\frac{df}{dx}) = i \hbar f$

But if $i \hbar f = 0$ then the eigenfunction must be zero, but this doesn't count since then every number would be an eigenvalue. Therefore, they don't share any eigenfunctions.

I'm not sure how to come up with something more general though. I attempted something general in post #3, but apparently that is wrong.

#### kith

Science Advisor
Proof: If operators $\hat{A}$ and $\hat{B}$ share an eigenfunction then, $\hat{A} f = \lambda f$ and $\hat{B} f = \mu f$

From this $\hat{A} \hat{B} f = \hat{A} \mu f = \mu \hat{A} f = \mu \lambda f$ and $\hat{B} \hat{A} f = \hat{B} \lambda f = \lambda \hat{B} f = \lambda \mu f$. Therefore, subtracting the two equations $(\hat{A} \hat{B} - \hat{B} \hat{A})f = 0$ and so $[\hat{A}, \hat{B}]f = 0$ for any function $f$.
Why do you think this is true for any function? After all, you have used the property of f being an eigenfunction of A and B in order to derive it.

Your result [A,B]f = 0f for a certain f is already sufficient to establish that A and B aren't canonical conjugate observables. If they were you could substitute [A,B] = iħ1 but iħf ≠ 0f.

#### kmm

Gold Member
Why do you think this is true for any function? After all, you have used the property of f being an eigenfunction of A and B in order to derive it.
My thought was that f is any eigenfunction, not just any function. I shouldn't have worded it that way. A and B share the same eigenfunction f, whatever it is(this is why I thought f could be any eigenfunction), with their respective eigenvalues. Under that assumption I came to the conclusion that the commutator acting on f is zero and since f can't be zero the commutator must be. So I took this to mean that commutators that equaled zero involve operators that share any eigenfunction.

Your result [A,B]f = 0f for a certain f is already sufficient to establish that A and B aren't canonical conjugate observables. If they were you could substitute [A,B] = iħ1 but iħf ≠ 0f.
I actually didn't understand that in general canonical conjugate observables equaled iħ1. When you refer to "a certain f" I assume you mean that we could have degeneracy. Is that correct?

#### kith

Science Advisor
My thought was that f is any eigenfunction, not just any function. I shouldn't have worded it that way. A and B share the same eigenfunction f, whatever it is(this is why I thought f could be any eigenfunction), with their respective eigenvalues. Under that assumption I came to the conclusion that the commutator acting on f is zero and since f can't be zero the commutator must be.
That's not correct. For example if the commutator of two observables on a two-dimensional system equals $$\pmatrix{ 1 & 0 \cr 0 & 0}$$ it has one eigenvector with eigenvalue 0 and one with eigenvalue 1.

I actually didn't understand that in general canonical conjugate observables equaled iħ1. When you refer to "a certain f" I assume you mean that we could have degeneracy. Is that correct?
No. With "a certain f" I meant the specific f you assumed to be a common eigenstate of the observables in your proof.

What definition of canonical conjugate do you use?

#### kmm

Gold Member
That's not correct. For example if the commutator of two observables on a two-dimensional system equals $$\pmatrix{ 1 & 0 \cr 0 & 0}$$ it has one eigenvector with eigenvalue 0 and one with eigenvalue 1.
I see, so that shows that a commutator of two observables acting on a common eigenvector can equal something nonzero which disproves my "proof".

No. With "a certain f" I meant the specific f you assumed to be a common eigenstate of the observables in your proof.

What definition of canonical conjugate do you use?
I actually didn't have a specific definition. As an example, I thought that $\frac{ \hbar}{i} \frac{d}{dx}$ was the canonical conjugate of momentum. I thought this was the sense in which you were referring to it. I assume there is actually a more specific definition?

#### kith

Science Advisor
I actually didn't have a specific definition. As an example, I thought that $\frac{ \hbar}{i} \frac{d}{dx}$ was the canonical conjugate of momentum. I thought this was the sense in which you were referring to it. I assume there is actually a more specific definition?
See canonical commutation relation.

#### kmm

Gold Member
OK, so I see that in general the canonical commutation relation says that $[x_{i}, p_{j}] = i \hbar \delta_{i,j}$. So it seems to me that what I showed in post #12 should be true. If there is such an eigenfunction that is simultaneous to x and p, then the commutator acting on that function would be zero. Since we don't get zero, they must not share simultaneous eigenfunctions.

#### kith

Science Advisor
Yes. But instead of using the specific form of P in the second line of your proof, you could have substituted iħ1 for the commutator directly. This way, the prove gets more general because it assumes only the canonical commutation relation and applies to all conjugate variables (see the section 'Generalization' in the wikipedia article above) and not only X and P.

#### Demystifier

Science Advisor
2018 Award
OK. So this seems like it would create the possibility in quantum mechanics to make a simultaneous measurement of noncommuting operators. That is, I could make a measurement and find the same eigenstate for both observables. I don't think this is actually right, I'm not sure why though. What's also confusing is that I thought it is fundamental to quantum mechanics that the generalized uncertainty principle is a consequence of the fact that two non commuting operators can't share simultaneous eigenfunctions.
Yes, there can be common eigenstates of two non-commuting operators. But these eigenstates span a subspace on which the two operators actually commute. So yes, if the result of measurement belongs to this subspace, then you can make a simultaneous measurement of the two operators, but in that case the operators on the relevant subspace commute.

#### kmm

Gold Member
kith, After thinking about it, I'm not seeing how the example you gave in post #15 shows that what I said in post #14 is wrong. You showed a commutator and it has two eigenvalues and two corresponding eigenvectors. But that doesn't show that the eigenvectors were common to the two observables that make the commutator. So whats confused me is I'm having a hard time seeing a fundamental difference in what I did in my proof in post #3, and the first line of my proof in post #12.

Yes, there can be common eigenstates of two non-commuting operators. But these eigenstates span a subspace on which the two operators actually commute. So yes, if the result of measurement belongs to this subspace, then you can make a simultaneous measurement of the two operators, but in that case the operators on the relevant subspace commute.
That definitely makes things clearer for me.

#### kith

Science Advisor
kith, After thinking about it, I'm not seeing how the example you gave in post #15 shows that what I said in post #14 is wrong. You showed a commutator and it has two eigenvalues and two corresponding eigenvectors. But that doesn't show that the eigenvectors were common to the two observables that make the commutator.
Yes, my bad. There is no example in two dimensions at all because we can't find a subspace where the observables don't commute if they share a common eigenstate. But Avodyne already has shown how it works in three-dimensions in post #5.

#### kmm

Gold Member
Yes, my bad. There is no example in two dimensions at all because we can't find a subspace where the observables don't commute if they share a common eigenstate. But Avodyne already has shown how it works in three-dimensions in post #5.
Oh that's right. Although, so that I can have a better understanding, I'm still not sure what the fundamental difference is between what I did in my proof in post #3, and the first line of my proof in post #12. Here's my reasoning; in post #3 I attempted to show that commutator of two operators acting on some common eigenfunction, will equal zero, but Avodyne showed that isn't always true. In post #12 I attempted to show that if two canonical conjugate observables share an eigenfunction, then their commutator acting on that eigenfunction would equal zero. Since the commutator acting on an eigenfunction cannot equal zero there can't be a shared eigenfunction. But how do we know that we don't have the same situation that Avodyne showed where the two observables share an eigenfunction and the commutator acting on the eigenfunction gave a nonzero result? I'm not sure what I'm missing in my reasoning here. If all I had was my second proof then I think I wouldn't have a problem, what's thrown me off is Avodyne's example.

#### kith

Science Advisor
In post #12 I attempted to show that if two canonical conjugate observables share an eigenfunction, then their commutator acting on that eigenfunction would equal zero. Since the commutator acting on an eigenfunction cannot equal zero there can't be a shared eigenfunction. But how do we know that we don't have the same situation that Avodyne showed where the two observables share an eigenfunction and the commutator acting on the eigenfunction gave a nonzero result?
The commutator acting on the state doesn't give a non-zero result. Avodyne showed that while your derivation of [A,B]f = 0f was correct, this doesn't imply that [A,B] itself is the zero operator. What you have actually shown is that if f is a common eigenstate of A and B it has to be an eigenstate of the commutator with eigenvalue 0.

If you take canonical conjugate observables for A and B you get a contradiction. The commutator of canonical conjugate observables is iħ1. If there was a common eigenstate, the condition above would require the corresponding eigenvalue of the commutator to be zero. But since iħ is the eigenvalue of iħ1 for all states, zero isn't an eigenvalue. Therefore the observables can't have a common eigenstate.

I'm not really sure where exactly your problem lies.

kmm

#### kmm

Gold Member
The commutator acting on the state doesn't give a non-zero result. Avodyne showed that while your derivation of [A,B]f = 0f was correct, this doesn't imply that [A,B] itself is the zero operator. What you have actually shown is that if f is a common eigenstate of A and B it has to be an eigenstate of the commutator with eigenvalue 0.

If you take canonical conjugate observables for A and B you get a contradiction. The commutator of canonical conjugate observables is iħ1. If there was a common eigenstate, the condition above would require the corresponding eigenvalue of the commutator to be zero. But since iħ is the eigenvalue of iħ1 for all states, zero isn't an eigenvalue. Therefore the observables can't have a common eigenstate.
That cleared it up for me. The problem for me was that I thought, Avodyne was showing that my derivation of [A,B]f = 0f was incorrect. I thought you were saying it's incorrect also and so that meant I couldn't apply it at all. For whatever reason I didn't catch that, but I see that you both were just disagreeing with my conclusion about it. My confusion also stemmed from misunderstanding my own conclusion about that derivation. That is, I couldn't see how my derivation didn't imply that [A,B] was the zero operator. As you worded it, I can see clearly now what that derivation showed. I see that the example Avodyne gave has a common eigenstate which is also an eigenstate of the commutator with eigenvalue zero, which agrees with my derivation.

Applying that to my derivation in post 12, I see now that since we can't have an eigenvalue of zero for the commutator, there must not be a common eigenstate for the observables. Thanks a lot for helping me out with that.

### Physics Forums Values

We Value Quality
• Topics based on mainstream science
• Proper English grammar and spelling
We Value Civility
• Positive and compassionate attitudes
• Patience while debating
We Value Productivity
• Disciplined to remain on-topic
• Recognition of own weaknesses
• Solo and co-op problem solving