Non-degenerate and degenerate perturbation theory

In summary, the conversation discusses the use of perturbation theory in a system of a rigid rotator with a uniform E-field directed along the z-axis. The book suggests using non-degenerate perturbation theory to calculate the perturbed energy and wavefunction, which raises the question of why this approach is valid. It is explained that the original and perturbed Hamiltonians commute with Lz, but this does not necessarily mean that the first term of the perturbed wavefunction must be the original unperturbed eigenfunction. The reason for using non-degenerate perturbation theory is due to the fact that all eigenstates of m experience the same shift in energy to first order.
  • #1
Lucien1011
24
0
Consider a system of a rigid rotator together with a uniform E-field directing along z-axis. So to calculate the perturbed energy and wavefunction we have to use perturbation theory. But the book said we can use non-degenerate one to calculate the result. I wonder why. It is because the original hamitonian and the perturbed one commute with Lz so that we are sure that the first term of the perturbed wavefunction must be the original unperturbed eigenfunction? Does it mean that there are no more physical quantities except Lz which could be certain if we are certain of its energy?
Also, why do we use a linear combination of the degenerate eigenfunction for the first term of the perturbed wavefunction? I think it is becos we are not sure of how other physical quantities behave under the perturbed situation. What do you guys think?:smile:
 
Physics news on Phys.org
  • #2
Sorry lucien, your post is a bit hard to follow. Can you give us a better description of your rotator- perhaps a mathematical statement?
 
  • #3
muppet said:
Sorry lucien, your post is a bit hard to follow. Can you give us a better description of your rotator- perhaps a mathematical statement?
Sorry for poor presentation
The hamitonian of the system is [tex]\frac{L^2}{2m} - kBz[/tex]

and the unperturbed eigenfunction is [tex]Y_{l,m}(\theta,\phi)[/tex]

As you know, the unperturbed eigenfunction is degenerate with respect to m (where m =-l,-l+1,...,l ). But the book said we can use non-degenerate perturbation theory to handle this case. I wonder why.

And I have to make it clear that the second paragraph is an independent question and not related to the rotator system. It is just my thought.
 
Last edited:
  • #4
Thanks for the clarification. I think you're right that the relevant point is that the commutation relations associated with Lz, although in general "A commutes with B" and "B commutes with C" does not mean that "A commutes with C", so you have to be careful about how you structure the argument.
Physically, I don't think it's quite as strange as you first think. The case m= +/-l means that the motion is in a plane parallel to the xy plane, so z is constant; as the modulus of m gets smaller this plane is rotated, but the expectation- the average- value of z won't change, and the 1st order correction to the energy is directly proportional to the expectation value of z. It's not like the zeeman effect in atoms, where the fact that electrons are charged leads to additional interaction between their magnetic moment and the applied field that is dependent upon m.

As for your second comment, I think I understand you now. The whole point of perturbation theory is that you assume that your system is "close to" one that you can solve exactly- if it's not, then the whole thing blows up in your face. In the derivation of these results it's this closeness that you use to simplify expressions. Perhaps it's the presentation in my lecture notes, but the whole thing really seems like a natural, sensible way to proceed.
 
  • #5
muppet said:
Thanks for the clarification. I think you're right that the relevant point is that the commutation relations associated with Lz, although in general "A commutes with B" and "B commutes with C" does not mean that "A commutes with C", so you have to be careful about how you structure the argument.
Physically, I don't think it's quite as strange as you first think. The case m= +/-l means that the motion is in a plane parallel to the xy plane, so z is constant; as the modulus of m gets smaller this plane is rotated, but the expectation- the average- value of z won't change, and the 1st order correction to the energy is directly proportional to the expectation value of z. It's not like the zeeman effect in atoms, where the fact that electrons are charged leads to additional interaction between their magnetic moment and the applied field that is dependent upon m.

As for your second comment, I think I understand you now. The whole point of perturbation theory is that you assume that your system is "close to" one that you can solve exactly- if it's not, then the whole thing blows up in your face. In the derivation of these results it's this closeness that you use to simplify expressions. Perhaps it's the presentation in my lecture notes, but the whole thing really seems like a natural, sensible way to proceed.
I can't quite understand the first paragraph. Can you explain it more?

Yeah my thought is kind of related to the closeness you mentioned. Thanks for reminding me this. I feel I understand it more now.
 
  • #6
I should really have put my first paragraph as two separate points, sorry.

Lucien1011 said:
... I wonder why. It is because the original hamitonian and the perturbed one commute with Lz so that we are sure that the first term of the perturbed wavefunction must be the original unperturbed eigenfunction?
The first point- you have to be careful about what conclusions you draw from the commutation relations. For example, Lz commutes with L^2 and L^2 commutes with Lx, but Lx and Lz do not commute. The fact that old and new hamiltonians commute with Lz means that old and new energy eigenstates are both eigenstates of Lz, but it doesn't mean that they are the same eigenstates of Lz. And even if they were, it wouldn't necessarily follow that an eigenstate of Lz had the same energy eigenvalues under the two hamiltonians.
In this example, we are sure that the first term of the perturbed wavefunction is the unperturbed one, but not because of the commutation relations- it's because that's the way perturbation theory works! Once you start thinking about the corrections to the unperturbed states, however, you're in dangerous territory -they're neither a basis set nor orthogonal.

The second point is the actual reason we can use non-degenerate perturbation theory, which is because all of the eigenstates of m experience the same shift in their energies
to first order. The simplest way of seeing this is to examine the 1st order correction:
[tex]<\psi|H'|\psi>=-kB<\psi|z|\psi>[/tex]
-it's completely independent of the angular momentum about the z axis. Here you can say that the commutator of z, Lz is 0 in order to support this statement.

However, I don't see how you could possibly compute the new state without using degenerate perturbation theory- does the book do actually do this, or does it just compute the correction to the energy?

One final aside:
Does it mean that there are no more physical quantities except Lz which could be certain if we are certain of its energy?
Not quite- we've taken the radius to be fixed, I think?
 
  • #7
Yeah I know if A,B commute and B,C commute doesn't mean A,C commute.
Actually I just think of a good way to present my thought. Can you comment on this?

So both unperturbed hamilotonian and perturbed one commute with Lz. Therefore, under both unperturbed and perturbed situations the energy and the angular momentum about z axis can be determined simultaneously.
Let the perturbed wavefunction be
[tex]\phi^{(0)}+\lambda\phi^{(1)}+\lambda^{2}\phi^{(2)}+...[/tex]
and this wavefunction corresponds to mh as its angular momentum about z axis
So when we substitute [tex]\lambda=0[/tex] into the equation, the resulting wavefunction, which is the unperturbed wavefunction, also correspond to mh as its angular momentum about z axis. So the first term of the series is exactly [tex]Y_{m,l}(\theta,\phi)[/tex]
 
  • #8
As I mentioned above, this is true but it's not interesting- as it's just because you're doing perturbation theory. The first term in your expansion is ALWAYS the unperturbed term; what you're interested in is the 1st order correction(s) to this term, i.e. [tex]\lambda=1[/tex].

That the 1st order correction keeps the degeneracy is because of the specific form of this perturbation, and isn't generally true. If your rotator was charged, for example, then you'd have an additional term as a result of its magnetic moment interacting with the B-field, and hence the different values of m would no longer be degenerate.
 
  • #9
Do you mean that since the perturbed wavefunction to the first order of [tex]\lambda[/tex] is degenerate to m so that non-degenerate wavefunction can be applied? Why is it true? Can you explain a bit more? Thank you:smile:
 
  • #10
I'm not convinced that you actually can use non-degenerate perturbation theory to compute the new wavefunction. What I've tried to explain above is why you can use non-degenerate perturbation theory to compute the corrections to the energy. Can you give us a reference so I can have a look at this book please?
 
  • #11
Oh you are only talking about the energy shift, I am so careless to take this important point. Sorry for that. I understand all you said now.
Well my book is Quantum Mechanics by B.H.Bransden and C.J.Joachain(Second Edition).
That part is CH12.1 "The Stark effect and the rigid rotator".
Sorry I try to find you a ebook for quick look but fail.

The book only calculates the first and second order correction of energy by non-degenerate perturbation theory. But it said," Since H0 and H commute with Lz, non-degenerate perturbation theory is applicable."
 
  • #12
Lucien1011 said:
Yeah I know if A,B commute and B,C commute doesn't mean A,C commute.
Actually I just think of a good way to present my thought. Can you comment on this?

So both unperturbed hamilotonian and perturbed one commute with Lz. Therefore, under both unperturbed and perturbed situations the energy and the angular momentum about z axis can be determined simultaneously.
Let the perturbed wavefunction be
[tex]\phi^{(0)}+\lambda\phi^{(1)}+\lambda^{2}\phi^{(2)}+...[/tex]
and this wavefunction corresponds to mh as its angular momentum about z axis
So when we substitute [tex]\lambda=0[/tex] into the equation, the resulting wavefunction, which is the unperturbed wavefunction, also correspond to mh as its angular momentum about z axis. So the first term of the series is exactly [tex]Y_{m,l}(\theta,\phi)[/tex]
Can I use these to say that we can use non-degenerate perturbation theory to calculate the new wavefunction
 
  • #13
As it happens I own the book! :smile:
I'll try and dig it out tomorrow, but I'm fairly certain you can't compute the correction to the actual wavefunction without using degenerate perturbation theory- the commutation relations wouldn't stop you dividing by zero!

Also, I'm off to Germany tomorrow evening, so if I don't reply for a few days to any further responses that's why.
 
  • #14
If you use the non-degenerate perturbation theory, it happens that the correction to wavefunction has undetermined coefficients. I guess maybe these coefficients are not physically important and hence can be arbitrarily chosen. What do you think?
 

1. What is the difference between non-degenerate and degenerate perturbation theory?

Non-degenerate perturbation theory is used to solve problems where the energy levels of the unperturbed system are not degenerate, meaning they are all unique. Degenerate perturbation theory, on the other hand, is used when the energy levels of the unperturbed system are degenerate, meaning they have the same energy value.

2. How does perturbation theory help in solving problems in quantum mechanics?

Perturbation theory is a mathematical approach used to solve problems in quantum mechanics where the exact solution is difficult to obtain. It allows us to approximate the solution by considering a small perturbation to an already known solution.

3. What are the limitations of perturbation theory?

Perturbation theory can only be applied to small perturbations, and the results obtained may not be accurate for large perturbations. It also assumes that the perturbation is time-independent and that the system is in a stable state.

4. How is perturbation theory used in other fields of science?

Perturbation theory is not limited to quantum mechanics and is also used in other fields of science, such as classical mechanics, electromagnetism, and statistical mechanics. It is a general mathematical approach that can be applied to various systems with small perturbations.

5. Can perturbation theory be used to calculate higher order corrections?

Yes, perturbation theory can be used to calculate higher order corrections, but the accuracy of the results decreases as the order of the perturbation increases. Higher order corrections are often neglected in perturbation theory to keep the calculations simple and accurate.

Similar threads

Replies
10
Views
1K
  • Quantum Physics
Replies
7
Views
2K
  • Quantum Physics
Replies
12
Views
2K
Replies
2
Views
1K
Replies
18
Views
2K
Replies
3
Views
1K
  • Quantum Physics
Replies
1
Views
1K
  • Quantum Physics
Replies
14
Views
1K
  • Advanced Physics Homework Help
Replies
1
Views
791
  • Quantum Physics
Replies
5
Views
1K
Back
Top