Understanding Griffiths' Perturbation Theory in Quantum Mechanics

cepheid
Staff Emeritus
Science Advisor
Gold Member
Messages
5,197
Reaction score
38
I'm looking at the beginning of of Chapter 6 of the 2nd edition of Griffiths Introduction to Quantum Mechanics.

He starts out by writing the hamiltonian for a system we'd like to solve as the sum of a hamiltonian with a known solution and a small perturbation:

H^0 + \lambda H^\prime

He says that lambda is meant to be a small number for now, but later it will be set to 1, and when it is, that will be the true Hamiltonian. My first question is huh? Why? Then later he says that the eigenfunctions and eigenvalues can be written as a power series in lambda. E.g.:

\psi_n = \psi_n^0 + \lambda \psi_n^1 + \lambda^2 \psi_n^2 + \cdotsAgain, huh? How can can just write this down? How does he know that \psi_n can be written in this form? Why does the lambda from the perturbation appear in the corrections to the wavefunction and allowed energies in this way? It just seems to come from nowhere. Am I missing something?

Also, what is meant by "first order correction" and "second order correction?" First order in what? The above equation would seem to indicate that the answer is "in lambda." But later on, he claims that lambda is just a "device for keeping track of the orders" and sets it to 1, effectively eliminating it from the equation. So what is it intrinsically about \psi_n^1 that makes it "first order" for example?

I'm sorry for being such a dunce, but I am having a hard time understanding where he's coming from.
 
Physics news on Phys.org
What he means by "lambda is meant to be a small number for now, but later it will be set to 0", is that he has not said anything about the H' in comparison with H^0. (if I remember the text of griffiths) In order to get perturbation to "work" it must be weak perturbation, so lamda is a small number. Later when we do examples, H' is given (often) and one sees that it is weak in comparison with H^0. When lamda is = 1, then the perturbation is "on" 100%.

So when does derivations, one often keeps the lamda to track the order of expansion, how many times the perturbation enters.

I don't remember, but I don't think it is a derviation or proof that you can write the solution to the perturbed hamiltoian as \psi_n = \psi_n^0 + \lambda \psi_n^1 + \lambda^2 \psi_n^2 + \cdots
And for an introductory course, you don't have to know "why". You can look up more detalied derivations in more advanced textbooks if you like (like Sakurai: modern quantum mechanics).

The "order of correction" is just how careful one have to be, compare with taylor series. \psi_n^0 is the wavefunction satisfying the eigenvalue equation for H^0.

So if you like, get Sakurai and look in chapter 5. I first started with Gritthis to learn how to use this machinery. Then when taking advanced courses in QM, using sakurai, I learned WHY.
 
Last edited:
But i can at least write this down for you:

The solution to the full hamiltonian H = H' + H^0 is |n >, the solution to H^0 is |n^0>

|n > = |n^0> + \dfrac{\phi _n}{E^0_n-H^0} (\lambda H' - \Delta _n ) |n> (takes a couple of pages to get this eq.)

Where \Delta _n = E_n - E_^0
and \phi _n = 1 - |n^0 ><n^0 |

(btw, I hope you know bra-ket notation: <x|n> = psi_n(x))

So you see that the solution is both on the LHS and RHS. So what you do it to stick the LHS into the |n> on the RHS etc. etc. etc. So now you see why the lambda was quite important.

Then you might argue: "But then the soulution is not known exactly" You are right, that's why this only works for weak perturbations, so you can truncate the series somewhere (usally around the 1st or 2nd order)
 
Last edited:
firstly, I never studied Griffiths in details but learned my time-independent perturbation theory via Merzbacher. But still, Griffith's book are usually great


cepheid said:
He starts out by writing the hamiltonian for a system we'd like to solve as the sum of a hamiltonian with a known solution and a small perturbation:

H^0 + \lambda H^\prime

He says that lambda is meant to be a small number for now, but later it will be set to 1, and when it is, that will be the true Hamiltonian. My first question is huh? Why?

why? isn't 1 a small number? setting it to one simply gives you the H you want (which is supposedly some perturbed version of H^0.) actually it doesn't matter.. we can assume it to be anything from 0 to 1 (0 being no perturbation and 1 being full on) if you want you choose use any real number, it doesn't matter. H' is the perturbation not \lambda


Then later he says that the eigenfunctions and eigenvalues can be written as a power series in lambda. E.g.:

\psi_n = \psi_n^0 + \lambda \psi_n^1 + \lambda^2 \psi_n^2 + \cdots


Again, huh? How can can just write this down? How does he know that \psi_n can be written in this form? Why does the lambda from the perturbation appear in the corrections to the wavefunction and allowed energies in this way? It just seems to come from nowhere. Am I missing something?

the ability to expand your energy level and function psi as such is a fundamental assumption of perturbation theory. Note: the expansion (into powers of \lambda) of psi is not unique. But all physically observable quantities like energy levels calculated from your favorite expansion of psi would have unique expansions in \lambda. Remember you always have to go back to the eigenvalue equation
H^0\Psi_n^{(0)} = E^{(0)}_n \Psi^{(0)}_n
when working these things out

Also, what is meant by "first order correction" and "second order correction?" First order in what? The above equation would seem to indicate that the answer is "in lambda." But later on, he claims that lambda is just a "device for keeping track of the orders" and sets it to 1, effectively eliminating it from the equation. So what is it intrinsically about \psi_n^1 that makes it "first order" for example?

I think malawi_glenn may have explained this, but let me say something anyway

Ok, firstly, I think we agree that \lambda = 1 is when you recover the actual H you want (ie. the one that is close to H^0 but not quite). The key here is that in perturbation theory we assume that if E = E0 + E' then E' should be small when we turn on H'. Turning on H' fully means setting \lambda to 1. Now it is convenient to write \lambda rather than 1 because 1^2=1 etc. and makes it hard to keep track of things.

so what is intrinsic about \psi_n^1 that makes it "first order" is that we have defined it so. Through the assumption of perturbation theory we can say we can approximate our function with
f = f0 + f1 + f2 + ...

it is a bit like say y = x + k a
y^2 = x^2 + 2xa\; k + O(k^2)

when the perturbation a is small, we can approximate the function y^2 by just
x^2 (unperturbed)
x^2 + 2xa (first order, 2xa is your \psi_n^1 )

if you set k=1 at the start it is harder to keep track of which is which later on
or course this is an oversimplified example, but I am sure you 've got it.

HTH
 
Unfortunately I have not had a chance to read David Griffith's book, nor have I really studied QM yet, but I think I've encountered this type of perturbation before. I agree that it's a weird way of presenting things, and my professor did not give many hints. I apologize if any of what I say is obvious or tangential.

When the system is perturbed, the value of \psi_n also changes. The perturbation equation

\psi_n = \psi_n^0 + \lambda \psi_n^1 + \lambda^2 \psi_n^2 + \cdots

is simply a way of representing \psi_n in terms of the degree to which the system has been perturbed, \lambda. The quantity \psi_n^0 represents the unperturbed value of \psi_n, i.e. the old value. Thus \lambda \psi_n^1 is the "first order correction" to the unperturbed \psi_n^0 value. The higher-order corrections give smaller values, as \lambda &lt; 1. When the system is fully perturbed, \lambda = 1, so \psi_n = \psi_n^0 + \psi_n^1 + \psi_n^2 + \cdots.

I'm not sure what you want to solve for in the end, but if you wanted to solve for the values of \psi_n^1, \psi_n^2, etc., it would be difficult to do so from this equation alone. But setting \lambda &lt; 1 allows you to "keep track of the orders," in the sense that by taking derivatives of the perturbation equation with respect to \lambda should allow you to solve for the correction factors. The variable \lambda thus gives you something to work with.

If you would like, I can ask David next week for an explanation. His answer would probably be much nicer than mine.
 
clathrate said:
Unfortunately I have not had a chance to read David Griffith's book, nor have I really studied QM yet, but I think I've encountered this type of perturbation before. I agree that it's a weird way of presenting things, and my professor did not give many hints. I apologize if any of what I say is obvious or tangential.

When the system is perturbed, the value of \psi_n also changes. The perturbation equation

\psi_n = \psi_n^0 + \lambda \psi_n^1 + \lambda^2 \psi_n^2 + \cdots

is simply a way of representing \psi_n in terms of the degree to which the system has been perturbed, \lambda. The quantity \psi_n^0 represents the unperturbed value of \psi_n, i.e. the old value. Thus \lambda \psi_n^1 is the "first order correction" to the unperturbed \psi_n^0 value. The higher-order corrections give smaller values, as \lambda &lt; 1. When the system is fully perturbed, \lambda = 1, so \psi_n = \psi_n^0 + \psi_n^1 + \psi_n^2 + \cdots.

I'm not sure what you want to solve for in the end, but if you wanted to solve for the values of \psi_n^1, \psi_n^2, etc., it would be difficult to do so from this equation alone. But setting \lambda &lt; 1 allows you to "keep track of the orders," in the sense that by taking derivatives of the perturbation equation with respect to \lambda should allow you to solve for the correction factors. The variable \lambda thus gives you something to work with.

If you would like, I can ask David next week for an explanation. His answer would probably be much nicer than mine.


But the problem the OP also have is WHY the solution to the full hamiltonian looks as it does, as a power series expansion, you must study perturbation theory on a higher level than griffiths. In post #3 I wrote down how the solution looks, and that sheds some light on "why" you expand the solution as you does. But as I wrote, how you get to that solution requires some work, and is often written in more advanced books in QM.
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
1K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 18 ·
Replies
18
Views
4K