Problematic derivations of Noether's theorem

ismaili
Messages
150
Reaction score
0
I am confused by various derivations of the Noether current in various textbooks. However, they either contradict with each other or exist many flaws.
For example, originally I thought the best derivation is at the end of the book of classical mechanics by Goldstein. But I found that in the result eq(13.147) (3rd edition), only global symmetry can be used.

Even in Weinberg's section 7.2, the derivation of the energy-momentum tensor (spacetime translational symmetry), the derivation is quite unreasonable. Suddenly appearing of \epsilon^\mu\partial_\mu\mathcal{L} (this term is usually obtained from the deviation of the measure d^4x in other QFT texts).

Moreover, in Ryder's QFT book, his derivation seems to neglect the difference of two variations (even Weinberg made this mistake I thought):
\Delta\phi(x) \equiv \phi'(x') - \phi(x)
\delta\phi(x) \equiv \phi&#039;(x) - \phi(x)<br /> where \left[\Delta,\partial_\mu\right]\neq 0 while \left[\delta,\partial_\mu\right] = 0. <br /> <br /> The best derivation I have ever seen is Maggiore&#039;s. <br /> Is there anybody agrees with me? <br /> Or is there good and reasonable derivations? <br /> Thanks in advance.
 
Physics news on Phys.org
I really hate Goldstein's book, and the section about Noether's therorem is probably the part I hate the most. I tried asking about his proof in another thread. Link. Let me know if you find it useful.

The replies I got didn't really help me, but I was able to make some progress on my own. The most important posts in the thread are #3 (because it explains the notation) and #11.
 
I like Srednicki's derivation in his QFT book (available free online).
 
ismaili said:
I am confused by various derivations of the Noether current in various textbooks. However, they either contradict with each other or exist many flaws.
For example, originally I thought the best derivation is at the end of the book of classical mechanics by Goldstein. But I found that in the result eq(13.147) (3rd edition), only global symmetry can be used.

For local symmetries you need to look for the 2nd Noether theorem. The proof of the (1st) Noether theorem is given in post#12 of

www.physicsforums.com/showthread.php?t=172461

regards

sam
 
Last edited:
That's an awesome series of posts sam. Now I know where to look if I want to read up on the conformal group. :smile:

Regarding the derivation of Noether's theorem, it would be great if someone (anyone who's interested in an easy derivation) could take a look at what I did in the other thread (link in my previous post). I didn't get any feedback there. Post #3 is the derivation of the Euler-Lagrange equations. It's included only as an explanation of the notation. Post #11 is a derivation of the conserved currents from an invariance condition. (I omitted steps in both derivations, but I'll explain them if someone asks for it).

I really don't think there's an easier way to do these things than the way I'm doing it. It's almost too easy. I just hope that doesn't mean I've made a mistake.
 
Fredik, your derivation is essentially the same as the one in Srednicki's book.
 
samalkhaiat said:
For local symmetries you need to look for the 2nd Noether theorem. The proof of the (1st) Noether theorem is given in post#12 of

www.physicsforums.com/showthread.php?t=172461

regards

sam

Thanks, Sam. But I'm going to another school to attend a seminar later, so I haven't read your articles. I will study it later, because I would like to understand somewhat conformal field theory too.

I glanced over Srednicki's derivation for Noether current just now. But I still felt uncomfortable for the following reasons. (1) He discussed only the case of internal symmetry. (2) When he dealt with the spacetime translational symmetry, he claimed that the deviation of the Lagrangian density is a total derivative. So, he added a total derivative term \partial_\nu (a^\nu\mathcal{L}) to \delta S, where S is the action.

Actually, my confusion is the situation where the symmetry is a combination of "internal symmetry" and "space-time symmetry." In the books of Maggiore and Goldstein, they said that any field is a scalar representation for the space-time translation, hence, \psi&#039;(x&#039;) = \psi(x) for any generic field \psi. In this way, the \mathcal{L} term in the Noether current J^\mu comes from the deviation of the measure (in Maggiore), or equivalently(?) comes from the slightly difference of the integration domain (in Goldstein).

By the way, I'm also wondering that if the following equation correct or not,
\delta S = \int\delta(d^4x)\mathcal{L} + \int d^4x (\delta\mathcal{L})
If this is correct, then it means even if the Lagrangian density is invariant under certain symmetry transformation, but the whole action may not be invariant under this transformation, hence the theory may not have this symmetry?

Sorry the time is hurry and my representation is not good, so my article is kinda messy. Thanks for your patience and discussion.
Sincerely
 
ismaili said:
(2) When he dealt with the spacetime translational symmetry, he claimed that the deviation of the Lagrangian density is a total derivative. So, he added a total derivative term \partial_\nu (a^\nu\mathcal{L}) to \delta S, where S is the action.
We can think of the Lagrangian either as a multi-variable polynomial

(\phi(x),\partial_\mu\phi(x))\mapsto\mathcal L(\phi(x),\partial_\mu\phi(x))

or a set of functionals (one for each x)

\phi\mapsto\mathcal L_x[\phi]=\mathcal L(\phi(x),\partial_\mu\phi(x))

I prefer to define a variation of the field as a function \epsilon\mapsto\phi_\epsilon where \phi_0=\phi minimizes the action, rather than as a "small" function that we add to the field.

I'm only mentioning these things to make sure that my notation isn't going to be confusing. I'll get to the point in my next post. I'll end this one here, because I'm getting "database error" a lot.
 
Last edited:
OK, here's the point: Suppose that we're considering the variation b\mapsto\phi_b, where \phi_b is defined by \phi_b(x)=\phi(x+b).

\mathcal L_x[\phi_b]=\mathcal L(\phi(x+b),\partial_\mu\phi(x+b))=\mathcal L_{x+b}[\phi] \approx\mathcal L_x[\phi]+b^\mu\partial_\mu\mathcal L_x[\phi]

In this post "\approx" means "equal to first order in the parameters".

Wow..."database error" for that. I can't preview this.
 
Last edited:
  • #10
ismaili said:
Actually, my confusion is the situation where the symmetry is a combination of "internal symmetry" and "space-time symmetry."
I didn't find it useful to distinguish between the two. See #4 in the thread I linked to above.

ismaili said:
By the way, I'm also wondering that if the following equation correct or not,
\delta S = \int\delta(d^4x)\mathcal{L} + \int d^4x (\delta\mathcal{L})
Definitely not if you define a variation the way I'm doing it. \delta S[/tex] would just be<br /> <br /> \epsilon^r\frac{d}{d\epsilon^r}\bigg|_0 S[\phi_\epsilon]<br /> <br /> I&#039;m not sure how it works if we define it using functional derivatives instead, and the textbook &quot;definition&quot; never made sense to me. (&quot;\delta S[/tex] is a small change in S caused by a small change in \phi&amp;quot;).&lt;br /&gt; &lt;br /&gt; It seems to me that you only have to worry about the d^4x if you&amp;#039;re trying to minimize the function&lt;br /&gt; &lt;br /&gt; (\phi,U)\mapsto\int_U d^4x \mathcal L_x[\phi]&lt;br /&gt; &lt;br /&gt; instead of&lt;br /&gt; &lt;br /&gt; \phi\mapsto\int_U d^4x \mathcal L_x[\phi]&lt;br /&gt; &lt;br /&gt; with a fixed U.
 
Last edited:
  • #11
Fredrik said:
I didn't find it useful to distinguish between the two. See #4 in the thread I linked to above.
Thanks, Fredrik!
But, sorry for that, sometimes whenever I was stuck on something, I have to take much time to get other people's point...

Actually, I thought this is the key point that why and where every textbook is not consistent with each other.

Due to the functional dependence, as long as we include the space-time symmetry, the situation becomes complicated. If there are internal symmetries only, then the derivation is clear, e.g. Weinberg's derivation in section 7.2.

If we consider the following symmetry transformation, which includes space-time symmetry,
x^\mu \rightarrow x&#039;^\mu = x^\mu + \delta x^\mu(x)\cdots(*-1)
\phi(x) \rightarrow \phi&#039;(x&#039;) = \phi(x) + \delta\phi(x)(*-2)
The symmetry of the theory means the action is invariant, i.e. the space-time integration of the Lagrangian, which depends on \phi&#039;(x&#039;)[/tex], over the new variables x&amp;#039;[/tex](hence the integration domain is changed, so we need Jacobian here, in general), specifically, &lt;br /&gt; \delta S = \int d^4x&amp;amp;#039;\mathcal{L}(\phi&amp;amp;#039;(x&amp;amp;#039;),\partial&amp;amp;#039;_\mu\phi&amp;amp;#039;(x&amp;amp;#039;)) - \int d^4x\mathcal{L}(\phi(x),\partial_\mu\phi(x)) = 0&lt;br /&gt; &lt;br /&gt; However, the problem I confused in Weinberg&amp;#039;s book is, for the derivation of space-time translational symmetry, he just considered the variation of the action as this, &lt;br /&gt; \delta S = \int d^4x\mathcal{L}(\phi(x&amp;amp;#039;),\partial_\mu\phi(x&amp;amp;#039;)) - \int d^4x\mathcal{L}(\phi(x),\partial_\mu\phi(x))&lt;br /&gt; Then, using the trick of releasing the global symmetry to be local, one can obtain the current.&lt;br /&gt; &lt;br /&gt; He didn&amp;#039;t consider the Jacobian of the measure. It seems that he considered the space-time translation equivalently as the transformation on the fields, &lt;br /&gt; \phi(x) \rightarrow \phi(x+\epsilon(x)) = \phi(x) + \epsilon^\mu\partial_\mu\phi(x) = \phi&amp;amp;#039;(x) &lt;br /&gt; &lt;br /&gt; hmm...after relaxed by having a cup of coffee, I seem to figure out where I got stuck. The key point is, &lt;i&gt;since the space-time coordinates are integrated over, the transformations eq(*-1,2) are equivalently equal to the following &lt;span style=&quot;font-size: 12px&quot;&gt;&lt;b&gt;internal&lt;/b&gt; transformation,&lt;/span&gt;&lt;/i&gt;&lt;span style=&quot;font-size: 12px&quot;&gt; &lt;br /&gt; x^\mu\rightarrow x^\mu&lt;br /&gt; \phi(x)\rightarrow \phi(x) + \delta\phi - \delta x^\mu\partial_\mu\phi(x)&lt;br /&gt; &lt;br /&gt; I think I have understood the derivation of energy-momentum tensor by Weinberg&amp;#039;s method. However, I will reread the articles by you guys to know the points you made. &lt;br /&gt; Cheers&lt;br /&gt; &lt;br /&gt; &lt;blockquote data-attributes=&quot;&quot; data-quote=&quot;Fredrik&quot; data-source=&quot;post: 1920227&quot; class=&quot;bbCodeBlock bbCodeBlock--expandable bbCodeBlock--quote js-expandWatch&quot;&gt; &lt;div class=&quot;bbCodeBlock-title&quot;&gt; Fredrik said: &lt;/div&gt; &lt;div class=&quot;bbCodeBlock-content&quot;&gt; &lt;div class=&quot;bbCodeBlock-expandContent js-expandContent &quot;&gt; Definitely not if you define a variation the way I&amp;#039;m doing it. \delta S[/tex] would just be&amp;lt;br /&amp;gt; &amp;lt;br /&amp;gt; \epsilon^r\frac{d}{d\epsilon^r}\bigg|_0 S[\phi_\epsilon]&amp;lt;br /&amp;gt; &amp;lt;br /&amp;gt; I&amp;amp;#039;m not sure how it works if we define it using functional derivatives instead, and the textbook &amp;amp;quot;definition&amp;amp;quot; never made sense to me. (&amp;amp;quot;\delta S[/tex] is a small change in S caused by a small change in \phi&amp;amp;amp;quot;).&amp;amp;lt;br /&amp;amp;gt; &amp;amp;lt;br /&amp;amp;gt; It seems to me that you only have to worry about the d^4x if you&amp;amp;amp;#039;re trying to minimize the function&amp;amp;lt;br /&amp;amp;gt; &amp;amp;lt;br /&amp;amp;gt; (\phi,U)\mapsto\int_U d^4x \mathcal L_x[\phi]&amp;amp;lt;br /&amp;amp;gt; &amp;amp;lt;br /&amp;amp;gt; instead of&amp;amp;lt;br /&amp;amp;gt; &amp;amp;lt;br /&amp;amp;gt; \phi\mapsto\int_U d^4x \mathcal L_x[\phi]&amp;amp;lt;br /&amp;amp;gt; &amp;amp;lt;br /&amp;amp;gt; with a fixed U. &lt;/div&gt; &lt;/div&gt; &lt;/blockquote&gt;&lt;/span&gt;
 
  • #12
ismaili said:
samalkhaiat said:
By the way, I'm also wondering that if the following equation correct or not,
\delta S = \int\delta(d^4x)\mathcal{L} + \int d^4x (\delta\mathcal{L})
If this is correct, then it means even if the Lagrangian density is invariant under certain symmetry transformation, but the whole action may not be invariant under this transformation, hence the theory may not have this symmetry?

The equation

<br /> \delta \mathcal{L} = \int \mathcal{L} \delta \left(d^{4}x \right) + \int d^{4}x \delta \mathcal{L} = \int d^{4}x \left( \delta^{*} \mathcal{L} + \partial_{a}( \delta x^{a} \mathcal{L}) \right),<br />

is always correct. This is because the local variation symbol

\delta (…) = \delta^{*} (…) + \delta x^{a}\partial_{a} (…)

is a derivation, i.e., it acts like a differential operator. So, a transformation is certainly an invariance (symmetry) transformation if

1) the Lagrangian itself is (form) invariant;

\delta^{*}\mathcal{L} = \mathcal{L}^{‘}(x) - \mathcal{L}(x) = 0

2) the functional form of the Lagrangian changes by a total derivative

\delta^{*}\mathcal{L} = \partial_{a} \Omega^{a}

Notice that in both cases, \mathcal{L} and \mathcal{L}^{‘} lead to the same equation of motion, and this what “symmetry” is all about.
Let me illustrate this. Consider the free particle Lagrangian

L = \frac{m}{2}\dot{x}^{2}

which gives the equation of motion

\ddot{x} = 0

Let us study the possible invariance of this system under a transformation of the form

x^{‘} = x + f, \ \ \dot{x}^{‘} + \dot{f}

where f may be some function of time. We construct the Lagrangian L’ for the new coordinate x’,

L^{‘}(x^{‘}) = \frac{m}{2} ( \dot{x}^{‘} - \dot{f})^{2} = L(x^{‘}) - m \dot{f}\dot{x}^{‘} + \frac{m}{2}\dot{f}^{2}

This leads to the equation of motion

\ddot{x}^{‘} = \ddot{f}

Therefore, the transformation above is a symmetry if

\ddot{f} = 0

or

f(t) = vt + x_{0}

Indeed, the transformations

x^{‘} = x + vt + x_{0}, \ \ \dot{x}^{‘} = \dot{x} + v

are nothing but the Galilean symmetry transformations of the free particle. Also notice that

\delta^{*}L = L^{‘}(x^{‘}) - L(x^{‘}) = \frac{d}{dt}\left( \frac{m}{2}v^{2}t - mvx^{‘} \right)

i.e., under the Galilean symmetry transformation above, the variation in the form of the Lagrangian function is a total time derivative.

Regards

sam
 
  • #13
The equation

<br /> \delta \mathcal{L} = \int \mathcal{L} \delta \left(d^{4}x \right) + \int d^{4}x \delta \mathcal{L} = \int d^{4}x \left( \delta^{*} \mathcal{L} + \partial_{a}( \delta x^{a} \mathcal{L}) \right),<br />

is always correct. This is because the local variation symbol

\delta (...) = \delta^{*} (...) + \delta x^{a}\partial_{a} (...)

is a derivation, i.e., it acts like a differential operator. So, a transformation is certainly an invariance (symmetry) transformation if

1) the Lagrangian itself is (form) invariant;

\delta^{*}\mathcal{L} = \bar{\mathcal{L}}(x) - \mathcal{L}(x) = 0

2) the functional form of the Lagrangian changes by a total derivative

\delta^{*}\mathcal{L} = \partial_{a} \Omega^{a}

Notice that in both cases, \mathcal{L} and \bar{\mathcal{L}} lead to the same equation of motion, and this what “symmetry” is all about.
Let me illustrate this. Consider the free particle Lagrangian

L = \frac{m}{2}\dot{x}^{2}

which gives the equation of motion

\ddot{x} = 0

Let us study the possible invariance of this system under a transformation of the form

\bar{x} = x + f, \ \ \dot{\bar{x}}= \dot{x} + \dot{f}

where f may be some function of time. We construct the Lagrangian \bar{L} for the new coordinate \bar{x},

\bar{L}(\bar{x}) = \frac{m}{2} ( \dot{\bar{x}} - \dot{f})^{2} = L(\bar{x}) - m \dot{f}\dot{\bar{x}} + \frac{m}{2}\dot{f}^{2}

This leads to the equation of motion

\ddot{\bar{x}} = \ddot{f}

Therefore, the transformation above is a symmetry if

\ddot{f} = 0

or

f(t) = vt + x_{0}

Indeed, the transformations

\bar{x} = x + vt + x_{0}, \ \ \dot{\bar{x}} = \dot{x} + v

are nothing but the Galilean symmetry transformations of the free particle. Also notice that

\delta^{*}L = \bar{L}(\bar{x}) - L(\bar{x}) = \frac{d}{dt}\left( \frac{m}{2}v^{2}t - mv\bar{x} \right)

i.e., under the Galilean symmetry transformation above, the variation in the form of the Lagrangian function is a total time derivative.

Regards

sam
 
Last edited:
Back
Top