Undergrad Understanding the Notation in Lagrangian Differentiation

Click For Summary
SUMMARY

This discussion focuses on the differentiation of the Lagrangian density in the context of the Euler-Lagrange (E-L) equation, specifically addressing the expression for the derivative of the Lagrangian with respect to the field's derivatives. The Lagrangian density is given as ℒ = 1/2 (∂μφ)(∂μφ), and it is established that ∂ℒ/∂(∂μφ) = ∂μφ due to the cancellation of the factor of 1/2 through the application of the product rule. The conversation also clarifies the distinction between covariant and contravariant derivatives, emphasizing that they can be interchanged using the raising operator, which is crucial for understanding the notation in field theory.

PREREQUISITES
  • Understanding of Lagrangian mechanics and the Euler-Lagrange equation
  • Familiarity with functional differentiation and the product rule
  • Knowledge of covariant and contravariant derivatives in tensor calculus
  • Basic grasp of summation notation and its application in field theory
NEXT STEPS
  • Study the derivation of the Euler-Lagrange equation from the Lagrangian density
  • Learn about the implications of covariant and contravariant derivatives in general relativity
  • Explore the product rule in functional differentiation in greater detail
  • Review the Klein-Gordon equation and its derivation from the Lagrangian formalism
USEFUL FOR

Students and researchers in theoretical physics, particularly those focusing on classical field theory, Lagrangian mechanics, and general relativity. This discussion is beneficial for anyone seeking to clarify the notation and differentiation techniques used in these areas.

shedrick94
Messages
30
Reaction score
0
I'm just in need of some clearing up of how to differentiate the lagrangian with respect to the covariant derivatives when solving the E-L equation:

Say we have a lagrangian density field
\begin{equation}
\mathcal{L}=\frac{1}{2}(\partial_{\mu}\hat{\phi})(\partial^{\mu}\hat{\phi})
\end{equation}

When solving for E-L why does:

\begin{equation}
\frac{\partial \mathcal{L}}{\partial (\partial_{\mu}\phi)}= \partial^{\mu}\hat{\phi}
\end{equation}

i.e Why do we lose the factor of 1/2
 
Physics news on Phys.org
Could it be due to a power rule like ##y=(1/2) * x^2## to ##dy/dx = (1/2) * 2 * x = x## ?
 
I always found it useful to actually write out the expression in summation notation to see what's happening.
 
jedishrfu said:
Could it be due to a power rule like ##y=(1/2) * x^2## to ##dy/dx = (1/2) * 2 * x = x## ?
Indeed. The rules for functional differentiation follow in many respects the rules for ordinary differentation.

Use the product rule and the fact that derivatives of the fields are linearly independent. Then you get two terms which are exactly the same. Hence the factor of 2, which cancels the factor 1/2.
 
By the way, why do you say "covariant derivatives"? Those are just partial derivatives. But in the end that doesn't matter: the covariant derivatives contain a partial derivative on phi plus a connection*phi. Phi and the partial derivative of phi are considered to be linearly independent. Just to be clear (I use deltas for functional derivation), say
$$D_a \phi = \partial_a \phi - A_a \phi $$
then
$$\frac{\delta }{\delta (\partial_a)\phi} \Bigl(\partial_b \phi - A_b \phi \Bigr) = \delta_b^a - 0 = \delta_b^a $$

so

$$\frac{\delta }{\delta (\partial_c)\phi} \Bigl(\eta^{ab} D_a \phi D_b \phi\Bigr) = \eta^{ab} \Bigl(\delta_a^c D_b \phi + D_a \phi \delta_b^c \Bigr) = 2 D_c \phi $$
 
Write it as
$$\frac{1}{2} (\partial_{\mu} \phi) (\partial^{\mu} \phi) = \frac{1}{2} (\partial_{\mu} \phi) (\partial_{\nu} \phi)g^{\mu \nu}$$
now differentiate.
 
He didn't elaborate on his question but I don't think we're helping. From the question it appears to me that the OP is just learning to deal with the summation notation which can be very confusing to students who have never seen it. Suppose this is just a classical field, no metric, no complex fields. Then the repeated lower and upper index means sum over that index. So, for example, if \varphi = \varphi ({x_1},{x_2},{x_3}) then \frac{1}{2}({\partial _\mu }\varphi )({\partial ^\mu }\varphi ) = \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right) and \frac{{\partial L}}{{\partial ({\partial _1}\varphi )}} = \frac{{\partial L}}{{\partial \left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)}} = \frac{{\partial \varphi }}{{\partial {x_1}}} etc.

I really think his issue is notation and I suspect that's what the first response was suggesting but I think we scared the guy away.
 
alan2 said:
He didn't elaborate on his question but I don't think we're helping. From the question it appears to me that the OP is just learning to deal with the summation notation which can be very confusing to students who have never seen it. Suppose this is just a classical field, no metric, no complex fields. Then the repeated lower and upper index means sum over that index. So, for example, if \varphi = \varphi ({x_1},{x_2},{x_3}) then \frac{1}{2}({\partial _\mu }\varphi )({\partial ^\mu }\varphi ) = \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right) and \frac{{\partial L}}{{\partial ({\partial _1}\varphi )}} = \frac{{\partial L}}{{\partial \left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)}} = \frac{{\partial \varphi }}{{\partial {x_1}}} etc.

I really think his issue is notation and I suspect that's what the first response was suggesting but I think we scared the guy away.
No you didn't scare me away. I do understand what the notation represents too. However, I saw in my notes that they had differentiated quickly the klein gordon lagrangian to get the klein gordon equation.

It is defnitely a notation thing though. I don't really understand the difference between
\begin{equation}
\partial_{\mu} \\ \partial^{\mu}
\end{equation}

except the factor of -1 in the spatial part.

However, if we are differentiating with respect to the covariant derivative (subscript mu) why do we differentiate the contravariant derivative (superscript mu). Or am I missing somehting entirely? Is it just that we can switch between the supersript and subscript using the raising operator?
 
Last edited:

Similar threads

  • · Replies 5 ·
Replies
5
Views
1K
  • · Replies 3 ·
Replies
3
Views
388
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 1 ·
Replies
1
Views
547
Replies
5
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 30 ·
2
Replies
30
Views
7K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K