Understanding the Notation in Lagrangian Differentiation

Click For Summary

Discussion Overview

The discussion revolves around the differentiation of the Lagrangian density in the context of the Euler-Lagrange equations, specifically addressing the treatment of covariant and contravariant derivatives, and the implications of notation in this process.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant questions why the factor of 1/2 is lost when differentiating the Lagrangian density with respect to the derivative of the field.
  • Another participant suggests that the power rule for differentiation may apply, drawing a parallel to ordinary differentiation.
  • A different participant emphasizes the utility of summation notation to clarify the differentiation process.
  • Some participants discuss the nature of covariant versus partial derivatives, with one asserting that they are essentially the same in this context, while another provides a detailed example of functional differentiation.
  • There is a suggestion to express the Lagrangian in a form that explicitly shows the summation over indices to aid in differentiation.
  • One participant expresses concern that the discussion may not be helping the original poster, suggesting that the confusion stems from the notation used in summation and differentiation.
  • The original poster acknowledges understanding the notation but seeks clarification on the differentiation of covariant versus contravariant derivatives.

Areas of Agreement / Disagreement

Participants express differing views on the treatment of covariant and contravariant derivatives, and there is no consensus on the best approach to clarify the original poster's confusion regarding the notation and differentiation process.

Contextual Notes

Some participants note that the original poster may be struggling with the notation and summation conventions, which can be particularly challenging for those new to the topic. There is also mention of the potential confusion arising from the relationship between covariant and contravariant derivatives.

shedrick94
Messages
30
Reaction score
0
I'm just in need of some clearing up of how to differentiate the lagrangian with respect to the covariant derivatives when solving the E-L equation:

Say we have a lagrangian density field
\begin{equation}
\mathcal{L}=\frac{1}{2}(\partial_{\mu}\hat{\phi})(\partial^{\mu}\hat{\phi})
\end{equation}

When solving for E-L why does:

\begin{equation}
\frac{\partial \mathcal{L}}{\partial (\partial_{\mu}\phi)}= \partial^{\mu}\hat{\phi}
\end{equation}

i.e Why do we lose the factor of 1/2
 
Physics news on Phys.org
Could it be due to a power rule like ##y=(1/2) * x^2## to ##dy/dx = (1/2) * 2 * x = x## ?
 
I always found it useful to actually write out the expression in summation notation to see what's happening.
 
jedishrfu said:
Could it be due to a power rule like ##y=(1/2) * x^2## to ##dy/dx = (1/2) * 2 * x = x## ?
Indeed. The rules for functional differentiation follow in many respects the rules for ordinary differentation.

Use the product rule and the fact that derivatives of the fields are linearly independent. Then you get two terms which are exactly the same. Hence the factor of 2, which cancels the factor 1/2.
 
By the way, why do you say "covariant derivatives"? Those are just partial derivatives. But in the end that doesn't matter: the covariant derivatives contain a partial derivative on phi plus a connection*phi. Phi and the partial derivative of phi are considered to be linearly independent. Just to be clear (I use deltas for functional derivation), say
$$D_a \phi = \partial_a \phi - A_a \phi $$
then
$$\frac{\delta }{\delta (\partial_a)\phi} \Bigl(\partial_b \phi - A_b \phi \Bigr) = \delta_b^a - 0 = \delta_b^a $$

so

$$\frac{\delta }{\delta (\partial_c)\phi} \Bigl(\eta^{ab} D_a \phi D_b \phi\Bigr) = \eta^{ab} \Bigl(\delta_a^c D_b \phi + D_a \phi \delta_b^c \Bigr) = 2 D_c \phi $$
 
Write it as
$$\frac{1}{2} (\partial_{\mu} \phi) (\partial^{\mu} \phi) = \frac{1}{2} (\partial_{\mu} \phi) (\partial_{\nu} \phi)g^{\mu \nu}$$
now differentiate.
 
He didn't elaborate on his question but I don't think we're helping. From the question it appears to me that the OP is just learning to deal with the summation notation which can be very confusing to students who have never seen it. Suppose this is just a classical field, no metric, no complex fields. Then the repeated lower and upper index means sum over that index. So, for example, if \varphi = \varphi ({x_1},{x_2},{x_3}) then \frac{1}{2}({\partial _\mu }\varphi )({\partial ^\mu }\varphi ) = \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right) and \frac{{\partial L}}{{\partial ({\partial _1}\varphi )}} = \frac{{\partial L}}{{\partial \left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)}} = \frac{{\partial \varphi }}{{\partial {x_1}}} etc.

I really think his issue is notation and I suspect that's what the first response was suggesting but I think we scared the guy away.
 
alan2 said:
He didn't elaborate on his question but I don't think we're helping. From the question it appears to me that the OP is just learning to deal with the summation notation which can be very confusing to students who have never seen it. Suppose this is just a classical field, no metric, no complex fields. Then the repeated lower and upper index means sum over that index. So, for example, if \varphi = \varphi ({x_1},{x_2},{x_3}) then \frac{1}{2}({\partial _\mu }\varphi )({\partial ^\mu }\varphi ) = \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_2}}}} \right) + \frac{1}{2}\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right)\left( {\frac{{\partial \varphi }}{{\partial {x_3}}}} \right) and \frac{{\partial L}}{{\partial ({\partial _1}\varphi )}} = \frac{{\partial L}}{{\partial \left( {\frac{{\partial \varphi }}{{\partial {x_1}}}} \right)}} = \frac{{\partial \varphi }}{{\partial {x_1}}} etc.

I really think his issue is notation and I suspect that's what the first response was suggesting but I think we scared the guy away.
No you didn't scare me away. I do understand what the notation represents too. However, I saw in my notes that they had differentiated quickly the klein gordon lagrangian to get the klein gordon equation.

It is defnitely a notation thing though. I don't really understand the difference between
\begin{equation}
\partial_{\mu} \\ \partial^{\mu}
\end{equation}

except the factor of -1 in the spatial part.

However, if we are differentiating with respect to the covariant derivative (subscript mu) why do we differentiate the contravariant derivative (superscript mu). Or am I missing somehting entirely? Is it just that we can switch between the supersript and subscript using the raising operator?
 
Last edited:

Similar threads

  • · Replies 3 ·
Replies
3
Views
710
  • · Replies 5 ·
Replies
5
Views
1K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 1 ·
Replies
1
Views
746
Replies
5
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 30 ·
2
Replies
30
Views
7K