Stress-Energy Tensor from Lagrangian: Technical Question

  1. Sep 12, 2005 #1
    Stress-Energy-Momentum Tensor from Lagrangian: Technical Question

    I've been reading about how to generate the stress-energy-momentum tensor [itex]T^{\mu \nu}[/itex] from the action

    [tex]S = \int d^{4}x \sqrt{|g|} \mathcal{L} [/tex]
    [tex]T^{\mu \nu} = \frac{2}{\sqrt{|g|}} \frac{\partial}{\partial g_{\mu \nu}} \left( \sqrt{|g|} \mathcal{L} \right)[/tex]

    My impression is that it should not matter whether we're differentiating with respect to upper indices [itex]g^{\mu \nu}[/itex] or lower [itex]g_{\mu \nu}[/itex] but in actual fact it seems to:


    [tex]\frac{\partial}{\partial g_{\mu \nu}} \left( \sqrt{|g|} \mathcal{L} \right) = \frac{1}{2} \sqrt{|g|} g^{\mu \nu} \mathcal{L} + \sqrt{|g|} \frac{\partial \mathcal{L}}{\partial g_{\mu \nu}}[/tex]


    [tex]\frac{\partial}{\partial g^{\mu \nu}} \left( \sqrt{|g|} \mathcal{L} \right) = -\frac{1}{2} \sqrt{|g|} g_{\mu \nu} \mathcal{L} + \sqrt{|g|} \frac{\partial \mathcal{L}}{\partial g^{\mu \nu}}[/tex]

    The difference in sign comes from the fact that

    [tex]\delta \sqrt{|g|} = \frac{1}{2} \sqrt{|g|} g^{\mu \nu} \delta g_{\mu \nu} = -\frac{1}{2} \sqrt{|g|} \delta g^{\mu \nu} g_{\mu \nu}[/tex]

    But how can the stress-energy-momentum tensor be dependent on whether we're differentiating with respect to lower or upper indices? I am most likely making an error somewhere.

    Also, what about the overall sign? I see Weinberg and Carroll's GR book/notes defining the tensor with a -2 instead of my +2 -- but when I use it on the EM free-lagrangian [itex]-\frac{1}{4} F^{\mu \nu} F_{\mu \nu}[/itex] it gives me negative energy.

    Is there an unambiguous way to determine both the overall sign and whether to take derivatives with respect to metric tensor elements with upper or lower indices?
    Last edited: Sep 12, 2005
  2. jcsd
  3. Sep 12, 2005 #2

    Physics Monkey

    Science Advisor
    Homework Helper


    Let me jump right to the heart of the matter. The simple reason why it matters whether you differentiate with respect to raised or lowered components of the metric is that the metric is the thing doing the raising and lowering. Let me illustrate this.

    Suppose I wanted to differentiate a Lagrangian with respect to a 1 form field [tex] a_\mu [/tex]. It is really easy to show that if I differentiate with respect to the associated vector field [tex] a^\mu [/tex] instead, I get a simple relation between the two derivatives:
    [tex] \frac{\partial \mathcal{L}}{\partial a^\mu} = g_{\mu \nu} \frac{\partial \mathcal{L}}{\partial a_\nu} [/tex]

    just as you would expect. This all arises because the raised components are just linear combinations of the lowered components.
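
    To spell that out, here is a short sketch of the chain-rule step, under the assumption that the metric is held fixed and does not depend on the field. Since [tex] a_\nu = g_{\nu \mu} a^\mu [/tex] is linear in [tex] a^\mu [/tex] with field-independent coefficients,
    [tex] \frac{\partial \mathcal{L}}{\partial a^\mu} = \frac{\partial a_\nu}{\partial a^\mu} \frac{\partial \mathcal{L}}{\partial a_\nu} = g_{\nu \mu} \frac{\partial \mathcal{L}}{\partial a_\nu} [/tex]

    and no extra sign or factor can appear.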

    The situation is different when I am considering the metric. You can still try to say that the raised metric components are linear combinations of the lowered metric components, but the coefficients are themselves the raised components of the metric!

    [tex] g^{\alpha \beta} = g^{\alpha \mu} g^{\beta \nu} g_{\mu \nu} [/tex]

    See what I mean? With the vector field, the coefficients connecting the raised and lowered components didn't involve the vector field. The problem is, as I said, that the metric is doing the raising and lowering. The relationship between the raised components of the metric and the lowered components of the metric is a nonlinear inverse relationship rather than a simple linear relationship. So it does make a difference, although it turns out to be a small one. You should try to figure out what this small difference is (i.e. calculate the relationship between derivatives with respect to lower components and derivatives with respect to upper components), though I'll be happy to help if you get stuck. Is this clear at all?

    Your problem with the overall sign in your definition of the stress-energy tensor is just a matter of sign convention, I think. For instance, Misner, Thorne, and Wheeler use your definition in their book (positive sign), and they point out at the beginning that they have a different sign convention from Weinberg. Overall signs are sometimes hard to compare in GR because everyone uses different conventions. If you make sure the kinetic energy comes in positive in the Lagrangian using your metric signature, then you should choose whichever sign makes the kinetic energy come out positive in the stress-energy tensor. The key is consistency.

    I haven't checked all your calculations, but it seems like they are ok. As long as you are consistent, all your equations will come out right. For example, if you take the derivative of the geometric part of the action with respect to raised components you had better take the derivative of the matter part of the action also with respect to raised components. Also, the convention is to always have the energy come out positive in the stress-energy tensor, so you adjust your definition accordingly (basically you use either plus or minus).

    Hope this helps!
  4. Sep 13, 2005 #3
    Thank you for your reply, Physics Monkey. It was helpful.

    It seems what you're saying is that

    [tex]\frac{\partial}{\partial g_{\mu \nu}} = -g^{\alpha \mu} g^{\beta \nu} \frac{\partial}{\partial g^{\alpha \beta}}[/tex]

    The way I got this was to consider

    [tex]\frac{\partial}{\partial g_{\mu \nu}} \left( g^{\sigma \lambda} g_{\lambda \tau} \right) = \frac{\partial}{\partial g_{\mu \nu}} \left( g^{\sigma \lambda} \right) g_{\lambda \tau} + g^{\sigma \lambda} \frac{\partial}{\partial g_{\mu \nu}} \left( g_{\lambda \tau} \right) = \frac{\partial}{\partial g_{\mu \nu}} \left( g^{\sigma \lambda} \right) g_{\lambda \tau} + g^{\sigma \lambda} \delta^{\mu}_{\phantom{\lambda} \lambda} \delta^{\nu}_{\phantom{\lambda} \tau}[/tex]
    [tex]\frac{\partial}{\partial g_{\mu \nu}} \left( g^{\sigma \lambda} g_{\lambda \tau} \right) = \frac{\partial}{\partial g_{\mu \nu}} \left( \delta^{\sigma}_{\phantom{\lambda} \tau} \right) = 0[/tex]

    which means

    [tex]\frac{\partial}{\partial g_{\mu \nu}} \left( g^{\sigma \lambda} \right) g_{\lambda \tau} = -g^{\sigma \lambda} \delta^{\mu}_{\phantom{\lambda} \lambda} \delta^{\nu}_{\phantom{\lambda} \tau}[/tex]
    [tex]\frac{\partial g^{\sigma \lambda}}{\partial g_{\mu \nu}} = -g^{\sigma \mu} g^{\lambda \nu}[/tex]

    and by the chain rule

    [tex]\frac{\partial}{\partial g_{\mu \nu}} = \frac{\partial g^{\alpha \beta}}{\partial g_{\mu \nu}} \frac{\partial}{\partial g^{\alpha \beta}} = -g^{\alpha \mu} g^{\beta \nu} \frac{\partial}{\partial g^{\alpha \beta}}[/tex]
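
    As a consistency check (a sketch that treats the components [itex]g_{\mu \nu}[/itex] as independent variables and holds everything else fixed), applying this chain rule to [itex]\sqrt{|g|}[/itex] reproduces the sign flip noted in the first post:

    [tex]\frac{\partial \sqrt{|g|}}{\partial g_{\mu \nu}} = -g^{\alpha \mu} g^{\beta \nu} \frac{\partial \sqrt{|g|}}{\partial g^{\alpha \beta}} = -g^{\alpha \mu} g^{\beta \nu} \left( -\frac{1}{2} \sqrt{|g|} g_{\alpha \beta} \right) = \frac{1}{2} \sqrt{|g|} g^{\mu \nu}[/tex]

    Under the same assumptions, the same step would relate the +2 definition with lower-index derivatives to the -2 definition with upper-index derivatives:

    [tex]T^{\mu \nu} = \frac{2}{\sqrt{|g|}} \frac{\partial \left( \sqrt{|g|} \mathcal{L} \right)}{\partial g_{\mu \nu}} \quad \Longleftrightarrow \quad T_{\alpha \beta} = -\frac{2}{\sqrt{|g|}} \frac{\partial \left( \sqrt{|g|} \mathcal{L} \right)}{\partial g^{\alpha \beta}}[/tex]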

    If I am right - I'm not sure if I am, of course - then I have a new question: what is

    [tex] \frac{\partial}{\partial g_{\mu \nu}} \left( g_{\alpha \beta} V^{\alpha} V^{\beta} \right)[/tex]

    Is it

    [tex] \frac{\partial}{\partial g_{\mu \nu}} \left( g_{\alpha \beta} V^{\alpha} V^{\beta} \right) = \delta^{\mu}_{\phantom{\mu}\alpha} \delta^{\nu}_{\phantom{\mu}\beta} V^{\alpha} V^{\beta} = V^{\mu} V^{\nu}[/tex]

    or is it

    [tex] \frac{\partial}{\partial g_{\mu \nu}} \left( g^{\alpha \beta} V_{\alpha} V_{\beta} \right) = - g^{\mu \alpha} g^{\nu \beta} V_{\alpha} V_{\beta} = -V^{\mu} V^{\nu}[/tex]
    Last edited: Sep 13, 2005
  5. Sep 13, 2005 #4

    Physics Monkey

    Science Advisor
    Homework Helper


    You got it! The only difference is an extra minus sign. Basically the minus comes from the fact that what you're doing is calculating
    [tex] \frac{\partial x^{-1}}{\partial x} = - x^{-2} [/tex]

    Ok, so now how do we resolve the apparent paradox? Suppose I start with the first statement,
    [tex] \frac{\partial}{\partial g_{\mu \nu}} \left( g_{\alpha \beta} V^{\alpha} V^{\beta} \right) = \delta^{\mu}_{\phantom{\mu}\alpha} \delta^{\nu}_{\phantom{\mu}\beta} V^{\alpha} V^{\beta} = V^{\mu} V^{\nu}[/tex]

    If this statement is true then it means the raised components of V are independent of the metric. Ok, so now how do I seem to get a different answer for the same derivative below? The catch is that you cheated to calculate the derivative. If the raised components of V are independent of metric then the lowered components of V are dependent on the metric since they were obtained by contraction with the metric. In other words,
    [tex] \frac{\partial V_\alpha}{\partial g_{\mu \nu}} \neq 0 [/tex]
    which is contrary to what you have assumed in calculating the derivative. You can check for yourself that you get the term you have plus the extra term,
    [tex] 2 V^\mu V^\nu [/tex] which makes the two expressions agree.
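
    Explicitly, as a sketch assuming that the raised components [tex] V^\alpha [/tex] are the metric-independent ones, so that [tex] V_\alpha = g_{\alpha \gamma} V^\gamma [/tex]:
    [tex] \frac{\partial V_\alpha}{\partial g_{\mu \nu}} = \delta^{\mu}_{\phantom{\mu}\alpha} V^{\nu} [/tex]
    [tex] \frac{\partial}{\partial g_{\mu \nu}} \left( g^{\alpha \beta} V_{\alpha} V_{\beta} \right) = -g^{\mu \alpha} g^{\nu \beta} V_{\alpha} V_{\beta} + 2 g^{\alpha \beta} V_{\beta} \frac{\partial V_\alpha}{\partial g_{\mu \nu}} = -V^{\mu} V^{\nu} + 2 V^{\mu} V^{\nu} = V^{\mu} V^{\nu} [/tex]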

    This obviously begs the question, "How do I know which of the raised or lowered components are the ones independent of the metric?" The answer has to come from context. Suppose I define some vector as tangent to a curve. Well, this object naturally comes with a raised index and isn't defined using the metric, so the raised components must be metric independent. However, when you use the metric to find the equivalent 1 form, the 1 form you obtain obviously depends on the metric you use. Thus the lowered components do depend on the metric. This subtlety means that you have to be very careful about how you define various tensors. For instance, is the electromagnetic potential most naturally a 1 form or a vector? Elementary treatments often start with it as a vector, but in gauge theory it appears as a connection 1 form. Consistency must be your guide.

    Does this picture make sense?
  6. Sep 14, 2005 #5
    Thank you for your clarification - again it is very helpful.

    It is rather interesting you brought up the electromagnetic potential, because I started thinking about all this due to my trying to get the correct stress-energy tensor out of Maxwell's "free" lagrangian [itex]\mathcal{L} = -\frac{1}{4} F_{\mu \nu} F^{\mu \nu}[/itex]. I eventually got the correct tensor by reasoning that somehow it is [itex]F_{\mu \nu}[/itex] that was the basic object and so I ought to take the derivative with respect to [itex]g^{\alpha \beta}[/itex] and not the lower index, which had been giving me a wrong relative sign between the two terms in the expression.
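
    For reference, a sketch of that calculation, assuming [itex]F_{\mu \nu}[/itex] is held fixed, the signature is (-1,1,1,1), and the tensor is defined with the -2 and the upper-index derivative, [itex]T_{\alpha \beta} = -\frac{2}{\sqrt{|g|}} \frac{\partial}{\partial g^{\alpha \beta}} \left( \sqrt{|g|} \mathcal{L} \right)[/itex]. Writing [itex]\mathcal{L} = -\frac{1}{4} g^{\mu \rho} g^{\nu \sigma} F_{\mu \nu} F_{\rho \sigma}[/itex],

    [tex]\frac{\partial \mathcal{L}}{\partial g^{\alpha \beta}} = -\frac{1}{2} g^{\nu \sigma} F_{\alpha \nu} F_{\beta \sigma}[/tex]
    [tex]T_{\alpha \beta} = g_{\alpha \beta} \mathcal{L} - 2 \frac{\partial \mathcal{L}}{\partial g^{\alpha \beta}} = g^{\nu \sigma} F_{\alpha \nu} F_{\beta \sigma} - \frac{1}{4} g_{\alpha \beta} F_{\mu \nu} F^{\mu \nu}[/tex]

    so that [itex]T_{00} = \frac{1}{2} \left( E^2 + B^2 \right)[/itex], which is non-negative.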

    Now, could you explain - I apologize for going off topic here - why exactly we ought to think of the vector potential as components of a 1-form, and not as a vector? This is also somewhat related to my confusion of how to match up the vector notation (in cartesian coordinates) for the E field as

    [tex] \vec{E} = -\frac{\partial \vec{A}}{\partial t} - \nabla A^{0} [/tex]

    with that in the gauge-invariant tensor form

    [tex] E_{i} = \partial_{i} A_{0} - \partial_{0} A_{i}[/tex]

    Why is there no relative negative sign between the two terms in the vector form, whereas there is in the [itex]F_{i0}[/itex]? Is it even legitimate to put a lower index on [itex]E_{i}[/itex], as I've done? How would I write the indices in the vector form? Do the components of [itex]\vec{A}[/itex] have upper or lower indices? Should it be [itex]A^{0}[/itex] or [itex]A_{0}[/itex] in the second term?
  7. Sep 14, 2005 #6


    Science Advisor
    Gold Member

    An astounding explanation, Physics Monkey. As advertised, it cut right to the heart of the matter. And it was a pretty darn good question to start with, lonelyphysicist. I'm going to the counter for some popcorn. You guys want anything?
  8. Sep 14, 2005 #7
    You need to recheck what you're doing. Take this expression above. On the left side you have a second rank covariant operator acting on a scalar and you're getting a second rank contravariant object. Doesn't that bother you??? It gives me the willies! And I hate those darn willies! :yuck:

  9. Sep 14, 2005 #8


    Science Advisor
    Gold Member

    My knowledge of GR is extremely limited, and it is 25 years since I did any. However, even with my elementary knowledge, I was under the impression that the derivative with respect to a covariant tensor was a contravariant operator. Or am I confused?
  10. Sep 14, 2005 #9

    Physics Monkey

    Science Advisor
    Homework Helper

    DrGreg is right about the index positions. The bottom of the bottom is the top of the top. Like a fraction of fractions. That's how I remember it.
    Last edited: Sep 14, 2005
  11. Sep 14, 2005 #10

    Physics Monkey

    Science Advisor
    Homework Helper

    I would like a coke please, Chronos.
  12. Sep 14, 2005 #11

    Physics Monkey

    Science Advisor
    Homework Helper


    Fantastic. Glad to see this is making sense to you. Now, why is the EM potential most naturally a 1 form? The short answer is, "because the partial derivative [tex] \partial_\mu [/tex] naturally comes with a lowered index." What does this mean? Well, if you use the old notation a gauge transformation can be written like this,
    [tex] \vec{A}' = \vec{A} + \vec{\nabla} \Lambda [/tex]
    [tex] \phi' = \phi - \frac{\partial \Lambda}{\partial t} [/tex]

    Note the odd sign difference between the two. What is really going on here is, [tex] A'^\mu = A^\mu + \partial^\mu \Lambda [/tex]. Now, you know that the partial derivative is most naturally defined with a lowered index, and in order to get the upper index we have to use the metric. That minus sign in front of the time derivative is simply the minus from the usual special relativity metric.
    [tex]\partial^0 = \eta^{0 \mu} \partial_\mu = - \partial_0 = -\frac{\partial }{\partial t} [/tex]

    The weird minus sign appears because they are using the metric dependent upper components of the potential. Now the difference here is trivial, but I am led to the conclusion that, generally speaking, if I use the raised components of the potential then my gauge transformations are metric dependent! So the raised components of the potential are metric dependent. It is the lowered components of the potential that don't depend on the metric for their definition. They are introduced in the gauge covariant derivative (a different covariant derivative from the space-time one) [tex] D_\mu = \partial_\mu - i e A_\mu [/tex]. These lowered components change under a gauge transformation according to [tex] A'_\mu = A_\mu + \partial_\mu \Lambda [/tex] with no reference to the metric, and we define the gauge field [tex] A_\mu [/tex] without the metric. I must introduce the potential as a 1 form because the partial derivatives come naturally with a lowered index.

    The lowered component [tex] A_0 [/tex] is correct in your last equation. The key is that [tex] A_0 = - A^0 = - \phi [/tex].
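
    Spelled out, as a sketch with the metric (-1,1,1,1) and c = 1, so that [tex] A_i = A^i [/tex] and [tex] \partial_0 = \partial / \partial t [/tex]:
    [tex] E_i = \partial_i A_0 - \partial_0 A_i = -\partial_i \phi - \frac{\partial A^i}{\partial t} [/tex]

    which is the usual [tex] \vec{E} = -\nabla \phi - \partial \vec{A} / \partial t [/tex], component by component.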

    More, or do I need to clarify something?
  13. Sep 14, 2005 #12
    I still don't fully understand how to go from

    [tex] \vec{E} = -\frac{\partial \vec{A}}{\partial t} - \nabla A^{0} [/tex]


    [tex] E_{i} = \partial_{i} A_{0} - \partial_{0} A_{i}[/tex]

    In the first line, should it be a [itex]A^{0}[/itex] or [itex]A_{0}[/itex]? And when I convert to component notation should I write

    [tex] (\vec{E})^{i} = \left( -\frac{\partial \vec{A}}{\partial t} - \nabla A^{0} \right)^{i} = -\partial_{0} A^{i} - \nabla_{i} A^{0} \textrm{ ?}[/tex]

    Of course that looks wrong, but that's how I'd usually write it if I treat the [itex]\vec{E}[/itex] and [itex]\vec{A}[/itex] fields as 3-vectors with upper indices, and I'll give lower indices to the spacetime derivatives since they transform accordingly.

    I'm also slightly concerned about sign convention. Your [itex]\partial^{0} = -\frac{\partial}{\partial t}[/itex] is a convention choice, right? So are the negative signs in my first line above for the E field also convention dependent? What about the second line in terms of the gauge-invariant [itex]F_{i0}[/itex]?
  14. Sep 14, 2005 #13
    This popcorn you consumed - I appreciate your offer, though I'd politely decline - what's more pressing is, could you advise me, what exactly is its Lagrangian, or "world function", as D. Hilbert calls it? And would you recommend taking the derivative with respect to upper or lower index metric components in order to get the correct form of its stress-energy-momentum tensor?
  15. Sep 15, 2005 #14
    In my opinion, people (except one guy) have already replied excellently. But let me still add a very simple comment.

    It appears that you think that [tex]g^{\mu \nu}[/tex] and [tex]g_{\mu \nu}[/tex] represent the same "physics" and that, therefore, it does not matter which g you use in the definition of the tensor within the same representation of GR. However, note that even in the linear regime there is a sign difference between the two, so one cannot mix them in the same equations.

    [tex]g^{\mu \nu} = \eta^{\mu \nu} - h^{\mu \nu} [/tex]


    [tex]g_{\mu \nu} = \eta_{\mu \nu} + h_{\mu \nu} [/tex]
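
    A quick check of that sign, as a sketch to first order in [tex]h[/tex] with the indices on [tex]h[/tex] raised by [tex]\eta[/tex]:

    [tex]g^{\mu \nu} g_{\nu \lambda} = \left( \eta^{\mu \nu} - h^{\mu \nu} \right) \left( \eta_{\nu \lambda} + h_{\nu \lambda} \right) = \delta^{\mu}_{\phantom{\mu}\lambda} + h^{\mu}_{\phantom{\mu}\lambda} - h^{\mu}_{\phantom{\mu}\lambda} + \mathcal{O}(h^2) = \delta^{\mu}_{\phantom{\mu}\lambda} + \mathcal{O}(h^2)[/tex]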

    I think you should identify which variables the action depends on and then differentiate with respect to just those variables. For example, for the Palatini action written as a function of [tex]g^{\mu \nu}[/tex] you would take the tensor defined by differentiating with respect to [tex]g^{\mu \nu}[/tex] instead of the usual [tex]g_{\mu \nu}[/tex].

    About the metric signature, there is no single standard. Usually general relativists prefer +2, whereas particle physicists (working with SR, of course) prefer -2. I traditionally used -2, but it appears that there are advantages to using +2 in general relativity problems.
    Last edited: Sep 15, 2005
  16. Sep 15, 2005 #15


    Science Advisor

    Physics Monkey for Science Advisor! :smile:
  17. Sep 15, 2005 #16

    Physics Monkey

    Science Advisor
    Homework Helper


    From electrodynamics we know that [tex] E^x = - \frac{\partial \phi}{\partial x} - \frac{\partial A^x}{\partial t} [/tex] and we always define [tex] A^0 = \phi [/tex].

    Now where does the sign convention come in? If I call the components of the usual vector potential [tex] A^i [/tex] then I can write the above equation two ways:

    If I use the metric (-1,1,1,1) then I can write [tex] E^i = \partial^0 A^i - \partial^i A^0 = F^{0 i} [/tex] where I have absorbed the minus sign into the time derivative to raise the index.

    If I use the metric (1,-1,-1,-1) then I can write [tex] E^i = \partial^i A^0 - \partial^0 A^i = F^{i 0}[/tex] where now I have absorbed the minus sign into the space derivative to raise the index.

    Well, which is it? It turns out that how I interpret [tex] F^{0 i} [/tex] depends on the metric I use. But there is no contradiction or ambiguity here. To see this, let's ask how the electric field is defined. It's defined in terms of the Lorentz force, right? Well, the Lorentz force is given by [tex] f^\mu = q F^{\mu}_{\phantom{\mu}\nu} u^\nu = q F^{\mu \alpha} \eta_{\alpha \nu} u^\nu [/tex], so let's see what we get.

    If I use the metric (-1,1,1,1) then I find [tex] f^i = q F^{i 0} \eta_{0 0} u^0 + ... [/tex] where ... is the magnetic part. For small velocities I thus get [tex] f^i = q (-E^i) (-1) (1) + ... = q E^i + ... [/tex] exactly as I should.

    On the other hand, if I use the metric (1,-1,-1,-1) then I find as before [tex] f^i = q F^{i 0} \eta_{0 0} u^0 + ... [/tex] but now this reduces to [tex] f^i = q (E^i) (1) (1) + ... = q E^i + ... [/tex] for small velocities just as before.

    So you see I get the force right both times, but which of the components of [tex] F^{\mu \nu} [/tex] (or equivalently [tex] F_{\mu \nu} [/tex]) I identify as the electric field is dependent on the metric I choose.
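
    In terms of lowered components, as a sketch using only [tex] F_{\mu \nu} = \eta_{\mu \alpha} \eta_{\nu \beta} F^{\alpha \beta} [/tex] and the identifications above:
    [tex] (-1,1,1,1): \quad E^i = F^{0 i} = F_{i 0} [/tex]
    [tex] (1,-1,-1,-1): \quad E^i = F^{i 0} = F_{0 i} [/tex]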

    Does this help clarify things?
  18. Sep 17, 2005 #17
    Physics Monkey - thank you for taking the time to reply. I think you've cleared up my confusion.
  19. Sep 18, 2005 #18
    Juan: could you explain briefly what's the advantage of using +2 in GR problems?
  20. Jan 26, 2007 #19
    Can you please explain why, when you differentiate the square root of the determinant of the metric, you don't get the inverse root as a factor, but the root itself?
    Isn't the 1/2 factor a result of the chain rule?

    thank you.
  21. Jan 26, 2007 #20
    There are 2 methods of deriving this result.

    One is to use the fact that for any matrix A

    [tex] \textrm{det} A = \textrm{det} \exp[\log[A]] = \exp[{\rm Tr}[\log[A]]][/tex]

    Hence if we consider [itex]g \to g+\delta g = g(1+g^{-1}\delta g)[/itex], we have

    [tex] \sqrt{g} \to \sqrt{g} \sqrt{{\rm det}[1+g^{-1}\delta g]} = \sqrt{g} \exp[(1/2){\rm Tr}[\log[1+g^{-1}\delta g]]] = \sqrt{g} \left(1+\frac{1}{2}{\rm Tr}[g^{-1}\delta g] + \mathcal{O}[\delta g^2] \right)[/tex]
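
    To fill in the expansion (a sketch keeping only first order in [itex]\delta g[/itex], with [itex]\sqrt{g}[/itex] standing for the square root of the determinant): [itex]\log[1+X] = X + \mathcal{O}[X^2][/itex], and in components the trace is

    [tex] {\rm Tr}[g^{-1}\delta g] = g^{\mu\nu} \delta g_{\nu\mu} \quad \Rightarrow \quad \delta \sqrt{g} = \frac{1}{2} \sqrt{g} \, g^{\mu\nu} \delta g_{\mu\nu} [/tex]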

    The second method is to use Cramer's rule from linear algebra. First we observe that

    [tex] g^{\mu\nu} g_{\nu\lambda} = \delta^\mu_{\phantom{\mu}\lambda} [/tex]
    [tex] \Rightarrow \frac{\delta g^{\mu\nu}}{\delta g_{\tau\sigma}} = - g^{\mu\tau}g^{\nu\sigma} [/tex]

    Then differentiating Cramer's rule on both sides gives

    [tex] \frac{\delta g^{\mu\nu}}{\delta g_{\nu\mu}} = \frac{\delta}{\delta g_{\nu\mu}} \frac{(-1)^{\mu+\nu} {\rm det}g_{\hat{\mu\nu}}}{{\rm det} g_{\mu\nu}} = - \frac{g^{\nu\mu}}{{\rm det}g} \frac{\delta}{\delta g_{\nu\mu}} {\rm det} g [/tex]

    where [itex]{\rm det}g_{\hat{\mu\nu}}[/itex] is the determinant of the metric with the [itex]\mu[/itex] column and [itex]\nu[/itex] row removed, and no Einstein summation is implied. But

    [tex] \frac{\delta g^{\nu\mu}}{\delta g_{\mu\nu}} = - g^{\mu\nu}g^{\nu\mu} [/tex]

    so that

    [tex] \frac{\delta}{\delta g_{\nu\mu}} {\rm det} g = \frac{\delta}{\delta g_{\mu\nu}} {\rm det} g = {\rm det}[ g ] g^{\mu\nu} [/tex]
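
    This also addresses the question about the inverse root. The chain rule does produce [itex]1/(2\sqrt{{\rm det} g})[/itex], but it multiplies the factor of [itex]{\rm det} g[/itex] from the result above (a sketch, taking [itex]{\rm det} g > 0[/itex] for simplicity; otherwise use [itex]|{\rm det} g|[/itex]):

    [tex] \frac{\delta \sqrt{{\rm det} g}}{\delta g_{\mu\nu}} = \frac{1}{2 \sqrt{{\rm det} g}} \frac{\delta \, {\rm det} g}{\delta g_{\mu\nu}} = \frac{1}{2 \sqrt{{\rm det} g}} \, {\rm det}[g] \, g^{\mu\nu} = \frac{1}{2} \sqrt{{\rm det} g} \, g^{\mu\nu} [/tex]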
    Last edited: Jan 26, 2007