Variation of Einstein-Hilbert action

  • Thread starter Thread starter jdstokes
  • Start date Start date
  • Tags Tags
    Variation
Click For Summary
SUMMARY

The discussion centers on the derivation of the Einstein field equations from the Einstein-Hilbert action, specifically addressing the treatment of metric variations. The action is defined as S = ∫ d⁴x √(-g) (R/2κ + ℒₘ), where the variations lead to the equations of motion. Participants debate why derivatives of the metric g^{αβ} are not treated as independent variables, contrasting this with scalar field theory. The conversation highlights the importance of functional derivatives and the implications of boundary conditions in deriving the Euler-Lagrange equations for general relativity.

PREREQUISITES
  • Understanding of the Einstein-Hilbert action and its formulation.
  • Familiarity with functional derivatives and their application in field theory.
  • Knowledge of the Euler-Lagrange equations in classical mechanics and field theory.
  • Basic concepts of general relativity, including metric tensors and curvature.
NEXT STEPS
  • Study the derivation of the Einstein field equations from the Einstein-Hilbert action in detail.
  • Explore the role of functional derivatives in variational principles, particularly in general relativity.
  • Examine the implications of boundary conditions on the evolution of metrics in general relativity.
  • Read "A Relativist's Toolkit: The Mathematics of Black-Hole Mechanics" by Eric Poisson for advanced insights into variational principles.
USEFUL FOR

The discussion is beneficial for theoretical physicists, graduate students in physics, and researchers focusing on general relativity and field theory, particularly those interested in the mathematical foundations of gravitational theories.

jdstokes
Messages
520
Reaction score
1
The Einstein field equations \mathsf{G} = \kappa \mathsf{T} can be derived by considering stationary metric variations of the Einstein Hilbert action,

S = \int \mathrm{d}^4x \sqrt{-g} (R/2\kappa + \mathcal{L}_\mathrm{M}).

0 = \delta S = \int\mathrm{d}^4 x\left(\frac{1}{2\kappa}\frac{\partial (\sqrt{-g} R)}{\partial g^{\alpha\beta}}+ \frac{\partial (\sqrt{-g}\mathcal{L}_\mathrm{M})}{\partial g^{\alpha\beta}}\right)\delta g^{\alpha\beta}

etc.

In conventional field theory, however, we consider variations of the action integrand with respect to both the field \varphi as well as its first derivative \partial_\mu \varphi.

Why can we avoid doing this when \varphi = g^{\alpha\beta}?.
 
Physics news on Phys.org
The action is always a functional that takes the field(s) to a number. The Lagrangian on the other hand is just a (multi-variable) polynomial (it takes a bunch of numbers to a number). The variables that we plug into it are the field and its derivatives at some spacetime point x. Isn't that the case here too? Aren't there derivatives of g in the definition of R?
 
Yes, there are derivative terms in e.g., the Christoffel symbols.

My question is why don't we treat the field derivatives as independent variables as we do in, for example scalar field theory (or even classical mechanics for that matter).

Let me give you an example. In electromagnetism the equation of motion is

\frac{\partial \mathcal{L}}{\partial A_\mu} - \partial_\nu \frac{\partial \mathcal{L}}{\partial \partial_\nu A_\mu} = 0.

In general relativity the equation of motion is

\frac{\partial \mathcal{L}}{\partial g_{\mu\nu}} = 0. I don't understand why is it not

\frac{\partial \mathcal{L}}{\partial g_{\mu \nu}} - \partial_\rho \frac{\partial \mathcal{L}}{\partial \partial_\rho g_{\mu\nu}}= 0?
 
OK, I see what you mean. Unfortunately I don't know the answer.
 
I'm not sure if I understand your question correctly. If you want to use explicitly the EL equations instead of doing the variation directly, you consider the metric, it's first derivative and second derivative as independent fields. So t he EL equations become

<br /> \frac{\delta\mathcal{L}}{\delta g_{\mu\nu}} = \frac{\partial \mathcal{L}}{\partial g_{\mu \nu}} - \partial_\rho \frac{\partial \mathcal{L}}{\partial (\partial_\rho g_{\mu\nu})} + \partial_{\rho}\partial_{\lambda}\frac{\partial\mathcal{L}}{\partial(\partial_{\rho}\partial_{\lambda}g_{\mu\nu})}= 0<br />

in which I've set the boundary conditions to zero. Using now the Lagrangian

<br /> \mathcal{L} = \sqrt{g}R<br />

gives you the right equations of motion after a very tedious calculation. In d'Inverno, chapter 11 you can find some more information on this.
 
Last edited:
Maybe it's the confusing notation between partial derivatives and functional derivatives that disturbs you. In your

0 = \delta S = \int\mathrm{d}^4 x\left(\frac{1}{2\kappa}\frac{\partial (\sqrt{-g} R)}{\partial g^{\alpha\beta}}+ \frac{\partial (\sqrt{-g}\mathcal{L}_\mathrm{M})}{\partial g^{\alpha\beta}}\right)\delta g^{\alpha\beta}

I would have used functional derivatives \delta instead of partial derivatives \partial.
 
Hang on, the Einstein field equations are

\frac{\partial \mathcal{L}}{\partial g_{\mu\nu}} = 0,

where \mathcal{L} = \sqrt{-g}(R/2\kappa + \mathcal{L}_{\rm matter})

this is a partial derivative, not a functional derivative. Compare with with electromagnetism:

\frac{\partial \mathcal{L}}{\partial A_\mu} - \partial_\nu \frac{\partial \mathcal{L}}{\partial \partial_\nu A_\mu} = 0

these are also partial derivatives.

There is a discrepancy.

I don't understand where you got this equation from:

\frac{\delta\mathcal{L}}{\delta g_{\mu\nu}} = \frac{\partial \mathcal{L}}{\partial g_{\mu \nu}} - \partial_\rho \frac{\partial \mathcal{L}}{\partial (\partial_\rho g_{\mu\nu})} + \partial_{\rho}\partial_{\lambda}\frac{\partial\ma thcal{L}}{\partial(\partial_{\rho}\partial_{\lambd a}g_{\mu\nu})}= 0

It's not the Einstein field equations.
 
I retract my last post.

Looking at the variation

\delta(\sqrt{-g} g^{\alpha\beta}R_{\alpha\beta}) = (\delta \sqrt{-g})g^{\alpha\beta} R_{\alpha\beta} + \sqrt{-g}\delta g^{\alpha\beta}R_{\alpha\beta} + \sqrt{-g}g^{\alpha\beta}\delta R_{\alpha\beta}

you can see that the first two terms come from partial derivatives with respect to the metric. In order for the last term to vanish, however, you need to consider variations with respect to the Christoffel symbols (or equivalently higher derivatives of the metric).

Thanks.
 
For a careful treatment of variational principles of general relativity, including variation of the Christoffel symbols, I suggest that you look at chapter 4, Lagrangian and Hamiltonian formulations of general relativity, from the book A Relativist's Toolkit: The Mathematics of Black-Hole Mechanics by Eric Poisson.
 
  • #10
I think you've got your answer, but maybe this helps. Just write the whole calculation explicitly from the start and try to look at how the functional derivative is derived in the first place.

So define

<br /> \delta\mathcal{L} = \frac{\partial\mathcal{L}}{\partial g_{\mu\nu}}\delta g_{\mu\nu} + \frac{\partial\mathcal{L}}{\partial(\partial_{\lambda}g_{\mu\nu})}\delta(\partial_{\lambda}g_{\mu\nu}) + \frac{\partial\mathcal{L}}{\partial (\partial_{\lambda}\partial_{\rho}g_{\mu\nu})}\delta(\partial_{\lambda}\partial_{\rho}g_{\mu\nu})<br />

Here you explicitly use that the field and it's derivative are independent fields. Some partial integrations give you the Euler Lagrange equations plus the boundary terms,

<br /> \delta\mathcal{L} = \Bigl(\frac{\partial \mathcal{L}}{\partial g_{\mu \nu}} - \partial_\rho \frac{\partial \mathcal{L}}{\partial (\partial_\rho g_{\mu\nu})} + \partial_{\rho}\partial_{\lambda}\frac{\partial\mathcal{L}}{\partial(\partial_{\rho}\partial_{\lambda}g_{\mu\nu})}\Bigr)\delta g_{\mu\nu} + BC<br />

Imposing boundary conditions on a hyperplane enables you to define the initial conditions with which you can describe the evolution of the metric to another hyperplane. So if BC=0 this makes people write

<br /> \frac{\delta\mathcal{L}}{\delta g_{\mu\nu}} \equiv \frac{\partial \mathcal{L}}{\partial g_{\mu \nu}} - \partial_\rho \frac{\partial \mathcal{L}}{\partial (\partial_\rho g_{\mu\nu})} + \partial_{\rho}\partial_{\lambda}\frac{\partial\mathcal{L}}{\partial(\partial_{\rho}\partial_{\lambda}g_{\mu\nu})}<br />

I must say that I'm also sometimes confused between the different notations :)
 
  • #11
Hello,

jdstokes said:
Hang on, the Einstein field equations are

\frac{\partial \mathcal{L}}{\partial g_{\mu\nu}} = 0,

where \mathcal{L} = \sqrt{-g}(R/2\kappa + \mathcal{L}_{\rm matter})

this is a partial derivative, not a functional derivative. Compare with with electromagnetism:

\frac{\partial \mathcal{L}}{\partial A_\mu} - \partial_\nu \frac{\partial \mathcal{L}}{\partial \partial_\nu A_\mu} = 0

these are also partial derivatives.

The similarity is closer than it first looks... you can integrate (by parts) all of the second derivatives of g_{\mu\nu} in the E-H action, leaving only g_{\mu\nu} and its first partial derivatives. Then you can use Euler-Lagrange equations analogous to the electromagnetic case you've mentioned.

Just beware that the integration by parts breaks general covariance of the integrand (but obviously not the integral as a whole). I presume this is why it is rarely mentioned in introductory courses.

Dave
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 16 ·
Replies
16
Views
4K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K