Issues with the variation of Christoffel symbols

Click For Summary

Discussion Overview

The discussion revolves around the variation of Christoffel symbols and the nature of the variation of the metric tensor, specifically addressing whether the variation of the metric, denoted as δgμν, can be considered a tensor. Participants explore the implications of this variation in the context of covariant derivatives and the mathematical properties of tensors.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants assert that δgμν represents the variation of the components of the metric tensor, while others argue it is not a tensor due to its dependence on the metric itself.
  • One participant suggests that the covariant derivative of δgμν can be interpreted as a tensor, leading to the equation ∇ρδgμν = ∂ρδgμν - Γλρμδgλν - Γλρνδgμλ.
  • Another participant points out that δgμν can be expressed as the difference between two metric tensors in the same coordinate system, which would imply it is a tensor.
  • Some participants highlight the conflicting interpretations of δgμν, referencing established equations that suggest both tensorial and non-tensorial properties.
  • There is a discussion about the relationship between the variation of the metric and its inverse, with some noting that δ(g^{-1}) and δg are related but not identical.
  • Concerns are raised about the treatment of basis vectors during variations, questioning why they remain unchanged under certain conditions.

Areas of Agreement / Disagreement

Participants express differing views on whether δgμν is a tensor, with some supporting its tensorial nature and others contesting this interpretation. The discussion remains unresolved, with multiple competing perspectives on the nature of the variation and its implications.

Contextual Notes

Participants note that the distinction between δgμν and (δg)μν is significant, with implications for how variations are treated mathematically. The discussion also touches on the assumptions regarding the constancy of basis vectors during variations, which remains a point of contention.

JuanC97
Messages
48
Reaction score
0
Hello everyone,
I'm sure a lot of you know that the Christoffel symbols are not tensors by themselves but, their variation is a tensor.
I want to revive a post that was made in 2016 about this: The Variation of Christoffel Symbol and ask again "How is that you can calculate ∇ρδgμν if δ{gμν} is not a tensor?"

You know that δgμν can be written as (- gμα gνβ δgαβ) since δ{δμν}=0 so... at first glance, you could say that δgμν is not a tensor since you can't lower its indices just by using two metrics BUT the truth is that those indices are not the indices of the variation so you can't just think like that, it is, in fact, a matter of notation as I will explain now:

As far as I've tought, δgμν stands for "the variation of the components" of the metric ( i.e. δ{gμν} )
and that's really different from "the components of the variation" ( i.e. (δg)μν - which, of course, would be tensorial since that's just the difference of two metrics ), take this into account:

δg = δ{gμνdxμ⊗dxν} = δ{gμν}dxμ⊗dxν + gμνδ{dxμ⊗dxν} so (δg)μν is, in general different from δ{gμν}.

Now...maybe the problem is just about notation...
The issue that really makes me think twice about it is... What does it really mean that covariant derivative of δgμν that you found in the formula for the variation of the Christoffel symbols? and, even more, how can you interpret that equation in the proper form?

Could you please rephrase ∇ρδgμν in a different notation or in words, or maybe interpreting as multilinear maps?.

Thanks btw.
 
Last edited:
Physics news on Phys.org
UPDATE: I've been reading some thing and made some conclusions, help me to understand this please.
https://anhngq.wordpress.com/2010/0...ariant-derivative-of-tensors-in-a-short-form/.

First of all, by ∇ρδgμν we refer to the scalar that results from the evaluation of (∇ρδg)(∂μ,∂ν).
Then, you should interpret (∇ρδg)(∂μ,∂ν) as the covariant derivative of the metric tensor along ∂ρ evaluated in the basis given by ∂μ and ∂ν .

Using the properties of the covariant derivative you can find that (∇ρδg)(∂μ,∂ν) is given by
ρδg(∂μ,∂ν) - δg(∇ρμ,∂ν) - δg(∂μ,∇ρν)
= ∂ρδg(∂μ,∂ν) - δg( Γλρμλ,∂ν) - δg(∂μ, Γλρνλ)
= ∂ρδg(∂μ,∂ν) - Γλρμδg( ∂λ,∂ν) - Γλρνδg(∂μ,∂λ)
= ∂ρδgμν - Γλρμδgλν - Γλρνδgμλ

The conclusion is that ∇ρδgμν = ∂ρδgμν - Γλρμδgλν - Γλρνδgμλ has to be interpreted as the components of ∇ρδg so... this is a tensor (?)

Hmmm, and... what would happen if I use the same reasoning with δg(∂μ,∂ν) = δgμν ?
As I see it, δg(∂μ,∂ν) stands for the components of δg so δgμν is a tensor (?).
 
Just my 2 cents: the variation of the metric is the difference between two metric tensors in the same coordinate system and hence a tensor. Take e.g. the Lie derivative of the metric as explicit example.
 
haushofer said:
Just my 2 cents: the variation of the metric is the difference between two metric tensors in the same coordinate system and hence a tensor. Take e.g. the Lie derivative of the metric as explicit example.

I agree with you when you said that (δg) is a tensor but we have to remember that we also have this two equations from the literature that are always used when computing variations:

(i) (∇ρδgμν) = ∂ρδgμν - Γλρμδgλν - Γλρνδgμλ
states that δgμν is the component of a tensor and this relation can be deduced from its interpretation as (δg)μν.
(ii) δgμν = - gμα gνβ δgαβ
states that δgμν is not the component of a tensor and this relation can be deduced from its interpretation as δ{gμν}.

Also, taking into account that
δg = δ{gμνdxμ⊗dxν} = δ{gμν}dxμ⊗dxν + gμνδ{dxμ⊗dxν}
and the second term in the right-hand side is, in general, different from 0, it's clear that (δg)μν ≠ δ{gμν} so both interpretations are, in principle, uncompatible. The question is then... Why we usually use both of this formulas?, maybe I am misreading something?.
 
It seems to me that there is no real distinction between ##(\delta \boldsymbol{g})_{\mu \nu}## and ##\delta g_{\mu \nu}##. As you say,

##\boldsymbol{g} = g_{\mu \nu} dx^\mu \otimes dx^\nu##

So if we vary ##\boldsymbol{g}##, you get:

##\delta \boldsymbol{g} = \delta g_{\mu \nu} dx^\mu \otimes dx^\nu + g_{\mu \nu} \delta(dx^\mu \otimes dx^\nu)##

But you're not varying your coordinate system, so ##dx^\mu## isn't varying. So the last term is zero.
 
  • Like
Likes   Reactions: JuanC97
I think the confusion is that ##g^{\alpha \beta}## should really be written as ##(g^{-1})^{\alpha \beta}##. So both ##\delta \boldsymbol{g}## and ##\delta (\boldsymbol {g}^{-1})## are tensors, but they are different tensors. They are related by:

##\delta (\boldsymbol{g}^{-1})^{\mu \nu} = - g^{\mu \alpha} g^{\nu \beta} \delta \boldsymbol {g}_{\alpha \beta}##

In other words, ##\delta (\boldsymbol{g}^{-1})_{\mu \nu} \neq \delta \boldsymbol {g}_{\mu \nu}##. They are actually the negatives of each other.
 
  • Like
Likes   Reactions: JuanC97
stevendaryl said:
In other words, ##\delta (\boldsymbol{g}^{-1})_{\mu \nu} \neq \delta \boldsymbol {g}_{\mu \nu}##. They are actually the negatives of each other.

Did you mean to write the indexes on ##\delta (\boldsymbol{g}^{-1})_{\mu \nu}## as lower indexes here? They were upper indexes in your earlier formula.
 
PeterDonis said:
Did you mean to write the indexes on ##\delta (\boldsymbol{g}^{-1})_{\mu \nu}## as lower indexes here? They were upper indexes in your earlier formula.

Peter, he is saying that ##\, (\delta\boldsymbol{g^{-1}}) \,## is a tensor, hence:
## (\delta\boldsymbol{g^{-1}})^{\mu \nu} \equiv g^{\mu \alpha} g^{\nu \beta} (\delta\boldsymbol{g^{-1}})_{\alpha \beta}##

but we've found that
## \delta\{ \delta_\mu^\nu \} = \delta\{ g_{\mu\alpha} (\boldsymbol{g^{-1}})^{\alpha\nu} \} = 0
\;\Leftrightarrow\;
(\delta\boldsymbol{g^{-1}})^{\mu \nu} = - g^{\mu \alpha} g^{\nu \beta} \delta g_{\alpha \beta} \hspace{0.3cm}##, so ##\,\, (\delta\boldsymbol{g^{-1}})_{\alpha \beta} = - \delta g_{\alpha \beta}##

I agree with this point of view :smile: but I'm still dealing with the argument of ## \delta\{ dx^\mu \} = 0 ## and ## \delta\{ \partial_\mu \}=0 ## since, I think about ## \delta ## as a general variation (it could be a variation due to a change in coordinates or anything else) which makes it difficult to me to understand why the basis (co-)vectors would remain unchanged under any kind of variation.
 
Last edited:
PeterDonis said:
Did you mean to write the indexes on ##\delta (\boldsymbol{g}^{-1})_{\mu \nu}## as lower indexes here? They were upper indexes in your earlier formula.

No, I meant for them to be lowered. if ##\delta(\boldsymbol g^{-1})## is a tensor, then we can use ##g## to raise and lower its indices. So

##(\delta(\boldsymbol g^{-1}))_{\mu \nu} \equiv g_{\mu \alpha} g_{\nu \beta} (\delta(\boldsymbol g^{-1}))^{\alpha \beta} = - (\delta \boldsymbol{g})_{\mu \nu}##
 
  • #10
JuanC97 said:
I agree with this point of view :smile: but I'm still dealing with the argument of ## \delta\{ dx^\mu \} = 0 ## and ## \delta\{ \partial_\mu \}=0 ## since, I think about ## \delta ## as a general variation (it could be a variation due to a change in coordinates or anything else) which makes it difficult to me to understand why the basis (co-)vectors would remaining unchanged under any kind of variation.

You can certainly allow more general variations. But in the context of variational principles in deriving Einstein's field equations, the way that I think of it is this:
  • You have a candidate metric, ##\boldsymbol{g}##
  • You consider a second tensor, ##\boldsymbol{\bar{g}}## that is infinitesimally different than ##\boldsymbol{g}##
  • You subtract them to get ##\delta \boldsymbol{g} \equiv \boldsymbol{\bar{g}} - \boldsymbol{g}##
  • You calculate the effect on the action to first-order in ##\delta \boldsymbol{g}##.
  • Your candidate ##\boldsymbol{g}## is the real metric if the first-order variation vanishes.
 
  • Like
Likes   Reactions: JuanC97
  • #11
In the euler lagrange eqns you only consider functional changes, not coordinate ones. That's why variations and partial derivatives commute. In the Lie derivative you consider the difference between two tensors in the same coordinate system. I can't think of an explicit example of more general ones which I have encountered.
 

Similar threads

  • · Replies 19 ·
Replies
19
Views
2K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 16 ·
Replies
16
Views
4K
  • · Replies 19 ·
Replies
19
Views
6K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 5 ·
Replies
5
Views
4K