Covariant Derivative - where does the minus sign come from?

unscientific · Apr 20, 2015

I was reading through hobson and my notes where the covariant acts on contravariant and covariant tensors as
[tex]\nabla_\alpha V^\mu = \partial_\alpha V^\mu + \Gamma^\mu_{\alpha \gamma} V^\gamma[/tex]
[tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu - \Gamma^\gamma_{\alpha \mu} V_\gamma[/tex]

Why is there a minus sign in the second equation?

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
[tex]\nabla_\alpha V_{\gamma} = \partial_\alpha V_\gamma + \Gamma^\mu_{\alpha \gamma} V_\mu[/tex]
Swapping ##\mu## and ##\gamma## we have
[tex]\nabla_\alpha V_{\mu} = \partial_\alpha V_\mu + \Gamma^\gamma_{\alpha \mu} V_\gamma[/tex]

PeterDonis · Apr 20, 2015

unscientific said:

Why is there a minus sign in the second equation?

Because of the different transformation laws that are obeyed by vectors and covectors, i.e., by upper and lower index quantities.

Dale · Apr 20, 2015

unscientific said:

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]

I don't know the answer to your question, but this part is wrong. You cannot have one upstairs and two downstairs repeated indexes.

unscientific · Apr 20, 2015

I tried to look up a derivation, but couldn't find it anywhere, so I'm asking if anyone has seen a proof or working that starts from the first equation and shows how the minus sign appears.

JorisL · Apr 20, 2015

Carroll's notes seem to treat this in quite some detail.
http://preposterousuniverse.com/grnotes/grnotes-three.pdf

I haven't thoroughly checked them since I used the book that came from those.

However I suspect these show even more steps if memory serves me.

P.S. try checking those and come back if you have problems

[edit] added link

Mentz114 · Apr 20, 2015

JorisL said:

Carroll's notes seem to treat this in quite some detail.
http://preposterousuniverse.com/grnotes/grnotes-three.pdf

I haven't thoroughly checked them since I used the book that came from those.

However I suspect these show even more steps if memory serves me.

P.S. try checking those and come back if you have problems

[edit] added link

Good answer. Carrolls explanation is very well put together.

Matterwave · Apr 20, 2015

unscientific said:

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]
[tex]\nabla_\alpha V_{\gamma} = \partial_\alpha V_\gamma + \Gamma^\mu_{\alpha \gamma} V_\mu[/tex]

Notice that there is a summation for ##\gamma## on the vector and Christoffel symbols in the original formula (as Dalespam said, this is invalid). Multiplying by ##g_{\mu\gamma}## will not work. What you can do is introduce another index and multiply by ##g_{\mu\tau}## For the second term though, you actually lower the Christoffel symbol index instead of the vector index so you will get: [tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu +\Gamma_{\mu\alpha\gamma}V^\gamma[/tex]

As you can see, now you are dealing with a Christoffel symbol of the first kind (##\Gamma_{\mu\nu\gamma}\equiv g_{\mu\tau}\Gamma^{\tau}_{~~\nu\gamma}##) instead of a Christoffel symbol of the second kind that we are usually more familiar with.

EDIT: WARNING: There's an error in my first term.

Orodruin · Apr 21, 2015

For some intuition, I usually first try to tell students how it works in curvilinear coordinates on the flat ##\mathbb R^n##. Given a set of curvilinear coordinates ##x^\mu## we can define two sets of basis vectors ##\vec E_\mu = \partial \vec r /\partial x^\mu## and ##\vec E^\mu = \nabla x^\mu## (where ##\nabla## is the gradient and ##x^\mu## a coordinate function). These bases are neither orthogonal nor normalised, but they do obey ##\vec E_\nu \cdot \vec E^\mu = \delta^\mu_\nu## and will generalise into the coordinate bases for vectors and covectors. The Christoffel symbols are given by how the basis changes with respect to the coordinates (since this is ##\mathbb R^n##, we can compare vectors at different points without problem) ##\vec E^\sigma\cdot \partial_\mu \vec E_\nu = \Gamma^\sigma_{\mu\nu}##, i.e., ##\partial_\mu \vec E_\nu = \Gamma^\sigma_{\mu\nu} \vec E_\sigma##. But at the same time we have
$$
0 = \partial_\mu \vec E^\sigma \cdot \vec E_\nu = \vec E^\sigma \cdot \partial_\mu \vec E_\nu + \vec E_\nu \cdot \partial_\mu \vec E^\sigma = \Gamma^\sigma_{\mu\nu} + \vec E_\nu \cdot \partial_\mu \vec E^\sigma.
$$
It follows that ##\vec E_\nu \cdot \partial_\mu \vec E^\sigma = - \Gamma^\sigma_{\mu\nu}##, or in other terms ##\partial_\mu \vec E^\sigma = -\Gamma^\sigma_{\mu\nu} \vec E^\nu## and there you have your minus sign.

This generalises readily to curved spaces and the absolutely easiest way to see that it must be like that is to apply the covariant derivative to the scalar field ##V_\mu W^\mu## for which it reduces to a partial derivative. At the same time, the covariant derivative should obey the product rule and we would have
$$
\nabla_\sigma(V_\mu W^\mu) = (\nabla_\sigma V_\mu) W^\mu + V_\mu \nabla_\sigma W^\mu = W^\mu \nabla_\sigma V_\mu + V_\mu \partial_\sigma W^\mu + \Gamma^\mu_{\rho\nu} V_\mu W^\rho = V_\mu \partial_\sigma W^\mu + W^\mu \partial_\sigma V_\mu.
$$
The only way of satisfying this for arbitrary vector fields is if
$$
\nabla_\sigma V_\mu = \partial_\sigma V_\mu - \Gamma^\rho_{\mu\nu} V_\rho.
$$

samalkhaiat · Apr 24, 2015

unscientific said:

I was reading through hobson and my notes where the covariant acts on contravariant and covariant tensors as
[tex]\nabla_\alpha V^\mu = \partial_\alpha V^\mu + \Gamma^\mu_{\alpha \gamma} V^\gamma[/tex]
[tex]\nabla_\alpha V_\mu = \partial_\alpha V_\mu - \Gamma^\gamma_{\alpha \mu} V_\gamma[/tex]

Why is there a minus sign in the second equation?

I mean, if you simply multiply ##g_{\mu \gamma}## on both side of the equation:
[tex]\nabla_\alpha V^\mu g_{\mu \gamma} = \partial_\alpha V^\mu g_{\mu \gamma} + \Gamma^\mu_{\alpha \gamma} V^\gamma g_{\mu \gamma}[/tex]

1) You need to be careful with the indices.

2) It is true that [tex]\left( \nabla_{\alpha} V^{\mu}\right) g_{\mu \tau} = \nabla_{\alpha} \left( V^{\mu} g_{\mu \tau} \right) = \nabla_{\alpha} V_{\tau} ,[/tex] because [itex]\nabla_{\alpha} g_{\mu \nu} = 0[/itex].

3) But [tex]\partial_{\alpha} ( V^{\mu} g_{\mu \tau} ) \neq ( \partial_{\alpha} V^{\mu} ) g_{\mu \tau} = \partial_{\alpha} ( V^{\mu} g_{\mu \tau} ) - V^{\mu} \partial_{\alpha} g_{\mu \tau} .[/tex] So, when you contract with [itex]g_{\mu \tau}[/itex], you get [tex]\nabla_{\alpha} V_{\tau} =\partial_{\alpha} V_{\tau} + \left( g_{\mu \tau} \Gamma^{\mu}_{\alpha \gamma} - \partial_{\alpha} g_{\tau \gamma} \right) g^{\gamma \beta} V_{\beta} .[/tex] Now the minus sign show up, because [tex]g^{\gamma \beta} \left( g_{\mu \tau} \Gamma^{\mu}_{\alpha \gamma} - \partial_{\alpha} g_{\tau \gamma} \right) = - \Gamma^{\beta}_{\alpha \tau} .[/tex]

Matterwave · Apr 26, 2015

Ah whoops, Sam's answer is correct. I was too careless in mine. Please disregard. Thanks. :)

Covariant Derivative - where does the minus sign come from?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Undergrad Why is gravity a fictitious force?

Undergrad Relativistic Space Travel: Optimizing Proper Time [Project Hail Mary]

Undergrad KE of rotating disc

Undergrad Why is the Lorentz Force always perpendicular to velocity?

Graduate How valid is the Block Universe theory?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect