Is Parallel Transport Invariance Maintained in General Relativity Calculations?

Click For Summary

Discussion Overview

The discussion revolves around the concept of parallel transport in the context of general relativity, specifically examining whether the dot product of two vectors remains invariant during parallel transport along a geodesic. Participants explore the mathematical implications and definitions related to metric compatibility and covariant derivatives.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant attempts to show that the time derivative of the dot product of two vectors is zero, questioning the validity of their approach.
  • Another participant clarifies that the vectors are parallel transported along a geodesic and emphasizes the goal of demonstrating the invariance of the dot product under parallel transport.
  • Concerns are raised about the interpretation of metric compatibility and how it relates to parallel transport, with some participants expressing confusion about the definitions involved.
  • Several participants discuss the implications of the covariant derivative and its relationship to the total derivative, noting that the distinction between different types of derivatives can lead to misunderstandings.
  • One participant acknowledges that their initial approach was overly complicated and expresses gratitude for the simpler proof presented by others.
  • There is a discussion about the notation used for derivatives, with participants clarifying the meanings of different symbols and their implications in the context of parallel transport.

Areas of Agreement / Disagreement

Participants express varying levels of understanding regarding the definitions and implications of parallel transport and metric compatibility. While some participants find clarity in the discussion, others remain confused about certain aspects, indicating that the discussion does not reach a consensus.

Contextual Notes

Participants highlight the potential for confusion arising from the notation used for derivatives and the assumptions made regarding the nature of the vectors and their transport. The discussion also touches on the mathematical rigor required to prove the invariance of the dot product, suggesting that some steps may be dependent on specific definitions or interpretations.

bueller11
Messages
17
Reaction score
0
I'm trying to show that \frac{d}{dt}\; g_{\mu \nu} u^{\mu} v^{\nu} = 0 in the context of parallel transport (or maybe not zero), and I'm rather insecure about the procedure. This is akin to problem 3.14 in Hobson's et al. book (General Relativity an introduction for physicists).

As a guess, I tried the take the time derivative:
\frac{d}{dt}\; g_{\mu \nu} u^{\mu} v^{\nu}+g_{\mu \nu} \dot{u^{\mu}} v^{\nu}+g_{\mu \nu} u^{\mu} \dot{v^{\nu}}=0

I was assuming a stationary metric, so the first part would be zero, leaving

g_{\mu \nu} \dot{u^{\mu}} v^{\nu}+g_{\mu \nu} u^{\mu} \dot{v^{\nu}}=0

From there I can substitute in for \dot{u^{\mu}} and \dot{v^{\nu}}.

Is this the right path to take? It seems there's then some index trickery involved to solve this.

Thanks!
 
Physics news on Phys.org
Did I post this in the wrong forum?

Some more info: vectors u and v are parallel transported along a geodesic, the point is to show that the dot product is invariant under parallel transport.

Also, this isn't homework, but it was a question asked on a midterm.
 
I'm a bit confused how general relativity treats this. It seems that this is the very definition of metric-compatible connection and parallel transport.
 
That was similar to my thought. I was thinking: well, we are only including the things that make this true so what's the point in showing it? As a result, I was thinking it was more of a mathematical question about topology and manifolds and I lacked understanding. In other words, it seemed that it's a mathematical argument not a physical one and I don't understand it. Overall, I'm really confused.

But what I was trying... Parallel transport says the covariant derivative is zero along a curve parameterized by, in this case, t. That means for a generic vector v,

\frac{dv^{a}}{du} = -\Gamma^{a}_{bc}v^{b} \frac{dx^{c}}{du},

And I can rewrite the equations to be:

g_{\mu \nu} (-\Gamma^{\mu}_{\beta \sigma}u^{\beta} \dot{dx^{\sigma}}) v^{\nu}+g_{\mu \nu} u^{\mu} (-\Gamma^{\nu}_{\beta \sigma}v^{\beta} \dot{dx^{\sigma}})=0

Is that true? This doesn't seem like it's going anywhere.
 
The definition of a metric-compatible connection is that

\frac{d}{dt} <X,Y> = <D_tX,Y> + <X,D_tY>

By the definition of parallel transport, the right-hand side is zero. You're on the right track, but you don't have to assume a stationary metric.
 
zhentil said:
The definition of a metric-compatible connection is that

\frac{d}{dt} <X,Y> = <D_tX,Y> + <X,D_tY>

By the definition of parallel transport, the right-hand side is zero. You're on the right track, but you don't have to assume a stationary metric.

Do you mean:

D_t <X,Y> = <D_tX,Y> + <X,D_tY> ?

Then this is just the Leibniz rule and there is no need for metric compatibility.

In all the standard physics books the proof goes something like this:

\partial_k (V^iU_i)=\partial_k (V^i) U_i+V^i\partial_k (U_i)

parallel transport implies:

D_kV^i=\partial_k V^i+\Gamma V=0
D_k U_i=\partial_k U_i-\Gamma U=0

where the indices on second term on the RHS have been suppressed.

If you solve for the partial derivative of the vector in terms of gamma, and then plug that into the above equation, you get it's equal to zero.
 
Ok, I got it. It's so much simpler than I was making it. For some reason it didn't connect to drop an index so I could get the signs to change in the covariant derivative then to the math.

Thank you so much zhentil and RedX!
 
RedX said:
Do you mean:

D_t <X,Y> = <D_tX,Y> + <X,D_tY> ?

Then this is just the Leibniz rule and there is no need for metric compatibility.
That's the definition of metric compatibility. You can easily construct connections that don't obey this rule. I'm not sure why you've drawn a distinction between D_t<X,Y> and \frac{d}{dt} <X,Y>. By definition, a covariant derivative acts like a normal derivative on functions.
 
zhentil said:
I'm not sure why you've drawn a distinction between D_t<X,Y> and \frac{d}{dt} <X,Y>. By definition, a covariant derivative acts like a normal derivative on functions.

You're right. Your proof is much simpler than the one my instructor gave. In fact, the proof is practically tautological.

The reason I made a distinction is because usually little d means a coordinate derivative and big D means a covariant derivative, and the person wrote it with a little d. I forgot that when acting on a scalar, they are the same. So that is why I wasn't able to come up with your simple proof. Once you note that little d acting on a scalar is the same as big D acting on a scalar, then the proof is just automatic.
 
  • #10
RedX said:
The reason I made a distinction is because usually little d means a coordinate derivative and big D means a covariant derivative, and the person wrote it with a little d. I forgot that when acting on a scalar, they are the same. So that is why I wasn't able to come up with your simple proof. Once you note that little d acting on a scalar is the same as big D acting on a scalar, then the proof is just automatic.

Little d/dt means total derivative; a total derivative with respect to a parameter along a path is generally assumed to mean the total covariant derivative.

The coordinate derivative is \partial/\partial x where x is some coordinate (NOT a parameter along a path).
 
  • #11
Ben Niehoff said:
Little d/dt means total derivative; a total derivative with respect to a parameter along a path is generally assumed to mean the total covariant derivative.

The coordinate derivative is \partial/\partial x where x is some coordinate (NOT a parameter along a path).

Yeah. I got confused because of the overloading of the parameter t. So it'd be:

t=t
x,y,z=x(t),y(t),z(t)

instead of something like:

t,x,y,z={t(s)=s},x(s),y(s),z(s)

Also, it is true that for parallel transport:

\partial_k <X,Y>=0

but I guess that's just a path with a direction only on the kth-coordinate basis.

So the total derivative along a path parametrized by s would be:

\frac{dx^k}{ds}D_k X for a vector X and for a scalar you can replace D with a partial derivative.

I think I got it now.

btw, I have written in my notes:

<df,X>=Xf
<dxj,X>=\delta^{i}_j

where X is a basis vector (an operator). Is that a general rule, that you ignore the 'd' in the first term of the angled brackets, and let X operate on the term next to the 'd'? What would <f,X> mean (i.e., without a 'd' in front of the f)?
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
943
  • · Replies 21 ·
Replies
21
Views
1K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 17 ·
Replies
17
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 9 ·
Replies
9
Views
1K
  • · Replies 4 ·
Replies
4
Views
5K