Four Velocity in GR: Covariant Derivative Needed?

  • Context: Graduate 
  • Thread starter Thread starter GarageDweller
  • Start date Start date
  • Tags Tags
    Gr Velocity
Click For Summary

Discussion Overview

The discussion revolves around the treatment of four-velocity in General Relativity (GR) compared to Special Relativity (SR), specifically whether the covariant derivative is necessary for four-velocity and its implications for the proper time derivative along a world line.

Discussion Character

  • Debate/contested
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • Some participants question whether the covariant derivative is needed for four-velocity in GR, as it was defined as the proper time derivative in SR.
  • Others explain that the covariant derivative is a generalization of the vector derivative and discuss its relationship to derivatives with respect to a scalar parameter.
  • A participant mentions the absolute derivative and its connection to the four-acceleration, suggesting that it can be expressed using the covariant derivative along the world line.
  • There is a discussion about the distinction between four-vectors and scalars in curved spacetime, with some arguing that position is not a four-vector but a tuple of scalars.
  • Some participants express confusion about the application of certain equations, particularly regarding the relationship between the covariant derivative and the four-velocity.
  • One participant provides a detailed exploration of the connection coefficients in curvilinear coordinates, illustrating how they affect the derivatives of quantities defined along a path.
  • Another participant asserts that the definition of four-velocity should not involve circular definitions and emphasizes the need for clarity in the equations presented.

Areas of Agreement / Disagreement

Participants express differing views on the necessity and application of the covariant derivative for four-velocity in GR. There is no consensus on whether position can be treated as a four-vector in curved spacetime, and the discussion remains unresolved regarding the implications of these mathematical relationships.

Contextual Notes

Some participants highlight that the definitions and relationships discussed depend on the choice of coordinates and the nature of the quantities involved, indicating potential limitations in the assumptions made during the discussion.

GarageDweller
Messages
103
Reaction score
0
In SR, Four velocity was the proper time derivative of a world line, do we have to take the covariant derivative instead in GR?
 
Physics news on Phys.org
The covariant derivative is a generalization of the vector derivative. How and why exactly would this connect to the derivative with respect to a scalar parameter?

(They do connect, but I think you need to think this through to understand the question you're asking first.)
 
Through an absolute derivative? But that wouldn't make sense as the expression would involve the four velocity itself.

hmmm
 
Yes, relationship [itex]d/d\tau = v \cdot D[/itex] holds, and it's useful for when you know something as a function of position and don't want to go through to the dependence of the path on proper time.

That said, this is obviously not useful to calculate the proper time itself from [itex]x(\tau)[/itex], because you need to know the four-velocity first.
 
As pointed out in the previous thread, the covariant generalization of the proper time derivative is called the absolute derivative. For example the 4-acceleration is simply

Aμ = DUμ/Dτ

If the object being differentiated is a field (i.e. defined everywhere) you can express it as the covariant derivative projected along the world line,

DWμ/Dt = Wμ Uν

However it's more general than the covariant derivative, and can be applied to quantities that are defined only on the world line. In which case you write it instead explicitly using a Christoffel symbol, as was done in Wikipedia.
 
Last edited:
GarageDweller said:
In SR, Four velocity was the proper time derivative of a world line, do we have to take the covariant derivative instead in GR?
An event in spacetime is described by X = (ct, x, y, z). A displacement in spacetime is
dX = (cdt, dx, dy, dz). The 4-velocity is a displacement in spacetime divided by the propertime it took for the displacement to occur, i.e. U = dX/d[itex]\tau[/itex].

In terms of spacetime coordinates this reads Uα = dxα/d[itex]\tau[/itex]. If one had to find the 4-acceleration then you'd take the total derivative of the 4-velocity, i.e. A = dU/dτ. In spacetime coordinates this reads Aα = DUα/Dτ.

D/Dτ is not the covariant derivative, its called the derivative along the curve. Its related to the covariant derivative by

DAα/Dτ = Aμ dxβ/dτ

or

DAα/Dτ = dAμ/dτ + [itex]\Gamma^{μ}_{αβ}[/itex] Aα dxβ/dτ
 
I think nobody has really answered what I thought was the original question.

If [itex]Q[/itex] is any 4-vector defined along a path [itex]P(\tau)[/itex], then

[itex]\dfrac{DQ^{\mu}}{d\tau} = \dfrac{dQ^{\mu}}{d\tau} + \Gamma^{\mu}_{\nu \lambda} Q^{\nu} U^{\lambda}[/itex]

where [itex]U^{\mu}[/itex] is the velocity 4-vector.

The way I understood the original question was: what about the case [itex]Q^{\mu} = X^{\mu}[/itex]? (Where [itex]X^{\mu}[/itex] is the position 4-vector). Then it's definitely not the case that

[itex]\dfrac{DX^{\mu}}{d\tau} = \dfrac{dX^{\mu}}{d\tau} + \Gamma^{\mu}_{\nu \lambda} X^{\nu} U^{\lambda}[/itex]

But why not? The real fact is that position is not a 4-vector in curved spacetime. It's really just a 4-tuple of scalars.
 
stevendaryl said:
I think nobody has really answered what I thought was the original question.
I don't understand. That's what I just posted.

stevendaryl said:
But why not? The real fact is that position is not a 4-vector in curved spacetime. It's really just a 4-tuple of scalars.
A point in spacetime not a 4-vector but a spacetime displacemen is a 4-vector.
 
Boston_Guy said:
I don't understand. That's what I just posted.

Well, I don't see how your post explained why

[itex]\dfrac{DQ^{\mu}}{d \tau} = \dfrac{dQ^{\mu}}{d \tau} + \Gamma^{\mu}_{\nu \lambda} Q^{\nu} U^{\lambda}[/itex]

doesn't apply in the case [itex]Q^{\mu} = X^{\mu}[/itex]. It doesn't apply because [itex]X^{\mu}[/itex] is not a 4-vector.
 
  • #10
stevendaryl said:
Well, I don't see how your post explained why

[itex]\dfrac{DQ^{\mu}}{d \tau} = \dfrac{dQ^{\mu}}{d \tau} + \Gamma^{\mu}_{\nu \lambda} Q^{\nu} U^{\lambda}[/itex]

doesn't apply in the case [itex]Q^{\mu} = X^{\mu}[/itex]. It doesn't apply because [itex]X^{\mu}[/itex] is not a 4-vector.

It certainly applies when using curvilinear coordinates in flat space. Try it with cylindrical coordinates and see what you get.
 
  • #11
Chestermiller said:
It certainly applies when using curvilinear coordinates in flat space.
No. If you were right then you would be saying that the 4-velocity U is defined by the equation
[tex]U^\mu= \dfrac{dX^\mu}{d\tau} + \Gamma^\mu_{\nu \lambda} X^\nu U^\lambda[/tex]
That's a circular definition because U appears on both sides.
 
  • #12
Chestermiller said:
It certainly applies when using curvilinear coordinates in flat space. Try it with cylindrical coordinates and see what you get.

What do you mean "it certainly applies"? The definition of [itex]U^\mu[/itex] is just

[itex]U^\mu = \dfrac{dX^\mu}{d \tau}[/itex],

not

[itex]U^\mu = \dfrac{dX^\mu}{d \tau} + \Gamma^{\mu}_{\nu}{\lambda} U^\nu X^\lambda[/itex]

The latter is not a correct equation, and it's a circular definition of [itex]U^{\mu}[/itex], since that appears on both sides.

But since you suggested working with cylindrical coordinates, let me work out the latter. Let's have coordinates [itex]r[/itex] and [itex]\theta[/itex], which are defined in terms of cartesian coordinates [itex]x[/itex] and [itex]y[/itex] via

[itex]x = r cos(\theta)[/itex]
[itex]y = r sin(\theta)[/itex]

The connection coefficients [itex]\Gamma^{\mu}_{\nu \lambda}[/itex] turn out to be (derivation skipped)

[itex]\Gamma^{r}_{r r} = \Gamma^{r}_{r \theta} = \Gamma^{r}_{\theta r} = 0[/itex]
[itex]\Gamma^{r}_{\theta \theta} = -r[/itex]
[itex]\Gamma^{\theta}_{r r} = \Gamma^{\theta}_{\theta \theta} = 0[/itex]
[itex]\Gamma^{\theta}_{r \theta} = \Gamma^{\theta}_{\theta r} = \dfrac{1}{r}[/itex]

So the path-derivative, or whatever it is called is:

[itex]\dfrac{DQ^r}{dt} = \dfrac{dQ^r}{dt} - r Q^{\theta} U^{\theta}[/itex]

[itex]\dfrac{DQ^\theta}{dt} = \dfrac{dQ^\theta}{dt} + \dfrac{1}{r} Q^{\theta} U^{r} + \dfrac{1}{r} Q^{r} U^{\theta}[/itex]

In the particular case [itex]Q^\mu = X^{\mu}[/itex], with [itex]X^r = r[/itex] and [itex]X^\theta = \theta[/itex], this becomes:


[itex]\dfrac{Dr}{dt} = \dfrac{dr}{dt} - r \theta U^{\theta}[/itex]

[itex]\dfrac{D \theta}{dt} = \dfrac{d \theta}{dt} + \dfrac{1}{r} \theta U^{r} + \dfrac{1}{r} r U^{\theta}[/itex]
[itex]= \dfrac{d \theta}{dt} + \dfrac{1}{r} \theta U^{r} + U^{\theta}[/itex]

I don't know what those quantities are supposed to mean. Typically, the quantity [itex]X^{\mu}[/itex] is not a vector, and it doesn't make sense to take its covariant derivative, or to parallel-transport it.
 
  • #13
Stevens first post cleared it up for me, i don't see my why there need be this debate. So essentially dX is a vector, and X is not, hence dX/dt is really just the vector dX divided by dt.
 
Last edited:
  • #14
stevendaryl said:
What do you mean "it certainly applies"? The definition of [itex]U^\mu[/itex] is just

[itex]U^\mu = \dfrac{dX^\mu}{d \tau}[/itex],

not

[itex]U^\mu = \dfrac{dX^\mu}{d \tau} + \Gamma^{\mu}_{\nu}{\lambda} U^\nu X^\lambda[/itex]

The latter is not a correct equation, and it's a circular definition of [itex]U^{\mu}[/itex], since that appears on both sides.

But since you suggested working with cylindrical coordinates, let me work out the latter. Let's have coordinates [itex]r[/itex] and [itex]\theta[/itex], which are defined in terms of cartesian coordinates [itex]x[/itex] and [itex]y[/itex] via

[itex]x = r cos(\theta)[/itex]
[itex]y = r sin(\theta)[/itex]

The connection coefficients [itex]\Gamma^{\mu}_{\nu \lambda}[/itex] turn out to be (derivation skipped)

[itex]\Gamma^{r}_{r r} = \Gamma^{r}_{r \theta} = \Gamma^{r}_{\theta r} = 0[/itex]
[itex]\Gamma^{r}_{\theta \theta} = -r[/itex]
[itex]\Gamma^{\theta}_{r r} = \Gamma^{\theta}_{\theta \theta} = 0[/itex]
[itex]\Gamma^{\theta}_{r \theta} = \Gamma^{\theta}_{\theta r} = \dfrac{1}{r}[/itex]

So the path-derivative, or whatever it is called is:

[itex]\dfrac{DQ^r}{dt} = \dfrac{dQ^r}{dt} - r Q^{\theta} U^{\theta}[/itex]

[itex]\dfrac{DQ^\theta}{dt} = \dfrac{dQ^\theta}{dt} + \dfrac{1}{r} Q^{\theta} U^{r} + \dfrac{1}{r} Q^{r} U^{\theta}[/itex]

In the particular case [itex]Q^\mu = X^{\mu}[/itex], with [itex]X^r = r[/itex] and [itex]X^\theta = \theta[/itex], this becomes:


[itex]\dfrac{Dr}{dt} = \dfrac{dr}{dt} - r \theta U^{\theta}[/itex]

[itex]\dfrac{D \theta}{dt} = \dfrac{d \theta}{dt} + \dfrac{1}{r} \theta U^{r} + \dfrac{1}{r} r U^{\theta}[/itex]
[itex]= \dfrac{d \theta}{dt} + \dfrac{1}{r} \theta U^{r} + U^{\theta}[/itex]

I don't know what those quantities are supposed to mean. Typically, the quantity [itex]X^{\mu}[/itex] is not a vector, and it doesn't make sense to take its covariant derivative, or to parallel-transport it.

In cylindrical coordinates, the component of the position vector in the θ direction is zero, not Xθ. See what happens when you make this correction in your equations. You will find that the covariant derivative of the 4 position vector actually is the 4 velocity vector, particularly when you revert to the original form of the right hand side of the equation, where there are partial derivatives of the position vector components with respect to tau, rather than the components of the 4 velocity vector. We already know that the position vector in flat space is a 4 vector, since it transforms according to the Lorentz Transformation. In fact, that is how the Lorentz Transformation is usually presented for flat space.
 
  • #15
Chestermiller said:
In cylindrical coordinates, the component of the position vector in the θ direction is zero, not Xθ.

This might be a terminology issue about what is the "position" of an object. You are interpreting it to mean the displacement vector from the origin to the location of the object. A displacement vector certainly is a vector (well, in curved spacetime, that's only true in the limit as the displacement is small).

However, the whole point of a coordinate system is to be able to identify points in space by an n-tuple of coordinate values. In order to specify the location of an object in polar coordinates, you have to give two numbers: the distance [itex]r[/itex] from the origin, and the angle [itex]\theta[/itex] that one must pass through on a circle of radius [itex]r[/itex] centered at the origin until one gets to the object. So the location of the object is specified by the pair [itex](r,\theta)[/itex]. This pair is NOT a vector. However, the velocity vector [itex]U^{\mu}[/itex] created by differentiating the pair with respect to [itex]t[/itex] IS a vector: [itex]U^r = \dfrac{dr}{dt}[/itex], [itex]U^{\theta} = \dfrac{d \theta}{dt}[/itex]

As I said, it's a matter of terminology, I suppose, but I think it's pretty useless to describe the "position" of an object by saying "It's at a distance r from the origin along the [itex]\widehat{r}[/itex] direction. It's much more useful to say that the location is the n-tuple of coordinates needed to locate the object.
 
  • #16
Then what you're doing is not choosing a coordinate basis but an orthogonal (or orthonormal) basis. I think the mathematics (and the picture built on that) is a little different from the common choice of a coordinate basis.
 
  • #17
stevendaryl said:
This might be a terminology issue about what is the "position" of an object. You are interpreting it to mean the displacement vector from the origin to the location of the object. A displacement vector certainly is a vector (well, in curved spacetime, that's only true in the limit as the displacement is small).

However, the whole point of a coordinate system is to be able to identify points in space by an n-tuple of coordinate values. In order to specify the location of an object in polar coordinates, you have to give two numbers: the distance [itex]r[/itex] from the origin, and the angle [itex]\theta[/itex] that one must pass through on a circle of radius [itex]r[/itex] centered at the origin until one gets to the object. So the location of the object is specified by the pair [itex](r,\theta)[/itex]. This pair is NOT a vector. However, the velocity vector [itex]U^{\mu}[/itex] created by differentiating the pair with respect to [itex]t[/itex] IS a vector: [itex]U^r = \dfrac{dr}{dt}[/itex], [itex]U^{\theta} = \dfrac{d \theta}{dt}[/itex]

As I said, it's a matter of terminology, I suppose, but I think it's pretty useless to describe the "position" of an object by saying "It's at a distance r from the origin along the [itex]\widehat{r}[/itex] direction. It's much more useful to say that the location is the n-tuple of coordinates needed to locate the object.

Yes. I finally see what you have been trying to say regarding the notational ambiguity involved here. It is something I've known all along, but was not able to make the connection with your explanation. Here is my own spin on the subject, in terms of notation that I am used to:

Let s be a position vector drawn from an arbitrary origin to a point in flat space. If curvilinear coordinates are being used, then s can be expressed in component form in terms of coordinate basis vectors by:

s = si ai

where ai are the coordinate basis vectors.

But, in curvilinear coordinates, the si are not equal to the coordinates themselves:

si ≠ Xi

They are functions of the coordinates:

si = si (X0, ... , Xn)

Unlike the spatial position vector s, the differential position vector between two neighboring points ds can be expressed linearly in terms of the differentials of the spatial coordinates according to:

ds = dXi ai

From this, it follows immediately that the 4 velocity is given by:

U = [itex]\frac{d<b>s</b>}{dτ}[/itex] = [itex]\frac{dX<sup>i</sup>}{dτ}[/itex] ai

Yiiiii! I HATE LATEX

It is also possible to obtain this same result using the covariant differentiation formula, starting from the equations:

s = si ai

and si = si (X0, ... , Xn)

specifically for the particular coordinate system involved. We already showed this in our earlier posts for the case of cylindrical coordinates.

I hope this gives us some closure on the subject.

Chet
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 34 ·
2
Replies
34
Views
4K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 16 ·
Replies
16
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 16 ·
Replies
16
Views
3K