Alternative definitions of geodesic

PAllen · Oct 18, 2015

Here is what is true within some small neighborhood of two sufficiently close points p and q within a Lorentzian 4-manifold:

1) Spacelike paths (not geodesics) connecting p and q may have any length > 0 and < infinity (yes, even in a tiny neighborhood).
2) If any timelike path can connect p and q, there is a maximal timelike path within the neighborhood and it is the unique geodesic.
3) If no timelike or null path can connect p and q, then there is a unique spacelike geodesic defined by stationary variation, but not by any extremal property. [obviously, it could also be defined by parallel transport, but the original spirit of this thread was to explore extremal or at least variational definitions)].
4) Otherwise, p and q are connected by a unique null geodesic. I have never been satisfied with variational definitions of null geodesics, for this case I would always use parallel transport definition.

Dale · Oct 18, 2015

@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?

bcrowell · Oct 18, 2015

DaleSpam said:

@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?

Shortest distance (maximal time). All I really need for my talk is timelike geodesics, so I think it works. I'll probably give out a handout with some of the technical details on it, so that I don't have to run through all of that in the talk itself. For my purposes, the difficult points that I really want to concentrate on and get at rigorously are the definition of a singularity in terms of geodesic completeness, and the distinction between a curvature singularity and a non-curvature singularity. The definition of a geodesic is just some preliminary apparatus, so I think relegating its details to a handout will be fine.

BTW, thanks, everyone, for a very helpful discussion!

andrewkirk · Oct 21, 2015

bcrowell said:

I think this can be strengthened quite a bit. For points spacelike in relation to one another and sufficiently close together, I think the geodesic between them has vanishing variation. That is, if you move points on the geodesic by epsilon, the length changes by something on the order of epsilon squared. (Of course this is not a rigorous formulation, e.g., you need some machinery to define how you measure epsilon.)

PAllen said:

Vanishing variation, yes, but still no extremal property.

I've been thinking about this. I had to brush up on my calculus of variations to do so.

I am doubtful that the variation functional is vanishing, in the sense that is used in calculus of variations to identify a necessary but not sufficient condition for a function being an extremum of a functional.

Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes. The functions would use x as parameter and give a result (y,z,t). That basis wouldn't span curves that backtrack in the x dimension but for now I'm assuming that doesn't affect the argument. The result of applying the variation of the length functional to these basis functions will be positive for paths in the x-y and y-z planes and negative for paths in the x-t plane. So the variation functional must be nonzero.

In fact, I think that the spacelike geodesic will be the functional equivalent of a saddle point in ordinary calculus.

What we can say (I think) is that the geodesic is a minimum for paths for which the time coordinate is constant (provided the geodesic has constant time parameter. It may not, and if it doesn't then it will be more complex to characterise the set of paths over which it is a minimum), and a maximum for paths for which the y and z coordinates are constant. This is like how for a saddle point in calculus we can say that the surface gives a maximum value of z in the x direction and a minimum value of z in the y direction. For the saddle point, ##\frac{\partial z}{\partial x}=\frac{\partial z}{\partial y}=0;\ \frac{\partial^2 z}{\partial x^2}>0## and ##\frac{\partial^2 z}{\partial y^2}<0##.

PAllen · Oct 21, 2015

A saddle point has vanishing variation. In fact, a generalized saddle point is all that vanishing variation guarantees, at least in the terminology of my (math, not physics) book on calculus of variations.

andrewkirk · Oct 21, 2015

In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.

PAllen · Oct 21, 2015

andrewkirk said:

In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.

The book I am using is a monograph by Gilbert Ames Bliss. What you describe is similar to part of his presentation, except that he does define admissibility of h. The h functions need only be continuous, meet boundary conditions, and be c2 smooth except for a finite number of points. If, without assuming anything specific about h, the derivative of the integral with respect to the parameter applied to it is zero, it is said the variation is zero. This is equivalent to satisfying the Euler Lagrange equations. Spacelike geodesics clearly satisfy these equations, so they have stationary (zero) variation. However, since they are not locally extremal, they are some type of saddle point.

bcrowell · Oct 22, 2015

andrewkirk said:

Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes.

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

stevendaryl · Oct 22, 2015

bcrowell said:

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?

andrewkirk · Oct 22, 2015

bcrowell said:

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

Re the coverage by a single chart: only locally. The intended context of the comment is two spacelike separated points inside the same geodesic ball. That fits in with the overall picture via the finite set of points ##h_k## you posed in the OP that break up the full geodesic. If the points are close enough together, each pair can be in a geodesic ball.

The vector space to which I referred is an infinite-dimensional vector space of functions, not a 4D one of spacetime directions. Each element of the vector space denotes a path (not necessarily geodesic) between the two points. We choose a local coordinate system in which the ##x## two points have coordinates ##(a^t,a^x,a^y,a^z)## and ##(a^t,b^x,a^y,a^z)##. Then the vector space mentioned is the set of suitably nice (ie continuous etc) functions ##f## from the real interval ##[a^x,b^x]## to ##\mathbb{R}^3## such that ##f(x')## is the ##t, y, z## coordinates of the point on the path that has ##x## coordinate ##x'##. The functional being considered is ##f\mapsto L(\pi^{-1}\circ g_f([a^x,b^x]))## where

##g_f## is the function ##x\mapsto (x,f(x))##
##\pi## is the coordinate map for the geodesic ball
##L## is the length function for paths in ##M##

I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.

stevendaryl · Oct 23, 2015

andrewkirk said:

I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.

Are you defining geodesic to mean a parametrized path satisfying the geodesic equation (or the definition in terms of parallel transport, which is mathematically equivalent for a connection defined in terms of the metric)? In that case, doesn't the usual Lagrange-Euler equations imply that (spacelike or timelike) geodesics have zero variation?

If we consider a parametrized curve x^\mu(s), then the invariant length of the curve is given by \int ds \sqrt{g_{\mu \nu} U^\mu U^\nu}, where U^\mu = \frac{dx^\mu}{dx^\nu}.If you treat this like an action integral, with lagrangian L = \sqrt{g_{\mu \nu} U^\mu U^\nu}, then the Lagrange-Euler equations for a path with zero variation leads to

\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0

After reparametrizing to use an affine parameter, this becomes the geodesic equation.

I made the restriction to spacelike or timelike because the derivation of Euler-Lagrange equations breaks down if the path is a null path. That's because the quantity

\dfrac{\partial L}{\partial U^\mu} = \dfrac{g_{\mu \nu} U^\nu}{L}

is undefined whenever L = 0, which is always the case for null paths.

bcrowell · Oct 23, 2015

stevendaryl said:

Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?

Yes. As written, that sounds like a much weaker condition that the definition I proposed in #1, but I assume they're actually equivalent.

Allin · Oct 23, 2015

Geodesics is land surveying at the scale of the whole planet.

DrGreg · Oct 23, 2015

Allin said:

Geodesics is land surveying at the scale of the whole planet.

I think you mean "geodesy" or "geodetics". That's not what we are talking about here.

andrewkirk · Oct 23, 2015

stevendaryl said:

the Lagrange-Euler equations for a path with zero variation leads to

\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0

After reparametrizing to use an affine parameter, this becomes the geodesic equation.

That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?

stevendaryl · Oct 23, 2015

andrewkirk said:

That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?

It's easy to get into a blind alley, but I don't think this is that bad.

With L = \sqrt{g_{\mu \nu} U^\mu U^\nu},

\dfrac{\partial}{\partial U^\mu} L = \dfrac{1}{L} g_{\mu \nu} U^\nu
\dfrac{\partial}{\partial x^\mu} L = \dfrac{1}{2L} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu

So the Euler-Lagrange equations give:

\dfrac{1}{L} \dfrac{d}{ds} (g_{\mu \nu} U^\nu) + g_{\mu \nu} U^\nu \dfrac{d}{ds} \dfrac{1}{L} = \dfrac{1}{2L} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu

Now, the great simplification comes from assuming \dfrac{d}{ds} \dfrac{1}{L} = 0, so L = a constant along the path. With this assumption, the equation simplifies to (multiplying both sides by L)

\dfrac{d}{ds} (g_{\mu \nu} U^\nu) = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu

We expand the left-hand side to get:

(\dfrac{d}{ds} g_{\mu \nu}) U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu

Then you use: \dfrac{d}{ds} g_{\mu \nu} = (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) \dfrac{d}{ds} x^{\mu'} = (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} (the chain rule for derivatives)

So we now have:
(\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu

Rearranging gives:

g_{\mu \nu} \dfrac{d}{ds} U^\nu = - ( (\dfrac{\partial}{\partial x^{\mu'}} g_{\mu \nu}) U^{\mu'} U^\nu - \dfrac{1}{2} \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu} U^{\mu'} U^\nu)

Finally, get rid of the g_{\mu \nu} from the left-hand side by multiplying by the inverse matrix, g^{\mu \nu'} and summing over \mu. this gives:

\dfrac{d U^{\nu'}}{ds} = - \frac{1}{2} g^{\mu \nu'} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu}) U^{\mu'} U^\nu

So if we define Q^{\nu'}_{\nu \mu'} = \frac{1}{2} g^{\mu \nu'} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu}), then this becomes:
\dfrac{d U^{\nu'}}{ds} = -Q^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu

Almost there! The usual connection coefficient is \Gamma^{\nu'}_{\nu \mu'} = \frac{1}{2} g^{\mu \nu'} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} + \dfrac{\partial g_{\mu \mu'}}{\partial x^{\nu}} - \dfrac{\partial g_{\mu' \nu}}{\partial x^\mu}), So we can write:

Q^{\nu'}_{\nu \mu'} = \Gamma^{\nu'}_{\nu \mu'} + \frac{1}{2} g^{\mu \nu'} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu'}} - \dfrac{\partial g_{\mu \mu'}}{\partial x^{\nu}})

Notice that the last term on the right is antisymmetric under the exchange \nu \Rightarrow \mu'. On the other hand, U^{\mu'} U^\nu is symmetric under that exchange. The product of an antisymmetric tensor with a symmetric tensor is zero. So:

Q^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu = \Gamma^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu

So the Euler-Lagrange equations boil down to:

\dfrac{d U^{\nu'}}{ds} = -\Gamma^{\nu'}_{\nu \mu'} U^{\mu'} U^\nu

which is the geodesic equation (whew!). Okay, I guess it was pretty bad...

Alternative definitions of geodesic

Similar threads

I Euclidean geometry and gravity

A Dirac's "GTR" Eq (27.4): how momentum ##p^\mu## varies

A Question on Dirac's derivatives of the 4-velocity w.r.t. coordinates

A Weyl tensor and coordinate acceleration

I Synchronizing clocks in an inertial frame if light is anisotropic

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers