Alternative definitions of geodesic

Click For Summary
The discussion revolves around defining geodesics in a way that is accessible to students with varying mathematical backgrounds. The standard definition, which involves parallel transport of tangent vectors, is deemed too complex for a general audience. Alternative definitions, such as viewing geodesics as curves of maximal or minimal length, are explored for their intuitive appeal. The concept of conjugate points is highlighted as a critical technical issue, with suggestions for how to present this in a simplified manner. Ultimately, the aim is to convey the essence of geodesics while maintaining some level of rigor suitable for a diverse audience.
  • #31
Here is what is true within some small neighborhood of two sufficiently close points p and q within a Lorentzian 4-manifold:

1) Spacelike paths (not geodesics) connecting p and q may have any length > 0 and < infinity (yes, even in a tiny neighborhood).
2) If any timelike path can connect p and q, there is a maximal timelike path within the neighborhood and it is the unique geodesic.
3) If no timelike or null path can connect p and q, then there is a unique spacelike geodesic defined by stationary variation, but not by any extremal property. [obviously, it could also be defined by parallel transport, but the original spirit of this thread was to explore extremal or at least variational definitions)].
4) Otherwise, p and q are connected by a unique null geodesic. I have never been satisfied with variational definitions of null geodesics, for this case I would always use parallel transport definition.
 
Last edited:
Physics news on Phys.org
  • #32
@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?
 
  • #33
DaleSpam said:
@bcrowell have you decided if you are going to go with the "straight line" definition or the "shortest distance between two points" definition?

Shortest distance (maximal time). All I really need for my talk is timelike geodesics, so I think it works. I'll probably give out a handout with some of the technical details on it, so that I don't have to run through all of that in the talk itself. For my purposes, the difficult points that I really want to concentrate on and get at rigorously are the definition of a singularity in terms of geodesic completeness, and the distinction between a curvature singularity and a non-curvature singularity. The definition of a geodesic is just some preliminary apparatus, so I think relegating its details to a handout will be fine.

BTW, thanks, everyone, for a very helpful discussion!
 
  • #34
bcrowell said:
I think this can be strengthened quite a bit. For points spacelike in relation to one another and sufficiently close together, I think the geodesic between them has vanishing variation. That is, if you move points on the geodesic by epsilon, the length changes by something on the order of epsilon squared. (Of course this is not a rigorous formulation, e.g., you need some machinery to define how you measure epsilon.)
PAllen said:
Vanishing variation, yes, but still no extremal property.
I've been thinking about this. I had to brush up on my calculus of variations to do so.

I am doubtful that the variation functional is vanishing, in the sense that is used in calculus of variations to identify a necessary but not sufficient condition for a function being an extremum of a functional.

Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes. The functions would use x as parameter and give a result (y,z,t). That basis wouldn't span curves that backtrack in the x dimension but for now I'm assuming that doesn't affect the argument. The result of applying the variation of the length functional to these basis functions will be positive for paths in the x-y and y-z planes and negative for paths in the x-t plane. So the variation functional must be nonzero.

In fact, I think that the spacelike geodesic will be the functional equivalent of a saddle point in ordinary calculus.

What we can say (I think) is that the geodesic is a minimum for paths for which the time coordinate is constant (provided the geodesic has constant time parameter. It may not, and if it doesn't then it will be more complex to characterise the set of paths over which it is a minimum), and a maximum for paths for which the y and z coordinates are constant. This is like how for a saddle point in calculus we can say that the surface gives a maximum value of z in the x direction and a minimum value of z in the y direction. For the saddle point, ##\frac{\partial z}{\partial x}=\frac{\partial z}{\partial y}=0;\ \frac{\partial^2 z}{\partial x^2}>0## and ##\frac{\partial^2 z}{\partial y^2}<0##.
 
  • #35
A saddle point has vanishing variation. In fact, a generalized saddle point is all that vanishing variation guarantees, at least in the terminology of my (math, not physics) book on calculus of variations.
 
  • #36
In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.
 
  • #37
andrewkirk said:
In the terminology of my text (Gelfand and Fomin 'Calculus of Variations'), for the saddle point that I linked the term 'variation' has no meaning, because the saddle point depicted is a function (from ##\mathbb{R}^2## to ##\mathbb{R}##) and 'variation' is a property of a functional, at a function. Based on their definition of 'variation' I would expect - based on the above argument - that the variation of the length functional at the spacelike geodesic is non-vanishing. They define 'vanishing' to mean identically zero. At least I think that's what they mean, but there's some uncertainty because they say '##\delta J_y## is vanishing at ##y##' means that ##\delta J_y[h]=0## 'for all admissible h', and they don't explain what they mean by 'admissible' anywhere that I can find (h is the incremental function added to the function at which the variation is zero.

What text are you using? I have to say that I'm not in love with Gelfand and Fomin, as they keep saying things like 'the functional J[h]' whereas J is the functional and J[h] is a real number that is obtained when one applies the functional to function h. It caused me no end of confusion.
The book I am using is a monograph by Gilbert Ames Bliss. What you describe is similar to part of his presentation, except that he does define admissibility of h. The h functions need only be continuous, meet boundary conditions, and be c2 smooth except for a finite number of points. If, without assuming anything specific about h, the derivative of the integral with respect to the parameter applied to it is zero, it is said the variation is zero. This is equivalent to satisfying the Euler Lagrange equations. Spacelike geodesics clearly satisfy these equations, so they have stationary (zero) variation. However, since they are not locally extremal, they are some type of saddle point.
 
Last edited:
  • Like
Likes bcrowell
  • #38
andrewkirk said:
Assume for simplicity that, in a given coordinate system the two spacetime points differ only in a single spatial coordinate, say x. Then there is a basis for the vector space of paths between the two points, that is made up of basis functions each of which is confined to one of the x-t, x-y or x-z planes.

Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.
 
  • #39
bcrowell said:
Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.

Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?
 
  • #40
bcrowell said:
Here you seem to be assuming that spacetime can be covered by a single chart and that it has the structure of a vector space. Neither of these is true in GR.
Re the coverage by a single chart: only locally. The intended context of the comment is two spacelike separated points inside the same geodesic ball. That fits in with the overall picture via the finite set of points ##h_k## you posed in the OP that break up the full geodesic. If the points are close enough together, each pair can be in a geodesic ball.

The vector space to which I referred is an infinite-dimensional vector space of functions, not a 4D one of spacetime directions. Each element of the vector space denotes a path (not necessarily geodesic) between the two points. We choose a local coordinate system in which the ##x## two points have coordinates ##(a^t,a^x,a^y,a^z)## and ##(a^t,b^x,a^y,a^z)##. Then the vector space mentioned is the set of suitably nice (ie continuous etc) functions ##f## from the real interval ##[a^x,b^x]## to ##\mathbb{R}^3## such that ##f(x')## is the ##t, y, z## coordinates of the point on the path that has ##x## coordinate ##x'##. The functional being considered is ##f\mapsto L(\pi^{-1}\circ g_f([a^x,b^x]))## where
  • ##g_f## is the function ##x\mapsto (x,f(x))##
  • ##\pi## is the coordinate map for the geodesic ball
  • ##L## is the length function for paths in ##M##
I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.
 
  • #41
andrewkirk said:
I think I have almost convinced myself that the variation of the length functional really is zero at the spacelike geodesic, but I need to think it through more before I can have any confidence that I understand why.

Are you defining geodesic to mean a parametrized path satisfying the geodesic equation (or the definition in terms of parallel transport, which is mathematically equivalent for a connection defined in terms of the metric)? In that case, doesn't the usual Lagrange-Euler equations imply that (spacelike or timelike) geodesics have zero variation?

If we consider a parametrized curve x^\mu(s), then the invariant length of the curve is given by \int ds \sqrt{g_{\mu \nu} U^\mu U^\nu}, where U^\mu = \frac{dx^\mu}{dx^\nu}.If you treat this like an action integral, with lagrangian L = \sqrt{g_{\mu \nu} U^\mu U^\nu}, then the Lagrange-Euler equations for a path with zero variation leads to

\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0

After reparametrizing to use an affine parameter, this becomes the geodesic equation.

I made the restriction to spacelike or timelike because the derivation of Euler-Lagrange equations breaks down if the path is a null path. That's because the quantity

\dfrac{\partial L}{\partial U^\mu} = \dfrac{g_{\mu \nu} U^\nu}{L}

is undefined whenever L = 0, which is always the case for null paths.
 
  • #42
stevendaryl said:
Well let's suppose that we have a definition of geodesic, "locally geodesic", that applies in a small enough simply-connected, open subset of the manifold. Then wouldn't that extend to arbitrary geodesics by saying that the path is locally geodesic in every sufficiently small simply-connected open subset of the manifold that the geodesic passes through?

Yes. As written, that sounds like a much weaker condition that the definition I proposed in #1, but I assume they're actually equivalent.
 
  • #43
Geodesics is land surveying at the scale of the whole planet.
 
  • #44
Allin said:
Geodesics is land surveying at the scale of the whole planet.
I think you mean "geodesy" or "geodetics". That's not what we are talking about here.
 
  • #45
stevendaryl said:
the Lagrange-Euler equations for a path with zero variation leads to

\dfrac{d}{ds} \dfrac{\partial L}{\partial U^\mu} - \dfrac{\partial L}{\partial x^\mu} = 0

After reparametrizing to use an affine parameter, this becomes the geodesic equation.
That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?
 
  • #46
andrewkirk said:
That sounds like a promising approach that ought to work. I tried doing this but, because of the complexity of L, the algebra soon became very ugly and I ran out of paper ( or patience - one or the other).

Maybe I took a wrong turn somewhere. Have you worked it through?

It's easy to get into a blind alley, but I don't think this is that bad.

With L = \sqrt{g_{\mu \nu} U^\mu U^\nu},
  • \dfrac{\partial}{\partial U^\mu} L = \dfrac{1}{L} g_{\mu \nu} U^\nu
  • \dfrac{\partial}{\partial x^\mu} L = \dfrac{1}{2L} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu
So the Euler-Lagrange equations give:

\dfrac{1}{L} \dfrac{d}{ds} (g_{\mu \nu} U^\nu) + g_{\mu \nu} U^\nu \dfrac{d}{ds} \dfrac{1}{L} = \dfrac{1}{2L} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu

Now, the great simplification comes from assuming \dfrac{d}{ds} \dfrac{1}{L} = 0, so L = a constant along the path. With this assumption, the equation simplifies to (multiplying both sides by L)

\dfrac{d}{ds} (g_{\mu \nu} U^\nu) = \dfrac{1}{2} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu

We expand the left-hand side to get:

(\dfrac{d}{ds} g_{\mu \nu}) U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu

Then you use: \dfrac{d}{ds} g_{\mu \nu} = (\dfrac{\partial}{\partial x^{\mu&#039;}} g_{\mu \nu}) \dfrac{d}{ds} x^{\mu&#039;} = (\dfrac{\partial}{\partial x^{\mu&#039;}} g_{\mu \nu}) U^{\mu&#039;} (the chain rule for derivatives)

So we now have:
(\dfrac{\partial}{\partial x^{\mu&#039;}} g_{\mu \nu}) U^{\mu&#039;} U^\nu + g_{\mu \nu} \dfrac{d}{ds} U^\nu = \dfrac{1}{2} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu

Rearranging gives:

g_{\mu \nu} \dfrac{d}{ds} U^\nu = - ( (\dfrac{\partial}{\partial x^{\mu&#039;}} g_{\mu \nu}) U^{\mu&#039;} U^\nu - \dfrac{1}{2} \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu} U^{\mu&#039;} U^\nu)

Finally, get rid of the g_{\mu \nu} from the left-hand side by multiplying by the inverse matrix, g^{\mu \nu&#039;} and summing over \mu. this gives:

\dfrac{d U^{\nu&#039;}}{ds} = - \frac{1}{2} g^{\mu \nu&#039;} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu&#039;}} - \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu}) U^{\mu&#039;} U^\nu

So if we define Q^{\nu&#039;}_{\nu \mu&#039;} = \frac{1}{2} g^{\mu \nu&#039;} (2 \dfrac{\partial g_{\mu \nu}}{\partial x^{\mu&#039;}} - \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu}), then this becomes:
\dfrac{d U^{\nu&#039;}}{ds} = -Q^{\nu&#039;}_{\nu \mu&#039;} U^{\mu&#039;} U^\nu

Almost there! The usual connection coefficient is \Gamma^{\nu&#039;}_{\nu \mu&#039;} = \frac{1}{2} g^{\mu \nu&#039;} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu&#039;}} + \dfrac{\partial g_{\mu \mu&#039;}}{\partial x^{\nu}} - \dfrac{\partial g_{\mu&#039; \nu}}{\partial x^\mu}), So we can write:

Q^{\nu&#039;}_{\nu \mu&#039;} = \Gamma^{\nu&#039;}_{\nu \mu&#039;} + \frac{1}{2} g^{\mu \nu&#039;} (\dfrac{\partial g_{\mu \nu}}{\partial x^{\mu&#039;}} - \dfrac{\partial g_{\mu \mu&#039;}}{\partial x^{\nu}})

Notice that the last term on the right is antisymmetric under the exchange \nu \Rightarrow \mu&#039;. On the other hand, U^{\mu&#039;} U^\nu is symmetric under that exchange. The product of an antisymmetric tensor with a symmetric tensor is zero. So:

Q^{\nu&#039;}_{\nu \mu&#039;} U^{\mu&#039;} U^\nu = \Gamma^{\nu&#039;}_{\nu \mu&#039;} U^{\mu&#039;} U^\nu

So the Euler-Lagrange equations boil down to:

\dfrac{d U^{\nu&#039;}}{ds} = -\Gamma^{\nu&#039;}_{\nu \mu&#039;} U^{\mu&#039;} U^\nu

which is the geodesic equation (whew!). Okay, I guess it was pretty bad...
 
  • Like
Likes andrewkirk

Similar threads

Replies
3
Views
786
  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 17 ·
Replies
17
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 76 ·
3
Replies
76
Views
3K
  • · Replies 57 ·
2
Replies
57
Views
4K
  • · Replies 15 ·
Replies
15
Views
1K
Replies
2
Views
743
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K