Parallel transport general relativity

Jufa · Apr 3, 2021

Suppose you have a tensor quantity called "B" referenced in a certain locally inertial frame (with four Minkowski components for instance). As far as I know, a parallel transportation of this quantity from a certain point "p" to another point "q" consists in expressing it in terms of the reference frame of "q" in such a way that the tensor quantity "B" remains invariant. If it is the case that the space is flat and a coordinate independent reference frame is choosen (such as the Minkowski's), then "p" and "q" share local reference frames and therefore the components of "B" coincide in both locations. This is not the case in curved spaces, i.e. where gravity shows up.
Here is my problem:
Once known the metric of the space, we can fix a single local reference frame at every location (event). Then I don't understand how it is possible that the parallel transport can depend on the path or, equivalently, how can a parallel transport on a closed path give a different set of tensor coordinates than the initial ones. If the initial and final points of the path coincide their reference frame should be the same as well and therefore different coordinates in initial and final tensors would result into actually modifying the tensor quantity, which is not what parallel transport does.
The only solution I encounter to this apparent paradox is that, given the fact that local inertial frames (free falling frames) are not unique, one can choose the frame of the initial and final points of the path (which are the same) to be different. Why does this happen? I mean theoretically we are free to choose the local inertial frame that most suits our needs. But why would not we choose that the same location (although after covering a whole closed path) has the same reference frame? I guess that we are not that free to choose the reference frame of that final point once covering an entire close path. If we were, the difference of the tensor components after a certain parallel transport would not be even well defined.
My question is then the following:
Once fixing the initial point "p", its locally inertial reference frame and the curve we want to follow to carry out the parallel transport, how does this curve affect the choice of the locally inertial reference frames of every location covered? Maybe an example with a certain curve it would make it easier for me to understand it.
Thanks in advance.

Dale · Apr 3, 2021

Jufa said:

As far as I know, a parallel transportation of this quantity from a certain point "p" to another point "q" consists in expressing it in terms of the reference frame of "q" in such a way that the tensor quantity "B" remains invariant.

I don’t recognize that as an accurate description of parallel transport.

Parallel transport intuitively means that you move a vector (or other tensor) without turning it. If your path turns 10 degrees left then a vector that doesn’t turn with you will be 10 degrees to your right.

A good example is on the 2D surface of a sphere. Suppose you start at the equator with a vector pointing north, and walk due north to the North Pole, the vector will point forward that whole segment. Now, at the North Pole turn 90 deg to the right, so the vector is now pointing to your left. Walk due south back to the equator and the vector will point to your left the whole way. On the equator turn 90 deg to the right again, so the vector is pointing behind you. Walk back to your start and the vector will continue to point behind you. When you return to your starting point turn 90 deg to the right. Finally, you have returned to your original location and bearing, but now the vector is pointing to your right (east) instead of north.

Nugatory · Apr 3, 2021

Jufa said:

Summary:: I am having serious difficulties to understand the concept of parallel transport.

Then I don't understand how it is possible that the parallel transport can depend on the path or, equivalently, how can a parallel transport on a closed path give a different set of tensor coordinates than the initial ones. If the initial and final points of the path coincide their reference frame should be the same as well and therefore different coordinates in initial and final tensors would result into actually modifying the tensor quantity, which is not what parallel transport does.

Here's an easily visualized example.

At a point on the Earth's equator we have a vector pointing due north; for definiteness let's choose that point to be at zero degrees of longitude. We parallel transport the vector to the north pole; it is now pointing south along the 180 degrees of longitude line (if this is not clear, imagine that we started at the equator holding a spear pointing due north, and then started walking north until we reached the pole. What direction does the spear point now?). Next, parallel transport the vector along the 90 degree west longitude line until we reach the equator; now it points west. Parallel transport it along the equator to the east to return to our starting point, and now the vector is pointing west; the round trip rotated it by 90 degrees.

Next, try the same round trip in the other direction, again starting with the vector pointing due north: first 90 degrees west, then 90 degrees north, then 90 degrees south on the zero longitude line. We find that the vector ends up pointing east instead of west so we see different results on different paths.

We didn't do anything with reference frames or coordinates here, we just took advantage of the fact that a geodesic (the equator and the latitude lines are all geodesics) parallel transports a vector onto itself.

nucl34rgg · Apr 3, 2021

You're transporting the tangent vectors from the tangent space at one point on the curve to a different tangent space at a different point so that the vectors remain parallel as you travel an infinitesimal amount along a curve from the first point to the second point. Does that make sense?

Jufa · Apr 3, 2021

Dale said:

I don’t recognize that as an accurate description of parallel transport.

Well, the curve followed by a parallel transportation is defined as these curve where the covariant derivative of the vector vanishes. This means that the tensor quantity to transported does not change throghout the whole path. Am I not right?

DrGreg · Apr 3, 2021

Dale said:

Suppose you start at the equator with a vector pointing north, and walk due north to the North Pole, the vector will point forward that whole segment. Now, at the North Pole turn 90 deg to the right, so the vector is now pointing to your left. Walk due south back to the equator and the vector will point to your left the whole way. On the equator turn 90 deg to the right again, so the vector is pointing behind you. Walk back to your start and the vector will continue to point behind you. When you return to your starting point turn 90 deg to the right. Finally, you have returned to your original location and bearing, but now the vector is pointing to your right (east) instead of north.

Illustration from Wikipedia:

Source: https://commons.wikimedia.org/wiki/File:Parallel_Transport.svg
Creator: Fred the Oyster
Licence: CC-BY-SA 4.0

PeterDonis · Apr 3, 2021

Jufa said:

I am having serious difficulties to understand the concept of parallel transport.

I think part of the issue is that you are trying to think about parallel transport in terms of local reference frames, and you appear to have some misconceptions in your understanding of local reference frames. Also you appear to be thinking about the relationship between parallel transport and local reference frames backwards; see the comments at the end of this post.

Jufa said:

If it is the case that the space is flat and a coordinate independent reference frame is choosen (such as the Minkowski's), then "p" and "q" share local reference frames

No, they don't. They share (assuming they are at rest relative to each other, or more precisely that observers located at p and q are at rest relative to each other) the same global reference frame, since they are at rest relative to each other, but their local reference frames will not be the same since they will be centered on different points.

Jufa said:

Once known the metric of the space, we can fix a single local reference frame at every location (event).

You can do this even without knowing the metric, since local reference frames are treated as flat and any effects due to spacetime curvature are ignored. But there won't be just one local reference frame at any given event in spacetime; as you recognize later on in your post, there will be an infinite number of them, in fact a six-parameter infinite group of them, since it takes six independent parameters to specify a 4-velocity vector at an event (corresponding to the six independent parameters that specify an arbitrary local Lorentz transformation).

Jufa said:

Once fixing the initial point "p", its locally inertial reference frame and the curve we want to follow to carry out the parallel transport, how does this curve affect the choice of the locally inertial reference frames of every location covered?

It doesn't have to affect the choice of local reference frames at all. But if you want to take the basis vectors of a local reference frame at p, and parallel transport those vectors along some curve (which we'll here assume to be a geodesic, to keep things simple) to q in order to obtain a local reference frame at q, that is indeed a well-defined operation. And if you do that operation around a closed curve in a curved spacetime, you will find, in general, that the final local reference frame you end up with will not be the same as the one you started with.

Jufa · Apr 3, 2021

PeterDonis said:

It doesn't have to affect the choice of local reference frames at all.

Well, you say that if you choose a certain closed curve it might be possible that the final basis does not coincide with the inicial. Nonetheless if you don't perform any curve (which Can be thought as a particular case of a closed curve) then the final basis is clearly the same as the initial. From this point of view it seems clear to me that fixing the curve implies fixing the reference basis of the locations in the curve and I wonder what is the relation.

Dale · Apr 3, 2021

Jufa said:

Well, the curve followed by a parallel transportation is defined as these curve where the covariant derivative of the vector vanishes.

Parallel transport doesn’t define a curve, you can do parallel transport along any curve.

Jufa said:

This means that the tensor quantity to transported does not change throghout the whole path. Am I not right?

That is correct. But I don’t know what it has to do with reference frames.

PeterDonis · Apr 3, 2021

Jufa said:

you say that if you choose a certain closed curve it might be possible that the final basis does not coincide with the inicial

Not "might be possible" if the spacetime is curved; the final basis will not coincide with the initial one, definitely.

Jufa said:

if you don't perform any curve

Meaning the "curve" is just one point? Yes, this obviously would not change anything. So what?

Jufa said:

From this point of view it seems clear to me that fixing the curve implies fixing the reference basis of the locations in the curve

No, all it means is that a "curve" consisting of only a single point is not a useful concept for this discussion. You have to look at curves that have a continuum of points.

What fixing the curve does is determine what you will end up with if you parallel transport a vector, or a set of vectors such as a basis, from one endpoint of the curve, along the curve to the other endpoint. But that has nothing whatever to do with choosing a basis at any point along the curve. The basis at the initial point is just one way of obtaining a set of vectors to be transported--and of course it's not the only way. You can pick vectors to be transported however you want. Once you've chosen vectors, how they are transported is independent of any choice of reference frame; that's why I said you were thinking of this backwards.

Orodruin · Apr 4, 2021

What strikes me when reading through all of this is that OP’s entire argument is made without referring back to the mathematical definitions and deductions which means that it is heuristic at best and ultimately leads to the wrong place. When all else fails, I find there is often a point in trying to formalise the argument as this may make it easier to see where the heuristic fails.

FactChecker · Apr 4, 2021

Orodruin said:

What strikes me when reading through all of this is that OP’s entire argument is made without referring back to the mathematical definitions and deductions which means that it is heuristic at best and ultimately leads to the wrong place. When all else fails, I find there is often a point in trying to formalise the argument as this may make it easier to see where the heuristic fails.

I think that one conceptual error is the implication that a tensor must have the same coordinates and coordinate system when parallel transport along a cyclical path gets back to the starting point. The tensor can have different coordinates in the new coordinate system as long as the tensor coordinate transformation requirements are satisfied.

Ibix · Apr 4, 2021

Jufa said:

To me it has to do that the components of the transported tensor will along the curve as long as the reference frame choice changes along the curve.

There seems to be verb missing here, and I'm not at all sure what you are trying to say.

The point about parallel transport is that it defines what we mean by "not changing" for the tangent and cotangent spaces. If you parallel transport the basis vectors of a local frame along some path, those parallel transported vectors will always be the basis vectors of the local frame of someone following that path. If the path is a closed loop, the parallel transported frame may be different on a subsequent visit to a point from on its previous visit.

Jufa · Apr 4, 2021

Ibix said:

If you parallel transport the basis vectors of a local frame along some path, those parallel transported vectors will always be the basis vectors of the local frame of someone following that path.

I think this is the key point. First of all. What do you mean by transporting the basis vectors? As far as I know what changes after a certain transport are the coordinates, not the vectors them selves. This is all I don't Understand.

Ibix · Apr 4, 2021

Jufa said:

I think this is the key point. First of all. What do you mean by transporting the basis vectors? As far as I know what changes after a certain transport are the coordinates, not the vectors them selves. This is all I don't Understand.

Vectors at different points are in separate tangent spaces, so after parallel transport along a non-closed path you most definitely do not have the same vectors. The whole point is that there isn't even a way to compare the vectors at one point to those at another, except to define some mapping of a vector from one space to the other. That's what parallel transport does. It says that a vector in the tangent space at one point is, in some sense, pointing in the same direction as another vector in another tangent space.

But what about closed paths? In that case, parallel transport is a map from a tangent space to itself. It isn't necessarily an identity map, so a vector parallel transported around a curve isn't necessarily itself. The transported vector will typically point in a different direction to its original. So how could it be the same vector?

The key point is that parallel transport is a map from one vector space to another (or itself). It doesn't actually take a vector out of one tangent space and carry it to another space (what would that even mean?), so it can only be a mathematical method for mapping between two vectors (possibly in different spaces).

I'm not sure what point you are trying to make when you talk about "the coordinates [changing], not the vectors them selves". Are you confusing this with a basis change? In that case, you decide that you don't like your basis vectors anymore and pick new ones, and then you must re-express all your quantities in terms of your new basis - i.e., change their components without changing the vectors/tensors themselves. But in the case of parallel transport, we're applying some maths to find out what carrying our basis vectors around a loop would do to the direction they point. You could then change to this new basis, but you aren't required to do so.

stevendaryl · Apr 4, 2021

Jufa said:

I think this is the key point. First of all. What do you mean by transporting the basis vectors? As far as I know what changes after a certain transport are the coordinates, not the vectors them selves. This is all I don't Understand.

Yes, the basis vectors change as you move around.

Let's look at a very simple case that has connection coefficients which is the 2-dimensional plane, in polar coordinates versus cartesian coordinates.

In cartesian coordinates, you have coordinates ##x## and ##y## and basis vectors ##e_x##, which is a unit vector in the x-direction, and ##e_y##, which is a unit vector in the y-direction. Transforming to polar coordinates ##r## and ##\phi## where ##x = r cos(\phi)## and ##x = r sin(\phi)##, we have:

##e_r = \dfrac{\partial x}{\partial r} e_x + \dfrac{\partial y}{\partial r} e_y = cos(\phi) e_x + sin(\phi) e_y##
##e_\phi = \dfrac{\partial x}{\partial \phi} e_x + \dfrac{\partial y}{\partial \phi} e_y = - r sin(\phi) e_x + r cos(\phi) e_y##

Obviously, if ##e_x## and ##e_y## are constant vectors (which they are), then ##e_r## and ##e_\phi## are not constant vectors. They are functions of ##r## and ##\phi##.

As a matter of fact, one way to define the connection coefficient ##\Gamma^i_{jk}## in a coordinate basis is this way:

##\Gamma_{ijk} = e_i \cdot \nabla_j e_k##

(Then ##\Gamma^m_{jk} = g^{im} \Gamma_{ijk}##)

If the basis vectors didn't change from point to point, then the ##\Gamma## coefficients would all be zero.

DrGreg · Apr 4, 2021

It's easier to consider an example of a curved 2D space instead of a curved 4D spacetime, so consider the 2D surface of a sphere. The advantage is that we can draw diagrams of the 2D space embedded in 3D Euclidean space. The third dimension is never used in the maths but it can help you visualise what is happening.

Below we have a blue 2D manifold. The yellow squares represent 2D tangent spaces at various points on the manifold and the red and green arrows represent a choice for the two basis vectors for each tangent space.

?hash=4bc89fef1c2067ea1031cd87854b063b.png

The left diagram shows basis vectors that have been parallel-transported from a point ##p## along two different paths.

The right diagram shows basis vectors that have been parallel-transported from another point ##r## along two different paths.

And what is clear is that both diagrams agree over the basis vectors at ##p## and ##r##, but they disagree over the basis vectors at ##q##.

It also shows that parallel transport direct from ##p## to ##q## is not the same as parallel transport from ##p## to ##r## to ##q##.

A.T. · Apr 4, 2021

Jufa said:

I am having serious difficulties to understand the concept of parallel transport.

Maybe this mechanical model can help with the intuition:

A.T. said:

Imagine a tank, with the gun turret rotation inversely coupled to the tank steering: When the tank hull turns X degree relative to the local ground, the turret turns -X degree relative to the hull, so the gun keeps its orientation relative to the local ground. This is how you parallel transport a gun.

On the sphere, the only way to prevent the turret from rotating relative to the hull, is to move on great circles (geodesics). To move on a circle of constant latitude off the equator, the tracks of the tank must be running at different speeds, so the tank is turning, and the turret is rotating relative the hull. When the tank arrives at the starting position, the gun will have a different orientation, then it started with.

PeterDonis · Apr 4, 2021

Jufa said:

What do you mean by transporting the basis vectors? As far as I know what changes after a certain transport are the coordinates, not the vectors them selves.

No. Again you have it backwards. If you are talking about local coordinates centered on two different points, and all you have is the coordinates themselves, you have no way of comparing them at all. Components in local coordinates centered on point p have no relationship whatsoever with components in local coordinates centered on point q, if the coordinates are all you have.

The only way to have any relationship between anything at point p and anything at point q is to transport geometric objects, like vectors (you can also transport tensors of higher rank), from point p to point q, along some specific curve. And geometric objects are independent of any choice of coordinates; you have to get the geometric objects and how they transport straight first, before even looking at how they can be used to relate coordinates at one point to coordinates at another.

FactChecker · Apr 4, 2021

Jufa said:

I think this is the key point. First of all. What do you mean by transporting the basis vectors? As far as I know what changes after a certain transport are the coordinates, not the vectors them selves. This is all I don't Understand.

It is important to distinguish between a "vector" in the physics sense versus a "vector" in the pure mathematical sense. In physics, a vector often means something physical that can be represented in many different coordinate systems but has a fixed physical meaning independent of the coordinate system. In mathematics, a vector is more general and some may be defined that do not represent anything physical and are dependent on the coordinate system. Orthonormal basis vectors can be used to define a coordinate system and are clearly dependent on the coordinate system.

Jufa · Apr 4, 2021

I am sorry guys but I definitely don't get it. In the sphere example, which I have seen everywhere, I don't know which is the rule to choose the local reference frame at a certain point of the curve. In the beginning it seems that the basis vectors chosen are the ones corresponding to the azimuth angle and the polar angle. This makes sense to me, because these two vectors do change along the curve chosen in such a way that in order to keep some vector (in the abstract physical sense) constant their components must change at every point. Nevertheless, when arriving to the north pole this basis vectors are no longer well defined, since it is a singular point.

Jufa · Apr 4, 2021

I will try to make my question as simple as possible:
Let us consider the physical quantity ##A^{\mu}(x^{\nu}_0) e^\alpha_{\mu}(x^\nu_0)## which it is obtained by evaluating the vectorial field ##A^{\mu}(x^{\nu})## at ##x^\nu_0##, considering a local reference frame defined by the basis vectors ##e^\alpha_{\mu}(x^\nu_0)##. This basis vectors, in contrast with the coordinates ##A^{\mu}(x^{\nu}_0)##, reffer to an actual physical quantity and, therefore, do not depend on the reference frame chosen.
If we perform a parallel transport of this vector from ##x^{\nu}_0## to ##y^{\nu}_0## we obtain ## \hat{A}^{\mu}(y^{\nu}_0) ## which satisfies the following:

$$ \hat{A}^{\mu}(y^{\nu}_0) e^\alpha_{\mu}(y^\nu_0) = A^{\mu}(x^{\nu}_0) e^\alpha_{\mu}(x^\nu_0) $$

My only question is how do we know the relation between the basis vectors ##e_{\mu}(y^\nu_0)## and ##e^\alpha_{\mu}(x^\nu_0)##. Because it seems that they don't just depend on the position considered but also on the path we have followed to reach that position. I am asking for the rule we follow to choose the basis vectors at every single point of the curve, which ultimately determines how do the components of the vector transported must vary.

Someone might say that there is no way of relating ##e^\alpha_{\mu}(y^\nu_0)## and ##e^\alpha_{\mu}(x^\nu_0)## since they are vectors referred to different locations and therefore different local frames. But I don't see this clear, since ##e^\alpha_{\mu}(y^\nu_0)## and ##e^\alpha_{\mu}(x^\nu_0)## are actual physical quantities i.e. they do not depend of the reference frame considered. In the sphere example, for instance, the basis vectors are all 3-D vectors which can be related considering theme as free vectors, independently of their position.

A.T. · Apr 4, 2021

Jufa said:

I am sorry guys but I definitely don't get it. In the sphere example, which I have seen everywhere, I don't know which is the rule to choose the local reference frame at a certain point of the curve.

Does the mechanical analogy in post #18 make sense to you? it doesn't explicitly use specific local reference frames.

Jufa · Apr 4, 2021

A.T. said:

Does the mechanical analogy in post #18 make sense to you? it doesn't explicitly use specific local reference frames.

I'm affraid I don't get it.

Jufa · Apr 4, 2021

DrGreg said:

a choice for the two basis vectors for each tangent space.

Here you state that the reference frame at every point is out of choice.

Jufa · Apr 4, 2021

DrGreg said:

basis vectors that have been parallel-transported

Nevertheless you say here that the new basis at each point is not obtained by a mere choice but by parallel transporting it.

DrGreg · Apr 4, 2021

Jufa said:

Nevertheless you say here that the new basis at each point is not obtained by a mere choice but by parallel transporting it.

What I mean is I have made a choice, and my choice is to use parallel-transported basis vectors. I could have chosen a different basis, but I didn't.

Jufa · Apr 4, 2021

etotheipi said:

@Jufa, try reading through [the relevant sections of] this document: https://arxiv.org/pdf/1412.2393.pdf

Many thanks. I'll give it a try.

Orodruin · Apr 4, 2021

Jufa said:

In the sphere example, for instance, the basis vectors are all 3-D vectors which can be related considering theme as free vectors, independently of their position.

This is where (one of) your misconception lies. There is generally no need to refer to the embedding of the space in a higher dimensional one. The description is purely two-dimensional and the tangent spaces are fundamentally different. If you do want to consider the embedding, the tangent spaces at two different points are still different as a vector that is tangent to the sphere at one point will generally not be tangent to it at another point. Remember, the description is two-dimensional - you are utterly forbidden to use vectors that are not tangent to the sphere as those would represent translations out of the manifold.

Jufa · Apr 4, 2021

DrGreg said:

What I mean is I have made a choice, and my choice is to use parallel-transported basis vectors. I could have chosen a different basis, but I didn't.

Okey. Then the rule I was asking for that we follow to chose the reference frame at each point of the curve is that we transport the initial basis vectors. The only thing I don't get is how we do transport a basis vector. A basis vector is a physical quantity (independent of the reference system) and therefore it can not vary due to a parallel transport (maybe you refer to transport the coordinates of the reference basis). Also, in order to transport some vector to a a certain position "q" we need to know in advance what the reference frame we will use at "q". So to me it sounds weird to say that in order to know which reference basis we choose in "q" we need to perform a parallel transport, which already requires of knowing the reference frame in "q".

Parallel transport general relativity

Attachments

Similar threads

Undergrad Euclidean geometry and gravity

Undergrad Synchronizing clocks in an inertial frame if light is anisotropic

Undergrad Question about Parallel Transport

Undergrad The Einstein Clock aka Light Clock

Graduate Assumptions of Hawking-Penrose 1970 Singularity Theorem

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers