# How to Study Fermi-Walker Transport in Minkowski Spacetime

[Total: 1    Average: 5/5]

This is the first of several posts that will develop some mathematical machinery for studying Fermi-Walker transport. In this first post, we focus on Minkowski spacetime in order to introduce the basic concepts without having to deal with the complications introduced by spacetime curvature.

Before looking at Fermi-Walker transport, we first need to introduce the concept of a frame field. A frame field is defined as an assignment of four mutually orthogonal vectors (one timelike and three spacelike) to every event in some region of spacetime. Typically the timelike vectors are tangent to some family of worldlines of interest, and the spacelike vectors represent particular “directions in space” that have some useful meaning. A simple physical realization of a frame field would be a family of observers whose worldlines fill some region of spacetime, and each of whom carries a clock and a set of three mutually orthogonal rulers.

The general idea behind Fermi-Walker transport is that it defines a frame field whose spatial basis vectors are “not rotating”. This is basically a generalization of the concept of parallel transport, which takes into account the fact that, for a non-geodesic worldline, the tangent vector, which for timelike worldlines will be the timelike vector of the frame field, is not parallel transported along the worldline. MTW goes into this in some detail, and their discussion brings out a key point: as it stands, “not rotating” makes no sense for an accelerated worldline, because the timelike vector of the frame field must be rotating in the spacetime sense if it is accelerated, in order to remain tangent to the worldline. The best we can do is to ensure that there is no additional rotation of the spatial basis vectors, over and above the rotation due to proper acceleration, and this is what Fermi-Walker transport does. In terms of the simple physical realization we described above, Fermi-Walker transport means that each observer’s rulers are gyro-stabilized so their directions do not rotate (although, as we will see, the term “rotate” has to be interpreted carefully and “not rotating” doesn’t quite mean the same thing as it does in Newtonian mechanics).

Let’s start with a simple case to see how this works: the Rindler congruence, a family of uniformly accelerated worldlines in Minkowski spacetime. We will use the Rindler chart to describe this congruence, which has the line element (note that we have normalized the units):

$$ds^2 = – X^2 dT^2 + dX^2 + dY^2 + dZ^2$$

In this chart, the uniformly accelerated Rindler observers are at rest, so the frame field that describes them is:

$$\hat{e}_0 = \frac{1}{X} \partial_T$$
$$\hat{e}_1 = \partial_X$$
$$\hat{e}_2 = \partial_Y$$
$$\hat{e}_3 = \partial_Z$$

The proper acceleration is given by

$$\nabla_{\hat{e}_0} \hat{e}_0 = \frac{1}{X} \hat{e}_1$$

as we can see by cranking through the coordinate version (we write ##u^a## for the components of ##\hat{e}_0##):

$$\nabla_{\hat{e}_0} \hat{e}_0 = u^a \nabla_a u^b = u^a \partial_a u^b + u^a \Gamma^b_{ac} u^c = u^T \Gamma^{X}_{TT} u^T \partial_X = \frac{1}{X} X \frac{1}{X} \partial_X = \frac{1}{X} \hat{e}_1$$

We can also compute the covariant derivative of ##\hat{e}_1## the same way (we write ##e^b## for the components of ##\hat{e}_1## ):

$$\nabla_{\hat{e}_0} \hat{e}_1 = u^a \nabla_a e^b = u^a \partial_a e^b + u^a \Gamma^b_{ac} e^c = u^T \Gamma^{T}_{TX} e^X \partial_T = \frac{1}{X} \frac{1}{X} \partial_T = \frac{1}{X} \hat{e}_0$$

Note the symmetry between the two results, which illustrates the (hyperbolic) rotation in the ##T-X## plane that is induced by the proper acceleration of the worldline. Note also that there is *no* additional rotation in the ##Y-Z## plane. This is an example of Fermi-Walker transport.

Note also that we can write

$$\nabla_{\hat{e}_0} \hat{e}_1 = \left( \hat{e}_1 \cdot \nabla_{\hat{e}_0} \hat{e}_0 \right) \hat{e}_0$$

which simply expresses the ##T-X## rotation we have just described. This suggests that we can define a “Fermi derivative” as follows:

$$D_F \hat{e}_{\mu} = \nabla_{\hat{e}_0} \hat{e}_{\mu} – \left( \hat{e}_{\mu} \cdot \nabla_{\hat{e}_0} \hat{e}_0 \right) \hat{e}_0 + \left( \hat{e}_{\mu} \cdot \hat{e}_0 \right) \nabla_{\hat{e}_0} \hat{e}_0$$

so that Fermi-Walker transport just means ##D_F \hat{e}_{\mu} = 0##. Note that we have added a third term, which is zero for a spatial basis vector, but which is ##- \nabla_{\hat{e}_0} \hat{e}_0## for the 4-velocity itself, since its norm is ##-1##; this is needed because the 4-velocity itself is also Fermi-Walker transported, so we must also have ##D_F \hat{e}_0 = 0##. In fact, ##D_F## as written above can be applied to *any* 4-vector whatsoever, to see whether or not it is being Fermi-Walker transported; if ##D_F## gives zero, it is, otherwise it is not.

We can also simplify the computation considerably by noting some things:

(1) The metric and the frame field vectors are all independent of the coordinates that appear in the 4-velocity. (In this case, the only such coordinate is ##T##, and all the frame field vectors are independent of ##T##. But we’ll cover other cases in which the 4-velocity has components in other coordinate directions, and all of them will still meet this requirement.) This means that all of the partial derivative terms that appear in ##\nabla_{\hat{e}_0}## are zero and can be ignored; we only have to evaluate the terms in the connection coefficients.

(2) Only a few particular connection coefficients come into play: the ones with indexes that are either 4-velocity coordinates or coordinates that the metric and frame field vectors depend on. And if our chart is chosen such that the frame field is independent of any coordinates that appear in the 4-velocity (as above), then there are only two possible index patterns for such a connection coefficent: ##\Gamma^b_{ac}## or ##\Gamma^{a}_{cb}##, where ##b## is a coordinate (that does not appear in the 4-velocity) that the metric and frame field vectors depend on, and ##a, c## are coordinates that appear in the 4-velocity (and which the metric and frame field vectors do *not* depend on). (In the Rindler example above, these would be ##\Gamma^X_{TT}## and ##\Gamma^T_{TX}##.)

(3) The metric is diagonal in the coordinates which do not appear in the 4-velocity; i.e., it is diagonal in any coordinate which the metric and frame field vectors depend on. That means only one ##b## coordinate can appear in any connection coefficient. (Note that if the metric is also diagonal in the remaining coordinates, as it will be in the congruences in this post, then we must have ##a = c## and only one of these coordinates can appear in any coefficient as well.)

Using (3), we can write out each of the possible connection coefficients from (2):

$$\Gamma^b_{ac} = \frac{1}{2} g^{bb} \left( – \partial_b g_{ac} \right)$$

$$\Gamma^a_{cb} = \frac{1}{2} g^{ad} \left( \partial_b g_{cd} \right)$$

The covariant derivative of a basis vector ##\hat{e}_{\mu}## in the direction of the 4-velocity can then be written as:

$$\left( \nabla_{\hat{e}_0} \hat{e}_{\mu} \right)^b = \Gamma^b_{ac} u^a e^c + \Gamma^b_{ed} u^d e^e = \frac{1}{2} \left[ g^{bb} \left( – \partial_b g_{ac} \right) u^a e^c + g^{bf} \left( \partial_e g_{df} \right) u^d e^e \right]$$

(Note also that in many cases, including all the ones we’ll be discussing, only one of the two terms above will be nonzero for a given derivative, which simplifies things even further.)

Now let’s try out a more complicated case in flat spacetime, the Langevin congruence. For this case, we will use a cylindrical chart on Minkowski spacetime, with line element

$$ds^2 = – dT^2 + dZ^2 + dR^2 + R^2 d\Phi^2$$

The frame field of static observers (i.e., observers at rest in the global inertial frame described by this chart) is

$$\hat{s}_0 = \partial_T$$
$$\hat{s}_1 = \partial_Z$$
$$\hat{s}_2 = \partial_R$$
$$\hat{s}_3 = \frac{1}{R} \partial_{\Phi}$$

This congruence obviously has zero proper acceleration and is therefore boring :wink:. But the more interesting Langevin congruence, describing observers at rest on a disk that is rotating about the ##Z## axis, has the frame field:

$$\hat{p}_0 = \gamma \partial_T + \gamma \omega \partial_{\Phi} = \gamma \hat{s}_0 + \gamma v \hat{s}_3$$
$$\hat{p}_1 = \partial_Z$$
$$\hat{p}_2 = \partial_R$$
$$\hat{p}_3 = \gamma \omega R \partial_T + \frac{\gamma}{R} \partial_{\Phi} = \gamma v \hat{s}_0 + \gamma \hat{s}_3$$

where ##\omega## is the angular velocity of rotation of the disk, ##v = \omega R## is the tangential velocity of the Langevin observer at radius ##R## on the disk, and ##\gamma = 1 / \sqrt{1 – v^2} = 1 / \sqrt{1 – \omega^2 R^2}##. Note the pleasing symmetry of ##\hat{p}_0## and ##\hat{p}_3## when written in terms of the static frame field vectors; it just expresses the obvious fact that the Langevin observer is moving at velocity ##v## relative to the static observer.

Using our machinery above, we can see that the only ##b## coordinate is ##R## and the only ##a## coordinate is ##\Phi##, so we can immediately read off:

$$\nabla_{\hat{p}_0} \hat{p}_0 = – \frac{1}{2} g^{RR} \partial_R g_{\Phi \Phi} u^{\Phi} u^{\Phi} \partial_R = – \gamma^2 \omega^2 R \partial_R = – \frac{\gamma^2 v^2}{R} \hat{p}_2$$

$$\nabla_{\hat{p}_0} \hat{p}_1 = 0$$

$$\nabla_{\hat{p}_0} \hat{p}_2 = \frac{1}{2} g^{\Phi \Phi} \partial_R g_{\Phi \Phi} u^{\Phi} r^{R} \partial_{\Phi} = \frac{\gamma \omega}{R} \partial_{\Phi} =\frac{\gamma v}{R} \hat{s}_3 = – \frac{\gamma^2 v^2}{R} \hat{p}_0 + \frac{\gamma^2 v}{R} \hat{p}_3$$

$$\nabla_{\hat{p}_0} \hat{p}_3 = – \frac{1}{2} g^{RR} \partial_R g_{\Phi \Phi} u^{\Phi} e^{\Phi} \partial_R = – \gamma^2 \omega \partial_R = – \frac{\gamma^2 v}{R} \hat{p}_2$$

We can see that ##\hat{p}_2## and ##\hat{p}_3## are *not* Fermi-Walker transported; in fact, noting that ##\hat{p}_3## is orthogonal to the proper acceleration, and that the first term of ##\nabla_{\hat{p}_0} \hat{p}_2## is ##\left( \hat{p}_2 \cdot \nabla_{\hat{p}_0} \hat{p}_0 \right) \hat{p}_0##, we can write

$$D_F \hat{p}_2 = \frac{\gamma^2 v}{R} \hat{p}_3$$
$$D_F \hat{p}_3 = – \frac{\gamma^2 v}{R} \hat{p}_2$$

This can be written as a “vorticity vector”:

$$\vec{\Omega} = \frac{\gamma^2 v}{R} \hat{p}_1 = \gamma^2 \omega \hat{p}_1$$

which captures the failure of the basis vectors to be Fermi-Walker transported; in other words, relative to gyro-stabilized vectors, these basis vectors are spinning about the ##Z## axis, in the same sense as the disk is rotating. Thus we can write:

$$\nabla_{\hat{p}_0} \hat{p}_0 = A \hat{p}_2$$
$$\nabla_{\hat{p}_0} \hat{p}_2 = A \hat{p}_0 + \Omega \hat{p}_3$$
$$\nabla_{\hat{p}_0} \hat{p}_3 = – \Omega \hat{p}_2$$

where

$$A = – \gamma^2 \omega^2 R = – \frac{\gamma^2 v^2}{R}$$

$$\Omega = \gamma^2 \omega = \frac{\gamma^2 v}{R}$$

In other words, we have a combination of the hyperbolic rotation in spacetime of ##\hat{p}_0## and ##\hat{p}_2## due to Fermi-Walker transport, and the extra rotation of ##\hat{p}_2## and ##\hat{p}_3## due to the nonzero vorticity.

But consider now what we have just said: relative to gyro-stabilized vectors, these basis vectors are spinning about the ##Z## axis, in the same sense as the disk is rotating. At first this seems obvious: the basis vectors in the plane of the rotation are obviously rotating with the disk. But there’s a wrinkle: ##\Omega## is larger in magnitude than ##\omega## by a factor of ##\gamma^2##. Even after removing a factor of ##\gamma## to account for time dilation (because ##\omega## is relative to an observer at infinity, but ##\Omega## is relative to the Langevin observer), this means that, relative to a stationary observer, gyro-stabilized vectors carried by a Langevin observer will rotate in the retrograde direction, because the twist of the Langevin congruence is larger than its angular velocity. This is known as Thomas precession.

In other words, the spatial basis vectors that are “not rotating” in the sense of being Fermi-Walker transported, are still rotating with respect to a distant observer. (For example, Fermi-Walker transported basis vectors would still be rotating, in the retrograde sense, with respect to a distant star.) This would not be the case in Newtonian mechanics. This is a key reason why Fermi-Walker transport is a useful concept: it allows us to distinguish local and global concepts of “not rotating”, so that we can make sure we are not making unwarranted assumptions that the two are the same in cases where they turn out not to be.

In the next post in this series, we’ll look at how this works in our first curved spacetime: Schwarzschild spacetime.

8 replies