# Motivation for tensors in GR

## Main Question or Discussion Point

I quite often hear that GR is formulated in terms of tensors because laws of physics expressed in terms of tensor equations are indepedent of choice coordinates because they transform nicely'.

I thought the motivation for tensors was that since spacetime is curved, we locally linearize it by introducing the tangent space to a point. Then since all physics happens within the tangent space, all quantities of interest can be expressed as multilinear mappings from the tangent/cotangent space to the real numbers. Coordinate invariance then follows because of the same reason that vectors have a coordinate independent existence in the tangent space, tensors have a coordinate independent existence in tensor product space.

I don't see the connection between coordinate invariance and transforming nicely'' under a change of coordinates. Ie, what would be the supposed bad' implication of not transforming according to the tensor transformation law??

Last edited:

## Answers and Replies

Related Special and General Relativity News on Phys.org
CompuChip
Homework Helper
Not transforming according to the tensor transformation law would mean that you would get additional terms containing partial derivatives of one set of coordinates with respect to the other, which would not cancel in the end. This means that your answer would become dependent on the coordinates you would choose.

I guess you could say that, from a differential geometry point of view, tensors (or something equivalent to tensors), are a unique way of dealing with functions and their derivatives on manifolds. I think uniqueness follows from a choice of defining an inner product, but anyway. So once you've agreed on how to take dot products on flat space, you either end up with tensors and metrics and whatnot on curved space, or you end up with something else. The something else isn't going to be coordinate invariant because of uniqueness.

Now, historically a lot of "reasons" people had turned out to be false but had an important role to play in developing our understanding of GR. I think that few people have been more confused by GR than Einstein himself.

In any event, for a concrete example, lets say I can find a coordinate system in which I can write the laws of physics as $$\partial_\mu \partial^\mu \Phi = 0$$ where $$\Phi$$ is just some scalar field. Now, no one in their right mind would claim this has any meaning (what I wrote down is garbage), but bear with me. So I go out and claim that this is a law of nature. Someone else picks a different coordinate system, and they evaluate what $$\partial_\mu \partial^\mu \Phi$$ is. Now, they take their resulting expression, and write it in terms of my coordinates, and they do not find $$0$$. Obviously my coordinates were special (specifically, $$\Gamma^\alpha_{\beta\gamma} = 0$$) for me, whereas not for them, and what I thought was a "law of nature" was more like a crude observation which doesn't immediately generalize.

I think it is important to phrase the answer to your question in this way, because there are different meanings of coordinate invariance (to some people, at least). But here it is meant in the sense that "the equation looks the same" rather than "evaluates to the same".

I quite often hear that GR is formulated in terms of tensors because laws of physics expressed in terms of tensor equations are indepedent of choice coordinates because they transform nicely'.
Tensors are actually used in Newtonian physics as well. 3-vectors and 3-tensors are tensors by definition. They are referred to as Cartesian tensors (aka Affine tensors). They are defined according to how their components transform under an orthogonal transformation. In SR the tensors are Lorentz tensors which are defined according to how their components transform under a Lorentz transformation. In GR tensors can be defined according to how their components transform under an arbitrary coordinate transformation. The reason they transform as such is because their definition is independant of any particular coordinate system and laws of physics must be able to be defined in such an arbitrary fashion.
I thought the motivation for tensors was that since spacetime is curved, we locally linearize it by introducing the tangent space to a point.
The motication is that the laws of physics must have the same form in in all coordinate systems, regardless of whether the undelying manifold is curved or not.
I don't see the connection between coordinate invariance and transforming nicely'' under a change of coordinates. Ie, what would be the supposed bad' implication of not transforming according to the tensor transformation law??
"Invariance" refers to the geometric nature of tensors in that they have a coordinate independant meaning. In fact a tensor can be defined as a map from vectors and 1-forms to real numbers (scalars). No coordinate system is used in such a definition. However once a coordinate system is defined it can be shown that it implies the transformation properties that you're referring to given the definition of how the components of vectors and 1-forms transform under coordinate transformation. The "bad" impilication that you're asking about is coordinate dependance. Such a dependance would violate a principle of general relativity.

Pete

Any tensor equation can be written as $T = 0$ where $T$ is some tensor and 0 is the zero tensor e.g. in component notation for a a rank-2 tensor this would read $\sum_{\mu,\nu}T^{\mu\nu}\partial_\mu\otimes\partial_\nu = 0$.

If I locally reparametrize the manifold (ie change coordinates by rotating, Lorentz boost or whatever) then the following will be true

$\sum_{\mu,\nu}T'^{\mu\nu}\partial'_\mu\otimes\partial'_\nu = 0$ where the new components are related to the old ones by the tensor transformation law. This obviously looks the same as the first expression. But what does any of this have to do with cancelation of terms, nothing canceled.

I think the problem I'm having is trying to understand why the form of ANY equation could depend on the coordinates chosen. A concrete example of may assist here.

It seems so obvious that if we only have access to the tangent space then our laws of physics will depend on those vectors. Saying that an equation of physics is has the same form in any coordinate system is merely a restatement of the fact that tensors are built out of tangent and cotangent vectors which are themselves coordinate independent objects.

Fredrik
Staff Emeritus
Gold Member
It seems to me that the real question here is "Why can the laws of physics be expressed in terms of real-valued functions on M and local sections of tensor bundles of M?" I don't see how that question can be answered with anything but "We're just lucky I guess", or a direct derivation from a more fundamental theory.

Edit: Oooooh....ooohhhh...500 posts.

Last edited:
I think the problem I'm having is trying to understand why the form of ANY equation could depend on the coordinates chosen. A concrete example of may assist here.

It seems so obvious that if we only have access to the tangent space then our laws of physics will depend on those vectors. Saying that an equation of physics is has the same form in any coordinate system is merely a restatement of the fact that tensors are built out of tangent and cotangent vectors which are themselves coordinate independent objects.
There are several ways to define things like vectors and tensors. It is an unfortunate state of affairs that physicists generally prefer the coordinate-based method. In essence, you need to distinguish between two things --- the tensor itself and the list of numbers that you choose to use to represent the tensor. The latter requires a coordinate system or frame for it to make sense, but the former is independent of the choice. Pure mathematicians often define vectors spaces as a purely algebraic structure, and tensors as linear spaces on top of that. As you've noticed, this has nothing to do with the way they "transform". Choosing a frame, and then writing down a set of numbers allow you to do concrete calculations with them. Change the frame, and the same tensor now has different numbers --- that is the transform. The fact is that the transformation is always a specific type, and can be calculated independently. Now the amazing thing is that you can also go backwards. So start with lists of numbers, and require that they transform in a certain way when you make a "change of basis", and you can recover the coordinate-free description as the unique result.

Hi Genneth,

I also find the physicist way'' of defining tensors to be extremely annoying because of its coordinate dependence, even though I consider myself a physicist.

Let me see if I can run through cartesian tensors the way a mathematician would see them. The first thing you do when you encounter a space $S$ with an inner product is to linearize it by introducing a vector space to each point. This is done by first defining a parametrization of the space and then defining the tangent space to be the set of derivatives of smooth curves passing through the point.

In the case of Euclidean space, there is a well-defined mapping between any tangent space defined by parallel transport, thus it is only necessary to consider the tangent space at the origin.

Note that a choice of basis for the tangent space is determined by a choice of parametrization $F : \mathbb{R}^3 \to \mathbb{E}^3$. $F$ should be conceputalized as a physical measurement apparatus for assigning numbers to points in space (e.g. using steel rulers).

I think it is important to realize (as many physicists probably overlook this), that one cannot even talk about vectors (abstract algebraic things) in Euclidean space unless one has a notion of how to parametrize Euclidean space. It is through this parametrization that one defines vectors (which turn out to be independent of the choice of parametrization).

What the physicists don't seem to realize is that the choice of parametrization is totally arbitrary. In the same way that I could change parametrization by rotating my steel rulers, I could also change to spherical polar coordinates and THEN rotate my coordinates. In spherical coordinates, the components won't change under rotation the way a physicist would expect them to, so they would arge that the object is not a vector, but they are wrong, because they don't understand what a vetor is.

Now the amazing thing is that you can also go backwards. So start with lists of numbers, and require that they transform in a certain way when you make a "change of basis", and you can recover the coordinate-free description as the unique result.
I'd be interested to see the proof of this. Can suggest a reference please?

So once you've agreed on how to take dot products on flat space, you either end up with tensors and metrics and what not on curved space, or you end up with something else. The something else isn't going to be coordinate invariant because of uniqueness.
Do you know a proof of this?

Do you know a proof of this?
Well, mmm... On a smooth manifold there is a natural correspondence between tangent vectors to curves through a point and directional derivatives of functions on the manifold at that point. So we establish $$\vec{v}(f) \mapsto \tfrac{d}{d \lambda}(f)$$. At a point, these span the tangent space $$T_p(M)$$. Now be daring and define it's dual space of oneforms like $$df$$ with the rule $$\langle \vec{v}, df\rangle = \tfrac{df}{d\lambda}$$, and which span $$T^*_p(M)$$.

Now, if you expand $$\vec{v}$$ in terms of a basis $$\tfrac{\partial}{\partial x^\mu}$$ and $$df$$ in terms of a dual basis $$dx^\mu$$ you will find that the inner product $$\langle\, , \, \rangle$$ is the normal Euclidean one, and you can convince yourself that it is coordinate invariant.

Now, to do anything like GR we would actually need to add "geometry" to the manifold in addition to topology, and so we need to know how to take dot products between vectors. A "natural" way to do this is a map from vectors to oneforms, since we already have an induced inner product. This map is the metric, and when you've done this, you get something like the Euclidean dot product between vectors when you have a flat metric.

I should retract my statement that it is a unique way. I don't really know how else you would do it. What I mean is that it is a "natural" way, in the sense that you take what you do in flat space, but do it in a coordinate free manner. When you make the space curved, everything follows from your coordinate invariant inner product $$\langle \, , \, \rangle$$ and your choice of metric.

As a physicist, I want to address your "don't know what a vector is", but right now I have to make supper =)

Edit: my discussion about tangent spaces is for the benefit of those joining the discussion, of course.

Last edited:
Hi Genneth,
What the physicists don't seem to realize is that the choice of parametrization is totally arbitrary. In the same way that I could change parametrization by rotating my steel rulers, I could also change to spherical polar coordinates and THEN rotate my coordinates. In spherical coordinates, the components won't change under rotation the way a physicist would expect them to, so they would arge that the object is not a vector, but they are wrong, because they don't understand what a vetor is.
Are you referring to active verus passive transformations?

In any case, in spherical coordinates, if you rotate your coordinates then the components of the vector would change exactly as a physicist would expect them to... if not, you're hanging out with the wrong physicists =)

In spherical polar coordinates If I rotate about the z-axis by $\Delta \varphi$ then my coordinates change by $(r,\theta,\varphi) \mapsto (r,\theta, \varphi + \Delta\varphi)$: not what was expected from the definition of a cartesian tensor.

Last edited:
In spherical polar coordinates If I rotate about the z-axis by $\Delta \varphi$ then my coordinates change by $(r,\theta\,varphi) \mapsto (r,\theta, \varphi + \Delta\varphi)$: not what was expected from the definition of a cartesian tensor.
In what universe? =)

Why do you question that the components will transform differently, it seems clear to me that if you represent a vector in the curvilinear basis then the components will transform differently under rotation than were you to represent the vector in an orthogonal basis.

Getting back to the motivation for tensors, I think you might be able to at least partially justify it as follows:

Given a tensor $T$, we know from the chain rule that if the component representation wrt the parametrization $x$ is $T^{\mu_1,\ldots,\mu_k}_{\nu_1,\ldots,\nu_k}$, then the components with respect to another parametrization are

$T^{\mu_1',\ldots,\mu_k'}_{\nu_1',\ldots,\nu_k'} = \frac{\partial x^{\mu_1'}}{\partial x_{\mu_1}}\cdots \frac{\partial x^{\mu_k'}}{\partial x_{\mu_k}}\frac{\partial x^{\nu_1'}}{\partial x_{\nu_1}}\cdots \frac{\partial x^{\nu_k'}}{\partial x_{\nu_k}} T^{\mu_1,\ldots,\mu_k}_{\nu_1,\ldots,\nu_k}$

Thus, given tensor equation $T=0$ we then have

$T^{\mu_1,\ldots,\mu_k}_{\nu_1,\ldots,\nu_k} = 0$

and moreover

$T^{\mu_1',\ldots,\mu_k'}_{\nu_1',\ldots,\nu_k'} = \frac{\partial x^{\mu_1'}}{\partial x_{\mu_1}}\cdots \frac{\partial x^{\mu_k'}}{\partial x_{\mu_k}}\frac{\partial x^{\nu_1'}}{\partial x_{\nu_1}}\cdots \frac{\partial x^{\nu_k'}}{\partial x_{\nu_k}} \times 0 = 0$.

Thus the tensor equation holds in all frames of reference.

Surprisingly simple yet illustrates the point.

Why do you question that the components will transform differently, it seems clear to me that if you represent a vector in the curvilinear basis then the components will transform differently under rotation than were you to represent the vector in an orthogonal basis.
I'm not questioning it... I just think every physicist knows this... I'm confused about these physicists that expect them to transform differently of whom you are talking about. Anyway, your example seems to answer the original post, but maybe it is a bit vacuous? Or maybe too clever =)

lbrits,

It was just the kind of vacuous answer I was looking for :)

Of course, you still need uniqueness. It may be there are other structures that transform according to generalized matrix multiplication'' which tensors obey, but with different transformation coefficients.

I'd be interested to see the proof of this. Can suggest a reference please?
Any of the standard GR texts should contain proofs. MTW or Wald should both have it.

What the physicists don't seem to realize is that the choice of parametrization is totally arbitrary.
Exactly who are these physicists that you speak of? I studied GR from Wald and he was very careful, methodical and rigorous in his approach to tensors.

I was being a bit harsh when I made that criticism.

It is certainly true, however, that the majority of physicists would define tensors in terms of their transformation properties.

This is especially true in quantum theory, where, for example, elementary particles are defined in terms of how they transform under irredubible representations of the special orthogonal or Lorentz group.

In general relativity the situation is more straightforward since every equation is a tensor equation, and such equations are naturally covariant.

For more on that I refer you to another post: