Understanding General Covariance & Relativity Principle

Click For Summary
General covariance is a principle stating that physical laws should maintain the same form across different coordinate systems, including accelerated ones. It generalizes the relativity principle, which asserts that if a physical law holds in one inertial frame, it should hold in another. The discussion highlights that while general covariance can seem trivial, it serves as a guideline for formulating theories that are coordinate-independent. However, some argue that general covariance lacks physical content compared to the relativity principle, which has demonstrable implications. The conversation also touches on the distinction between general covariance and background independence, emphasizing the importance of understanding these concepts in the context of modern physics.
  • #61
PeterDonis said:
And possibly also mapping between different manifolds;
Sure.

PeterDonis said:
Can you give a specific example of a passive diffeomorphism that is not an invertible map between charts, but *is* an invertible map between tangent spaces at each point?

No, what is invertible is the function that maps open sets containing a point of the manifold to open sets at the tangent space, the inverse map (from the tangent space at a point in M to M itself) would be the exponential map and is given by the affine connection in GR.
The map between charts fails to be invertible but it is injective, it turns out that that is all you need for vacuum solutions and for conformally flat ones (like FRW metrics).
The examples are any coordinate transformation in GR of the above mentioned kind of solutions, that is what I mentioned as the solution to the Einstein's "hole argument". He realized that: "All our spacetime verifications invariably amount to a determination of spacetime coincidences." (Einstein, 1916, p.117)
The drawback is that there are no solutions in general relativity with point particle stress-energy.
 
Physics news on Phys.org
  • #62
lugita15 said:
GR doesn't admit global spacetime symmetries, only local ones.

This is a key point. Often not well explained in GR textbooks. This leads to Killing vector fields in GR not being global(except those geometrically imposed to solutions like spherical symmetry for instance), this took me a long time to grasp.
 
  • #63
lugita15 said:
It seems to me that what is nowadays called "general covariance" is indeed a statement with no physical content. But I think Einstein was describing a physically meaningful principle.

Let me put it this way. General covariance, let's call it statement 1, states "the laws of physics are the same in all coordinate systems". A special case of this is statement 2, "the laws of physics are the same in all coordinate systems that are at rest or in uniform motion". That sounds like the Principle of Relativity, doesn't it? So how can a physically meaningless statement imply a physically meaningful statement? I think that the answer to that is that statement 2 is NOT really the principle of relativity, although it sounds similar to it. Statement 2 is a statement that any theory can be made to satisfy, whereas the principle of relativity holds for some theories but not all, e.g. Aristotelian physics. So statement 2 is to the POR as statement 1 is to ... what?

I would say the physical special relativity principle is that rest cannot be distinguished from inertial motion (in GR, add : locally).

The simplest generalization is obviously false: accelerated motion cannot be distinguished from rest (or inertial motion). The best you have is a particular variant of the equivalence principle: you cannot locally distinguish accelerated motion from rest in a uniform gravitational field.

I think the modern view is the separate out some variant of equivalence principle, some variant of background independents or 'no prior geometry', and the principle relativity (SR sense, local). Then say there is no such thing as the 'general principle of relativity'. This is my view.

Then, general covariance is simply the result of formulating a theory in a coordinate independent manner.
 
  • #64
lugita15 said:
Perhaps you're misinterpreting statement 2. Surely "such-and-such is true for all coordinate systems satisfying condition X" is a logical consequence of "such-and-such is true for all coordinate systems". Do you disagree with that? So statement 2 is a logical consequence of statement 1, and thus statement 2 can't be the PoR.

Yes, if statement 2 taken to be a special form of statement 1 (general covariance), then it places no restriction on what theories are possible, ie. it is not the PoR.

In order for something with the same words as statement 2 to be the PoR, it cannot be a special form of statement 1. In particular, it must use a definition of "laws of physics have the same form" in which Christoffel symbols are not allowed.

Basically statement 1 is less restrictive than the PoR because it allows use "fudge factors" like the Christoffel symbols to write the laws of physics in "the same form", whereas in the PoR, the use of such fudge factors is not allowed.
 
  • #65
It seems to me that the question to be addressed isn't "Can the laws of physics be expressed the same in all coordinate systems?" but "Are there some coordinate systems in which all the laws of physics take a simpler form than in an arbitrary coordinate system?"

Before relativity, it was thought the answer was yes for the "aether frame" only. Special relativity asserts that the answer is yes for inertial frames. In general relativity the answer is no, but the equivalence principle says the answer is "locally yes as an approximation" for "locally inertial" frames.
 
Last edited:
  • #66
TrickyDicky said:
what is invertible is the function that maps open sets containing a point of the manifold to open sets at the tangent space, the inverse map (from the tangent space at a point in M to M itself) would be the exponential map and is given by the affine connection in GR.

Yes, no problem here. You can do this for the tangent space at any point in the manifold, and it doesn't even involve a chart, strictly speaking.

TrickyDicky said:
The map between charts fails to be invertible but it is injective, it turns out that that is all you need for vacuum solutions and for conformally flat ones (like FRW metrics).

Then I'm not sure I understand what you mean by "the map between charts". Take a case where the "double cover" issue in isotropic coordinates on Schwarzschild spacetime doesn't come up: suppose I want a map between ingoing Painleve coordinates and ingoing Eddington-Finkelstein coordinates on Schwarzschild spacetime. Both of these charts cover the exact same portion of the maximally extended manifold (regions I and II, as they're usually labeled on the Kruskal chart), and both of these charts assign unique coordinate values to every point of the manifold in the region they cover. So it seems to me that the map between them is obviously invertible and bijective. What am I missing? Or is it just that we are using different terminology, so you mean something else by "the map between charts" than what I just described? (And if so, what?)

TrickyDicky said:
The examples are any coordinate transformation in GR of the above mentioned kind of solutions, that is what I mentioned as the solution to the Einstein's "hole argument". He realized that: "All our spacetime verifications invariably amount to a determination of spacetime coincidences." (Einstein, 1916, p.117)

I understand and agree with the Einstein quote; the real content of our physical models is in observables like "the worldlines of objects A and B intersect at event E".
 
  • #67
TrickyDicky said:
This is a key point. Often not well explained in GR textbooks.
In fact, you can even define curvature as the extent to which global spacetime symmetries are violated.
 
  • #68
atyy said:
Yes, if statement 2 taken to be a special form of statement 1 (general covariance), then it places no restriction on what theories are possible, ie. it is not the PoR.

In order for something with the same words as statement 2 to be the PoR, it cannot be a special form of statement 1. In particular, it must use a definition of "laws of physics have the same form" in which Christoffel symbols are not allowed.

Basically statement 1 is less restrictive than the PoR because it allows use "fudge factors" like the Christoffel symbols to write the laws of physics in "the same form", whereas in the PoR, the use of such fudge factors is not allowed.
The difference between statement 2 and PoR is that PoR requires certain symmetry in laws of physics (they are transformed but look the same as a group but not individually) while statement 2 does not require any symmetry in laws of physics (each law individually have exactly the same form after transformation i.e. they are not transformed at all).
 
  • #69
zonde said:
The difference between statement 2 and PoR is that PoR requires certain symmetry in laws of physics (they are transformed but look the same as a group but not individually) while statement 2 does not require any symmetry in laws of physics (each law individually have exactly the same form after transformation i.e. they are not transformed at all).
What exactly do you mean by the laws of physics looking the same as a group vs looking the same individually? Also, what is your answer to the question: statement 2 is to the PoR as statement 1 is to ... what? That is to say, what is the physically meaningful statement that bears the same relation to statement 1 that the principle of relativity bears to statement 2? Would it be the statement that the laws of physics look the same "as a group" (whatever you mean by that) in all coordinate systems?
 
  • #70
lugita15 said:
What exactly do you mean by the laws of physics looking the same as a group vs looking the same individually?
Say there are physical laws A and B and there is such a transformation that law A after transformation looks like original law B and law B looks like original A. So that group consisting of A and B looks the same after transformation.

lugita15 said:
Also, what is your answer to the question: statement 2 is to the PoR as statement 1 is to ... what? That is to say, what is the physically meaningful statement that bears the same relation to statement 1 that the principle of relativity bears to statement 2? Would it be the statement that the laws of physics look the same "as a group" (whatever you mean by that) in all coordinate systems?
From physical standpoint statements 1 and 2 are equal IMO. So that I doubt there is meaningful answer to your question.

But I suppose you can say it the way you did: The laws of physics look the same "as a group" in all coordinate systems.
 
  • #71
zonde said:
Say there are physical laws A and B and there is such a transformation that law A after transformation looks like original law B and law B looks like original A. So that group consisting of A and B looks the same after transformation.
Could you give me an example of this? I've never heard of the phenomenon you describe.
 
  • #72
lugita15 said:
Could you give me an example of this? I've never heard of the phenomenon you describe.
When observing distant stationary star from moving observatory we have aberration. When observing distant moving from stationary observatory we observe it at it's past position.
 
  • #73
PeterDonis said:
Then I'm not sure I understand what you mean by "the map between charts". Take a case where the "double cover" issue in isotropic coordinates on Schwarzschild spacetime doesn't come up: suppose I want a map between ingoing Painleve coordinates and ingoing Eddington-Finkelstein coordinates on Schwarzschild spacetime. Both of these charts cover the exact same portion of the maximally extended manifold (regions I and II, as they're usually labeled on the Kruskal chart), and both of these charts assign unique coordinate values to every point of the manifold in the region they cover. So it seems to me that the map between them is obviously invertible and bijective. What am I missing? Or is it just that we are using different terminology, so you mean something else by "the map between charts" than what I just described? (And if so, what?)
We might be not coinciding in our idea of what bijectivity implies for differential manifolds mappings, rather than by map between charts.
To me the case you describe involves injectivity(that is you cannot map two different points of one chart to the same point of the other),with the point in the manifold remaining "static", we are just changing the coordinates of the point, it is a one-to-one function, but it is not a one-to-one correspondence, for that, as was previously commented, you need to be able to "move around" the points of the manifold (active diffeomorphisms) , but in GR you are limited in doing that due to the local nature of the spacetime symmetries, except for rotations that is a symmetry we usually impose on the solutions of the EFE.
What you call the "double cover issue" is just an instance where the lack of one-to-one correspondence manifests.
 
  • #74
It's amazing how such a widely used concept (general covariance) can still be so tricky.
 
  • #75
haushofer said:
It's amazing how such a widely used concept (general covariance) can still be so tricky.

Indeed, see for instance:"General covariance and the foundations of general relativity: eight decades of dispute" by J. D. Norton. It is linked in the wiki entry for GC.

Also this issue seems not to be fully addressed in the usual GR textbooks (maybe because it is still controversial to a certain point), and many people are still not clear on certain subtleties of the mathematical implications of coordinate transformations in GR's pseudoRiemannian manifolds versus general coordinate transformation in Riemannian manifolds.
In practical terms there is a further reason why all this seems to bother very few relativists, since most results in GR only require to be valid locally or "minimal coupling" prescription, these theoretical implications about GC had little impact on the core of GR computations and experimental work.
 
  • #76
TrickyDicky said:
We might be not coinciding in our idea of what bijectivity implies for differential manifolds mappings, rather than by map between charts.

I didn't realize "bijective" had multiple meanings; AFAIK it always means "one-to-one".

TrickyDicky said:
To me the case you describe involves injectivity(that is you cannot map two different points of one chart to the same point of the other),

But it's that way in both directions, i.e., bijective, not just injective.

TrickyDicky said:
with the point in the manifold remaining "static", we are just changing the coordinates of the point, it is a one-to-one function, but it is not a one-to-one correspondence, for that, as was previously commented, you need to be able to "move around" the points of the manifold (active diffeomorphisms)

But that's a different kind of transformation from a "map between charts". Or at least, the term "map between charts" does not suggest an active diffeomorphism to me, only a passive one. If you're moving around the points of the manifold, and you're also changing the coordinates (how the points in the manifold are labeled), then what exactly are you *not* changing? And if everything is changing, how do you even define the transformation?

TrickyDicky said:
What you call the "double cover issue" is just an instance where the lack of one-to-one correspondence manifests.

But the double cover issue I was talking about involves a passive diffeomorphism only; I was using it to illustrate that a "passive map between charts" (to make it clear what kind of diffeomorphism I'm talking about) might not be the same as a one-to-one passive diffeomorphism, because two different patches in one chart might map to the same patch in the other chart.
 
  • #77
PeterDonis said:
I didn't realize "bijective" had multiple meanings; AFAIK it always means "one-to-one".

It doesn't, what can be ambiguous is the one-to-one part, both injection and bijection are one-to-one, but the former doesn't include surjection, so it is not necessarily invertible for the complete codomain.

But it's that way in both directions, i.e., bijective, not just injective.
It may be for the neighbourhood of the point but not for the entire manifold.

But that's a different kind of transformation from a "map between charts". Or at least, the term "map between charts" does not suggest an active diffeomorphism to me, only a passive one. If you're moving around the points of the manifold, and you're also changing the coordinates (how the points in the manifold are labeled), then what exactly are you *not* changing? And if everything is changing, how do you even define the transformation?
What you are describing is known as (global)isometry , a type of active diffeomorphism that preserves the metric, and defines spacetime symmetries when the manifold is invariant to them. The thing to keep in mind in GR is that these isometries are local, not global, with the exception of those that might be imposed on the solutions, like isotropy.


But the double cover issue I was talking about involves a passive diffeomorphism only; I was using it to illustrate that a "passive map between charts" (to make it clear what kind of diffeomorphism I'm talking about) might not be the same as a one-to-one passive diffeomorphism, because two different patches in one chart might map to the same patch in the other chart.
They are the same thing, as I said it is an example where the lack of bijectivity of passive transformations shows up.
 
  • #78
TrickyDicky said:
It doesn't, what can be ambiguous is the one-to-one part, both injection and bijection are one-to-one, but the former doesn't include surjection, so it is not necessarily invertible for the complete codomain.

Yes, I see now, I was mixing up definitions in my head. Basically, injective is "one to one", surjective is "onto", and bijective = injective + surjective.

TrickyDicky said:
It may be for the neighbourhood of the point but not for the entire manifold.

Huh? The mapping between ingoing Painleve and ingoing Eddington-Finkelstein *is* bijective over the entire manifold, or at least the entire region of the manifold that they both cover. I suppose you could say that since that region is not the entire (maximally extended) manifold, the mapping can't "count" as bijective because there are points of the manifold that aren't mapped, but that applies equally well to both domain and codomain, and there doesn't really seem to be a word for a function that is "one to one", but doesn't map every element of its "domain" (since the definition of "domain" implicitly includes only points for which the function is defined).

TrickyDicky said:
What you are describing is known as (global)isometry , a type of active diffeomorphism that preserves the metric, and defines spacetime symmetries when the manifold is invariant to them.

Huh? In an isometry of the type you describe, the chart is held constant. More precisely, an isometry can be described without using a chart at all, so no issue of "mapping between charts" even arises. You just define equivalence classes of points in the spacetime with respect to the isometry. So I don't see how a "map between charts" even comes into play.

It would really be helpful if you would give a specific example; for example, take the "map between charts" that I gave (ingoing Painleve to ingoing Eddington-Finkelstein) and show explicitly how it relates to an isometry.
 
  • #79
PeterDonis said:
Huh? The mapping between ingoing Painleve and ingoing Eddington-Finkelstein *is* bijective over the entire manifold, or at least the entire region of the manifold that they both cover. I suppose you could say that since that region is not the entire (maximally extended) manifold, the mapping can't "count" as bijective because there are points of the manifold that aren't mapped...

Yes, that is what I'd say.



Huh? In an isometry of the type you describe, the chart is held constant. More precisely, an isometry can be described without using a chart at all, so no issue of "mapping between charts" even arises. You just define equivalence classes of points in the spacetime with respect to the isometry. So I don't see how a "map between charts" even comes into play.
Precisely isometry invariance is the property of Riemannian manifolds that allows us to do without coordinates, because in Riemannian geometry coordinate transformations are isometries(bijective).
In GR this is limited to its local counterpart, local isometries.

It would really be helpful if you would give a specific example; for example, take the "map between charts" that I gave (ingoing Painleve to ingoing Eddington-Finkelstein) and show explicitly how it relates to an isometry.
They are local isometries.
 
  • #80
TrickyDicky said:
Precisely isometry invariance is the property of Riemannian manifolds that allows us to do without coordinates, because in Riemannian geometry coordinate transformations are isometries(bijective).

Now you're using the word "isometry" in a different sense than I understand it. As I understand the term "isometry", it is what is generated by a Killing vector field. A KVF, and therefore an isometry, can certainly be defined in coordinate-free terms, but that doesn't mean coordinate transformations are isometries. An "isometry" is what you have been calling an "active diffeomorphism" (at least, if I understand your usage of *that* term right); for example, a rotation of a 2-sphere about any axis is an isometry, because it leaves the intrinsic geometry of the 2-sphere invariant. But that has nothing to do with coordinate transformations.

[Edit: I also don't understand why you appear to equate "isometry" with "bijective". I would agree that an isometry must be bijective, but I would not agree that every bijective transformation must be an isometry.]
 
  • #81
PeterDonis said:
. As I understand the term "isometry", it is what is generated by a Killing vector field.
Correct.

A KVF, and therefore an isometry, can certainly be defined in coordinate-free terms, but that doesn't mean coordinate transformations are isometries.
Right, it is the other way around, isometries are a subgroup of generalized coordinate transformations, consider this textbook definition:
"Isometry: a coordinate transformation x′^μ = x^μ + ζ^μ(x), which we think of as infinitesimal. The term isometry applies to any transformation that leaves the metric of the same form. The metric is form invariant under such a transformation. We will, however, only consider continuous symmetries."
Simply put an isometry is a (active)diffeomorphism that preserves the metric.

for example, a rotation of a 2-sphere about any axis is an isometry, because it leaves the intrinsic geometry of the 2-sphere invariant. But that has nothing to do with coordinate transformations.
Are you sure rotations are not a type of coordinate transformations? :rolleyes:

[Edit: I also don't understand why you appear to equate "isometry" with "bijective". I would agree that an isometry must be bijective, but I would not agree that every bijective transformation must be an isometry.
I don't equate them, it just happens that every isometry just by being a diffeomorphism is bijective. This is not the case with local isometries(that is local diffeomorphisms that preserve the metric), which are injective.


Maybe you should try and compare what I'm saying with a GR/differential geometry textbook.
 
  • #82
TrickyDicky said:
Simply put an isometry is a (active)diffeomorphism that preserves the metric.

I'm fine with that.

TrickyDicky said:
Are you sure rotations are not a type of coordinate transformations? :rolleyes:

Not the way I think of "coordinate transformations", no. Coordinate transformations ought to, it seems to me, involve coordinates. A rotation does not involve coordinates; it can be defined without ever talking about coordinates at all. See further comments below.

TrickyDicky said:
I don't equate them, it just happens that every isometry just by being a diffeomorphism is bijective. This is not the case with local isometries(that is local diffeomorphisms that preserve the metric), which are injective.

Once again, it would be really helpful if you could give a specific example. So far every time I've asked you to do that, you've just stated that an example I gave applies. If that were enough to resolve my confusion, I wouldn't have needed to ask you for an example. I am asking *you* to explicitly exhibit an example of a global isometry and a local isometry and show how they are different, and why the former must be bijective while the latter may only be injective (i.e., not surjective). All the examples I can come up with to fit the term "local isometry" are either bijective, or not even injective (e.g., the mapping between isotropic and Schwarzschild coordinates, if we consider both patches of isotropic coordinates mapping to a single patch of Schwarzschild coordinates as a single "mapping", is not even injective).

TrickyDicky said:
Maybe you should try and compare what I'm saying with a GR/differential geometry textbook.

Different textbooks appear to use different terminology as well, so that's not necessarily helpful. I'm not confused about the underlying concepts; I'm confused about which terms you are using to refer to which underlying concepts. I don't have any particular attachment to any particular terminology; I have preferences, but I'm perfectly willing to put them aside and adopt your terminology (or anyone else's) for the sake of having a clear discussion. But I have to be able to understand *what* your terminology is to do that. Even a simple statement like "I'm using the same terminology as textbook X" would help, but just saying "textbooks" or "a textbook" without saying which one is not helpful, because, as I said, they use different conventions for terminology.

Anyway, this subthread seems to me to be getting away from the main topic, which is general covariance. As far as that is concerned, I basically agree with the position Ben Niehoff stated in the second post in this thread: *any* physical theory can be written in "coordinate-independent" form, so general covariance as it's usually stated is trivial; it's just a reminder to write theories in coordinate-independent form. I.e., general covariance has nothing much to say about the *content* of a physical theory; and once we start talking about isometries and other properties of solutions, we are talking about content, not form.
 
  • #83
PeterDonis said:
Not the way I think of "coordinate transformations", no. Coordinate transformations ought to, it seems to me, involve coordinates. A rotation does not involve coordinates; it can be defined without ever talking about coordinates at all. See further comments below.
Ok, I think I know what you mean.
All comes from the two different senses of "coordinate transformation". When you say a rotation (as a global symmetry) can be defined in a coordinate-free way, you are of course right, this transformation is a diffeomorphism and therefore a bijection, and it defines two coordinate transformations in the case we want use coordinates to describe a fixed point in the manifold. These diffeomorphisms are sometimes called coordinate transformations by mathematicians but they are clearly not what physicists usually consider coordinate transformations, which are what are called passive transformations, and I call in the case of GR local isometries. I think these are the ones general covariance in GR refer to when talking about coordinate transformation invariance.
PeterDonis said:
Once again, it would be really helpful if you could give a specific example. So far every time I've asked you to do that, you've just stated that an example I gave applies. If that were enough to resolve my confusion, I wouldn't have needed to ask you for an example. I am asking *you* to explicitly exhibit an example of a global isometry and a local isometry and show how they are different, and why the former must be bijective while the latter may only be injective (i.e., not surjective). All the examples I can come up with to fit the term "local isometry" are either bijective, or not even injective (e.g., the mapping between isotropic and Schwarzschild coordinates, if we consider both patches of isotropic coordinates mapping to a single patch of Schwarzschild coordinates as a single "mapping", is not even injective).
Peter, you know I'm just an interested layman, the farthest from a physicist or a mathematician, I'm trying to help but I might not be the most qualified to do that, I was hoping some of the pros would jump in. In the meantime to me for instance rotations in the Schwarzschild spacetime are global isometries, and any passive coordinate transformation from a point in Schwarzschild coordinates to a different chart is a local isometry.
PeterDonis said:
Anyway, this subthread seems to me to be getting away from the main topic, which is general covariance. As far as that is concerned, I basically agree with the position Ben Niehoff stated in the second post in this thread: *any* physical theory can be written in "coordinate-independent" form, so general covariance as it's usually stated is trivial; it's just a reminder to write theories in coordinate-independent form. I.e., general covariance has nothing much to say about the *content* of a physical theory; and once we start talking about isometries and other properties of solutions, we are talking about content, not form.
I also agree.
 
  • #84
TrickyDicky said:
These diffeomorphisms are sometimes called coordinate transformations by mathematicians but they are clearly not what physicists usually consider coordinate transformations, which are what are called passive transformations, and I call in the case of GR local isometries. I think these are the ones general covariance in GR refer to when talking about coordinate transformation invariance.

I agree.

TrickyDicky said:
Peter, you know I'm just an interested layman

So am I. :wink:

TrickyDicky said:
rotations in the Schwarzschild spacetime are global isometries, and any passive coordinate transformation from a point in Schwarzschild coordinates to a different chart is a local isometry.

I'll have to take some time to work through these examples with the definitions.

I do have one rather lengthy comment: transformations between charts aren't always viewed as "local". Some are, for example transformations between Fermi normal coordinates for two observers in relative motion at a particular event. But others are not, for example the transformation between Painleve and Eddington-Finkelstein coordinates; that transformation applies at every point in the manifold that is covered by both charts.

However, the latter transformation is "local" in another sense, that it maps the *same* point from one chart to the other, so it can be viewed as an infinite "family" of transformations, each one mapping a single point only. A rotation (or in general any active transformation) can't be viewed this way; it intrinsically is a mapping from the entire manifold into itself (or into another manifold, in the case of a more general active transformation), and can't be "decomposed" into a family of local transformations that each affect only a single point.

(I think you basically said this in an earlier post, but I wasn't really grokking it until I stepped back and walked through things in more detail.)
 
  • #85
I still don't get it really, reflecting the fact that some of my earlier posts here confuse different things.

I think most people now agree that general covariance, the ability to write the equations of motion in a gct-covariant way, is physically void. But I also sometimes see the statement that the real deal of GR is "background independency" or the lack of "a priory geometry": the geometry of spacetime, uniquely determined by the metric, is dynamical and obeying EOM called the Einstein equations. But Newton-Cartan theory is in the same way "background independent": both the metrics (spatial and temporal) are dynamically determined by equations analogous to the Einstein equations of GR. The connection is not uniquely determined by both metrics; one obtains an extra vector field, but that does not change the matter. I could even apply the hole argument for Newton-Cartan theory in the same way as for General Relativity, because the EOM are gct-invariant.

The solution should thus be found in the fact that GR is really a non-linear self-interacting theory of massless spin-2 particles which becomes clear after gauge-fixing, while for Newton-Cartan this cannot be said: after gauge-fixing one obtains a spin (spin is here wrt to the Galilei group!) 0 theory, which is static and non self-interacting.

Does this make sense? And what does "background independence" really mean then?
 
  • #86
haushofer said:
what does "background independence" really mean then?

I think it is actually a misnomer and somewhat ill-defined but anyway I always understood it as something like the difference between the EM theory of Maxwell and the GRT of Einstein, in the sense that the former equations refer to fields that act in " a background space" and so are "background dependent" while the latter equations refer to a field that "is" the spacetime in itself and therefore "background independent".
I say it is a misnomer and superfluous term because at least since Riemann we know manifolds don't need to be embedded in any "background space" to be defined.
 
  • #87
TrickyDicky said:
I say it is a misnomer and superfluous term because at least since Riemann we know manifolds don't need to be embedded in any "background space" to be defined.

True, but there is a difference between the manifold being predetermined (as in EM) and it being dynamic (as in GR). I agree that "background independence" is a bad term for the latter case, though; why not "dynamic manifold" or something like that? After all, the key difference is that in GR the manifold (spacetime) appears in the dynamical equations of the theory, where in EM it doesn't.
 
  • #88
PeterDonis said:
True, but there is a difference between the manifold being predetermined (as in EM) and it being dynamic (as in GR). I agree that "background independence" is a bad term for the latter case, though; why not "dynamic manifold" or something like that? After all, the key difference is that in GR the manifold (spacetime) appears in the dynamical equations of the theory, where in EM it doesn't.

I agree with you on the terminology, but bad terminology seems to dominate this whole discussion anyway ;) So in the sense of BI Newton-Cartan theory and GR don't differ. But why do people like Rovelli then keep hammering on the importance of BI, if clearly even Newtonian gravity can be made BI? Clearly, it doesn't say that much.

I think ultimately, the fact that GR is BI is not a defining property of the theory; what is the defining property is what is left of your theory after you have fixed gauges to uncover physical degrees of freedom (in the case of GR, this comes down to a perturbative analysis and noticing that one is really dealing with massless self-interacting spin-2).

Do you agree that the hole argument is just as applicable to Newton-Cartan theory as GR?
 
Last edited:
  • #89
haushofer said:
So in that sense Newton-Cartan theory and GR don't differ.

I'm not sure that's true. Newton-Cartan theory has absolute time and an absolute slicing of the complete manifold into spacelike slices, so the spacetime isn't completely dynamic as it is in GR. (Btw, you said in an earlier post that N-C theory determines the "temporal metric" dynamically; I'm not sure that's true either. There is no gravitational time dilation in N-C theory.)

haushofer said:
I think ultimately, the fact that GR is BI is not a defining property of the theory

I think the question here is, is GR the *only* possible theory that is BI in the way GR is? (I.e., with a *completely* dynamic spacetime metric.) I don't think anybody really knows the answer to that.
 
  • #90
PeterDonis said:
I think the question here is, is GR the *only* possible theory that is BI in the way GR is? (I.e., with a *completely* dynamic spacetime metric.) I don't think anybody really knows the answer to that.

The answer is obviously "no". GR is defined by the Einstein-Hilbert action

S = \int (R(g) - 2 \Lambda) \, \sqrt{|g|} d^4x
Clearly I can put together any curvature invariants I feel like into a Lagrangian and I will have another theory where the metric is dynamic. "Background independent" is not overly restrictive.
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 29 ·
Replies
29
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 9 ·
Replies
9
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
Replies
5
Views
2K
  • · Replies 34 ·
2
Replies
34
Views
5K
Replies
16
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K