Einstein simultaneity: just a convention?

Click For Summary
The discussion centers on the nature of Einstein's simultaneity and its implications for the Lorentz transformation, debating whether it is an arbitrary convention or a fundamental aspect of time and motion. Participants express differing views on whether the isotropic speed of light is a law of nature or merely a convenient coordinate choice stemming from Einstein's framework. There is a concern about the pedagogical approach to teaching special relativity, particularly regarding the historical context of the ether and the implications of simultaneity. Some argue for a more refined teaching method that emphasizes the mathematical and physical significance of Einstein's synchronization convention. Ultimately, the conversation highlights the complexities in understanding and teaching the foundational principles of relativity.
  • #61
Antenna Guy said:
Can a geodesic in GR be a straight line?
All geodesics are, by definition, straight lines.

What curves a geodesic in GR?
Geodesics are not curved.



The geometry of space-time yields a meaning to the term "straight" (which I will henceforth call 'intrinsic-straight'). Among other things, the path traced out by a geodesic is defined to be 'intrinsic-straight'.

There is a 'usual' way to attach an affine structure to R^4, and this gives another meaning to the term "straight" (which I wlil henceforth call 'coordinate-straight'). There is no good reason to do so -- it's simply that in a different context, we often use the same set R^4 as the underlying set of an affine space, so there is a temptation to invoke that affine structure in this context.

The point is, when you select coordinates, that affine structure usually has absolutely nothing to do with the geometry of space-time; the notions of intrinsic-straightness and coordinate-straightness are different. An intrinsic-straight path will generally be coordinate-curved. Similarly, a coordinate-straight path will generally be intrinsic-curved.
 
Last edited:
Physics news on Phys.org
  • #62
Hurkyl said:
The point is, when you select coordinates, that affine structure usually has absolutely nothing to do with the geometry of space-time; the notions of intrinsic-straightness and coordinate-straightness are different. An intrinsic-straight path will generally be coordinate-curved. Similarly, a coordinate-straight path will generally be intrinsic-curved.

I think I follow.

Would it be correct to say that a particle following a geodesic (intrinsic-straight path in space-time) could be mapped as a classical trajectory (coordinate-curved path in space)?

Regards,

Bill
 
  • #63
Hi bernhard, I wondered how long it would take for you to join the conversation! :smile:
bernhard.rothenstein said:
Please follow the following thoughts:
1.Einstein's transformation equations hold only with Einstein synchronized clocks t(E) and t"(E).
Yes, everyone already agreed with that. The underlying Minkowski geometry is present regardless of the synchronization convention.
 
  • #64
convention simultaneity

DaleSpam said:
Hi bernhard, I wondered how long it would take for you to join the conversation! :smile:Yes, everyone already agreed with that. The underlying Minkowski geometry is present regardless of the synchronization convention.

what about my thoughts 2 and 3?
 
  • #65
bernhard.rothenstein said:
what about my thoughts 2 and 3?
For 3 the ether is non-physical and the absolute frame is arbitrary, so I don't care.

For 2 I don't know enough about the specific transformations you referenced to comment about them in particular. But I already gave general comments about coordinate systems and synchronization conventions in https://www.physicsforums.com/showpost.php?p=1720437&postcount=26".
 
Last edited by a moderator:
  • #66
DaleSpam said:
If you are going to try to win an argument by appeal to authority you should at least try to do better than Wikipedia. RL Faber. "Differential Geometry and Relativity Theory: An Introduction" has a whole chapter entitled "Special Relativity: the Geometry of Flat Spacetime". Or MS Parvez. "On the theory of flat spacetime" which says in the abstract "Special relativity, in essence, is a theory of four-dimensional flat spacetime".
Both of those chapter headings are of course true (I never said otherwise), and neither are responsive to the issue of "what is the word 'special' there to imply".
I note that you didn't address the point I made about geodesics.
It required no comment, I am aware that geodesics in general relativity become straight lines in special relativity. Again, it's simply not responsive to the question of the meaning of "special", and again I repeat that this comes from the specialness of the treatment of inertial frames. That is the key element that distinguishes the approach of general and special relativity, in any kind of spacetime.

Still, this is not a terribly important semantic question-- you are welcome to your opinion on that matter, and it may be unanswerable because both the name and the theory have evolved so much that multiple interpretations of that process may be possible.
Sorry Ken, you need to read up a little more. The first postulate is in fact closer to your second statement than the first. The first postulate is, in Einstein's words, "the same laws of electrodynamics and optics will be valid for all frames of reference for which the equations of mechanics hold good ... The laws by which the states of physical systems undergo change are not affected, whether these changes of state be referred to the one or the other of two systems of co-ordinates in uniform translatory motion".
Again, I can't agree that those statements support your contention. When Einstein said "all frames for which the equations hold good", he obviously means "all inertial frames". And in the second sentence, he said that the changes have to be referred to the coordinates used by inertial observers, but if one is going to "refer" to coordinates willy nilly, there is no problem with simply using "the coordinates of the King", and be done. His point is that you can choose any inertial frame, i.e., any of the special frames, to refer to, and use that special frame, where the postulates apply, to translate between measurements by noninertial observers. It is implicit that the coordinates of that inertial frame correspond to the measurements of a hypothetical inertial observer, i.e. of the special class of observers that define special relativity. Whether or not that special observer actually exists is irrelevent, physics uses hypothetical observers all the time, as did Einstein.
 
Last edited:
  • #67
Ken, this conversation is getting repetitive and boring.

In summary:
1) The general theory of relativity simplifies to the special theory of relativity in flat spacetime, hence SR is a special case of GR.
2) The first postulate refers to inertial reference frames, not inertial observers.
3) Einstein synchronization is a convention.
4) The fundamental and coordinate independent concept of SR is the Minkowski geometry of spacetime.

I'm done.
 
  • #68
Hurkyl said:
Again, you're confusing "the metric" with "coordinate representation of the metric"
Normally, you start out by defining a bunch of vectors, and a metric is a way to take those vectors two at a time and associate a number with each pair in a bilinear way. Unless you plan to enumerate every such pairing, you will need a convenient way to name the vectors, such that the metric can work automatically on that naming convention. That's called a coordinatization. Then you name the metric by how it functions on that vector-naming convention, that's what is meant by the "Minkowski metric". Thus, the naming convention on the vectors is presumed in the naming of the metric. It is not I who confuses that with what a metric is-- it is the way the Minkowski metric is taught that does that, and this is very much the point of the thread.
It sounds like you're claiming that, in Euclidean geometry, 'distance' is a coordinate-dependent notion. :confused: If that really is what you're saying, then I posit that you need to review elementary geometry before continuing to reflect upon physics.
I posit you need to read my words more carefully. What I was saying is that the way we produce a concept of distance is by the use of a metric, or an inner product if you will. You said that Euclidean geometry "preserves" lengths and angles. I presumed your use of the word "preserve" meant "leaves invariant under some type of mapping of the space into itself". Then I pointed out that, if you did mean that, the statement only holds on the subclass of mappings that are "orthonormal" under the action of your metric. If you only meant that distances are by definition the same no matter how you coordinatize the space, then (1) there's no meaning to the word "preserve", as there's nothing to preserve, you've already declared by fiat what the distance is, (2) that would hold in any geometry that admits a metric, and (3) that is what I meant by "redefining what you mean by distance" (I probably should have said redefine what you mean by an inner product that magically knows what the vector was before its name got changed).

Oh, and drop the haughtiness, it's not being backed up.
 
Last edited:
  • #69
DaleSpam said:
1) The general theory of relativity simplifies to the special theory of relativity in flat spacetime, hence SR is a special case of GR.
Agreed, nor did I ever express any disagreement. The reason behind this is summed up well by John Baez(http://math.ucr.edu/home/baez/physics/Relativity/SR/acceleration.html):
"The difference between general and special relativity is that in the general theory all frames of reference including spinning and accelerating frames are treated on an equal footing. In special relativity accelerating frames are different from inertial frames. Velocities are relative but acceleration is treated as absolute. In general relativity all motion is relative. To accommodate this change general relativity has to use curved space-time. In special relativity space-time is always flat."

So there it is, we see that indeed special relativity does involve the restriction to flat spacetime, and the reason for this, and where the word "special" comes from, is that inertial frames are treated differently than noninertial frames, i.e., inertial frames are special in that theory. Take it up with Dr. Baez, I can't delve any deeper than I have already.

2) The first postulate refers to inertial reference frames, not inertial observers.
Indeed, but be careful you are not implying that the explicit reference to inertial frames does not include the all-important implicit reference to inertial (possibly hypothetical) observers. How else do you plan to define an inertial frame but by using observers, possibly hypothetical, with accelerometers that read zero? (See the definition you linked-- unfortunately that same library has no entry for "inertial reference frame".)
3) Einstein synchronization is a convention.
Agreed, when we have access to the concept of the measurements of hypothetical inertial observers. Then when we add the Einstein convention, we get that the connection between those observer coordinates is the Lorentz transformation, and we find that the invariant distance is given by the Minkowski metric. If we don't use the Einstein convention, we get neither the Lorentz transformation nor invariance of the Minkowski metric as it is normally expressed (or alternatively, we need to define a new Minkowski metric commensurate with the new time coordinatization).
4) The fundamental and coordinate independent concept of SR is the Minkowski geometry of spacetime.
If that were true, there'd be no need for this thread. But it isn't. "Minkowski geometry", as it is generally used, means a geometry spawned by an inner product that deviates from Euclidean by a -1 in one of the terms, using a particular choice of basis vectors chosen from a special class that represent a particular physically-motivated ordering of events by inertial observers. That is not "coordinate independent", because that particular metric, defined in the Minkowski way, is only invariant when acting on those special coordinates generated by inertial observers, coordinates which are connected by Lorentz transformations, which are of course the orthonormal transformations under the action of the Minkowksi metric (which is why it is invariant for those transformations). This I would say is precisely the basis of Baez's remark: "In special relativity accelerating frames are different from inertial frames."

Put mathematically, it is a very basic theorem of metric spaces that g(x,y)=g(Lx,Ly) only works if L is an orthonormal transformation (indeed, that defines the orthonormal transformations under the action of g). Ergo "Minkowski geometry" is very much the geometry associated with the Lorentz transformations and the Einstein simultaneity convention. These form a subclass of linear coordinatizations, and hence "Minkowski geometry" is not "coordinate independent". Just what is coordinate independent in all this is precisely the question behind this thread.

I'm done.
We all decide how much we want to know.
 
Last edited by a moderator:
  • #70
bernhard.rothenstein said:
1.Einstein's transformation equations hold only with Einstein synchronized clocks t(E) and t"(E).
Yes, the coordinate form of the standard "Lorentz transformation" between inertial frames (as defined by a population of hypothetical inertial observers) requires that inertial observers use the Einstein simultaneity convention.
2.the t(E) and t'(E) readings could be brought in a physically correct relationship with the readings of other clocks synchonized in a different way.
Yes, I would say that any arbitrary prescription could be used to synchronize clocks that merely followed some very weak constraints (supportive of metric spaces), and it would merely spawn a new way to transform between the inertial observers' coordinates. The special treatment of those observers would still allow the first postulate of SR to apply, but the second postulate would be lost. In that sense, I see the second postulate as superfluous, and Einstein's simultaneity convention should be elevated to the level of a postulate if one wanted to work in the standard coordinatization. If one wanted a coordinate-free treatment, one would simply assert that the speed of light is whatever is necessary to allow the first postulate to hold.

3.It is considered that under such conditions the reference frame I is in absolure rest relative to the ether the motion of I' relative to it having an absolute character confering to I' some properties (anisotropy).
Yes, it is a matter of sheer preference, a la Occam's Razor, to exclude anisotropy. In other words, if we later found some new physics that required anisotropy, no previous experiments would suddenly seem strange, we would merely have to use a different simultaneity convention and/or a different status of what are the "special" reference frames.
4.My oppinion is that those properties are merely introduced by the shift from the reading t(E) to the reading t(v). Using different physically correct relationships between t(E) and the reading t of a differently synchronized clock we obtain different transformation equations which confer different properties to I'.
To me, the key unanswered issue is, "how should we think of all this so that none of the arbitrary choices matter, i.e., what possibilities are ruled out by experiment and what is just what we accept from our preference for simplicity?"
 
  • #71
Hurkyl said:
If that really is what you're saying, then I posit that you need to review elementary geometry before continuing to reflect upon physics.

Ken G said:
Oh, and drop the haughtiness, it's not being backed up.
Perhaps I should have instead said "I will if you will"!
 
  • #72
Ken G said:
DaleSpam said:
4) The fundamental and coordinate independent concept of SR is the Minkowski geometry of spacetime.
If that were true, there'd be no need for this thread.
So, you can see why he might have become exasperated. :-p

But it isn't. "Minkowski geometry", as it is generally used,
I have ever only seen "Minkowski geometry" used to refer to the notion of a coordinate-independent description of spacetime. (Conversely, those who reject Minkowski geometry prefer coordinate-dependence)

means a geometry spawned by an inner product that deviates from Euclidean by a -1 in one of the terms, using a particular choice of basis vectors
Everything can be described by a coordinate-based approach: that's why coordinates are useful. But that doesn't mean everything is coordinate-dependent.

In fact, I don't remember the last time I have ever heard of Minkowski space being described in a coordinate-dependent manner -- it's usually described in terms of its ,[/URL] which an intrinsic property of a metric, and is invariant under all coordinate transformations. In fact, it is effectively the only property of a metric that is invariant under all coordinate transformations.
 
Last edited by a moderator:
  • #73
Ken,

Sorry I haven't had chance to respond for a few days.

To echo what Hurkyl has been saying, the modern "geometrical" view of spacetime uses terminology slightly differently than the way you've been using it. It might help to forget relativity for a while and go back to 2D Euclidean geometry. The metric here is given by

ds^2 = dx^2 + dy^2​

where x and y are orthonormal Cartesian coordinates. However, that equation is not the metric; it is the equation for the metric in a particular coordinate system. It turns out that the same equation works for all other orthonormal Cartesian coordinates. But it doesn't work for other coordinates. For example, in "skew" coordinates, where the axes are at an angle of \alpha to each other, the metric is given by

ds^2 = dx^2 + dy^2 - 2\,dx\,dy\,\cos \alpha​

And in polar coordinates the equation is

ds^2 = dr^2 + r^2d\theta^2​

The above three equations are not three different metrics. They all represent the same metric, viz. the 2D Euclidean metric, expressed in different coordinate systems. And the metric has a physical interpretation as "distance", which is invariant under any coordinate change.

In relativity, even though the physical interpretation of the metric is a little more complicated, the same principle applies.

Ken G said:
DrGreg said:
Note that if you rescale rapidity to be c \log_e k then it approximates to coordinate-speed at low speeds.
True, but that's not really a speed, it's a Doppler factor. That's the thing we can measure, speed requires a coordinatization.
But I am saying, if you do the maths, you will find that for low speeds the natural logarithm of the Doppler factor, viz. c \log_e k really does approximate to coordinate speed (at "everyday" terrestial speeds the two values would be indistinguishable), so you could use rapidity as a coordinate-independent measure of motion that is fully compatible with Newtonian (non-relativistic) speed.

Ken G said:
That had me thinking for awhile, but I don't think that would give a unique result. After all, there are infinitely many pairs of mutually stationary objects that could have one object at each event, all with different distances between them. If you further stipulate that the objects must be stationary with respect to the observer doing the measurement, it just means each such pair comes with their own observer, each finding a different "proper distance" between the events. If the events themselves don't have a concept of being "stationary", which they don't normally, then we still have no way to know which observer is getting the "proper" result.
Actually you are right here: what I said isn't enough to define the "interval" between two events. Every inertial observer can measure a different distance between events in the way I said. The "interval" is the shortest possible distance that any inertial observer might measure between those two events, assuming that minimum is not zero (otherwise your two events are timelike separated).

Ken G said:
They are not "different"-- everyone can measure something with an accelerometer. The inertial ones are simply defined as those who measure zero.
The point I was alluding to is that to an inertial observer in GR, Special Relativity still appears to be approximately true in a small local region around himself/herself. (The phrase "approximately true" can be made precise by means of calculus.) An inertial observer, in GR, can set up a local, Einstein-synced coordinate system in such a way that ds^2 = dt^2 - dx^2/c^2 - dy^2/c^2 - dz^2/c^2 is still true at the origin of the coordinate system (although it won't be true elsewhere). (And conversely, non-inertial observers can never set up a local Minkowski approximation.) In that sense, inertial observers are "different", even though, as you rightly say, all observers, inertial or not, can set up coordinate systems.

Ken G said:
DrGreg said:
The inner product, or "metric" is invariant, that is you always get the same answer for g(X,Y) no matter what coordinate system you use to carry out the calculation.
Not if you use "radio coordinates". This is part of the point-- the metric space has more general properties than the form of the metric.
No, this is a terminological issue. I think you are thinking of "the metric" as being the formula for ds in terms of the coordinates. I am saying that "the metric" is an entity that exists independently of coordinates, that you can define physically in terms of proper time and proper distance, and whose mathematical properties can be formulated in terms of vector equations, not component equations. So in spherical radar coordinates the equation

ds^2 = du \, dv - \frac{(u-v)^2}{4} ( d\theta^2 + \sin^2 \theta d\phi^2)​

represents exactly the same metric as

ds^2 = ds^2 = dt^2 - dx^2/c^2 - dy^2/c^2 - dz^2/c^2​

expressed in Minkowski coordinates. Both equations are the Minkowski metric. The metric is an operator that maps a pair of vectors to a scalar.
 
  • #74
To get back to the original question, is Einstein synchronisation arbitrary or is there some good reason for it? One good reason is the mathematical one that it makes the maths simpler, and it makes it easy to compare one frame against another and confirm that neither is "special" in any way.

For those that are not aware, there is another "natural" synchronisation method called "ultra slow clock transport". The obvious Newtonian way to sync 2 inertial clocks A and B at rest relative to each other is to put a 3rd clock C next to A, sync it to A, then put it next to B and sync B to C. We know that method is no good in relativity, for if you then moved C back to A you would find that C was no longer synced to A (the twin "paradox"). Syncing B to A gives a different result than syncing A to B, by this "fast clock transport method".

But what if we move C from A to B v-e-r-y s-l-o-w-l-y? The twin paradox discrepancy gets less the slower you go. Although you could never achieve zero speed in practice, you can consider, mathematically, what would happen in the limit. It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).
 
  • #75
DrGreg said:
For those that are not aware, there is another "natural" synchronisation method called "ultra slow clock transport". The obvious Newtonian way to sync 2 inertial clocks A and B at rest relative to each other is to put a 3rd clock C next to A, sync it to A, then put it next to B and sync B to C. We know that method is no good in relativity, for if you then moved C back to A you would find that C was no longer synced to A (the twin "paradox"). Syncing B to A gives a different result than syncing A to B, by this "fast clock transport method".

But what if we move C from A to B v-e-r-y s-l-o-w-l-y? The twin paradox discrepancy gets less the slower you go. Although you could never achieve zero speed in practice, you can consider, mathematically, what would happen in the limit. It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).

A nice description and comparison of Einstein synchronization and "slow clock transport" can be found in:
Mansouri R., Sexl R.U.: A test theory of special relativity. I: Simultaneity and clock synchronization. In: General. Relat. Gravit.. 8, Nr. 7, 1977, pp. 497–513.​

Experiments, which confirmed the equivalence between those methods, were made by:
Wolf P. and Petit G., Satellite test of special relativity using the global positioning system, Phys. Rev. A56, 6, 4405, (1997).​

See also:
en.wikipedia.org/wiki/Einstein_synchronisation
 
  • #76
DrGreg said:
It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).
Wouldn't this method of "ultra slow clock transport" give exactly the same result as any other synchronization method if one assumes the same conventional isotropy/anisotropy of speeds as assumed for the other method? There is no unique connection between Einstein synchronization and slow clock transport.
 
  • #77
Hurkyl said:
I have ever only seen "Minkowski geometry" used to refer to the notion of a coordinate-independent description of spacetime. (Conversely, those who reject Minkowski geometry prefer coordinate-dependence).
The key point I was making is, a metric is only invariant on mappings of the vector space into itself that constitute the "orthonormal transformations" under that metric. Ergo, one cannot say the "Minkowski metric is invariant" and "Minkowski geometry is coordinate independent" in the same breath, they are contradictory. They both have their separate meanings, it is true, but the meanings are different. If you want to count the latter as true, as is the conventional choice, then the former statement is not coordinate independent. That basic confusion is at the heart of what we are trying to get to the bottom of-- the contradiction between imagining that "Minkowski geometry" is coordinate independent, but it is generated by a "Minkowksi metric" (as in any textbook) that is not in general an invariant. If the latter requires assumptions not required in the former, then will the real "Minkowki metric" please stand up?
In fact, I don't remember the last time I have ever heard of Minkowski space being described in a coordinate-dependent manner -- it's usually described in terms of its ,[/URL] which an intrinsic property of a metric, and is invariant under all coordinate transformations. In fact, it is effectively the only property of a metric that is invariant under all coordinate transformations.
Right-- that's why it would normally be considered true that the Minkowski signature is really the heart of special relativity-- not invariants of the Minkowski metric. Yet I will bet you that if you pick up virtually any physics textbook, you will quickly find a confusion between what "coordinate independent" means and what "invariance under Lorentz transformations" means. They get enmeshed as if they were saying the same thing, and untangling that confusion is the progress we are making.

What you are now saying is that the sole "physical" aspect of the Minkowksi metric is that it is symmetric (in the sense <x_i,x_j> = <x_j,x_i>) and gives three positive and one negative norm on an any orthogonal basis. It will not give that on arbitrary bases, however, as if the basis vectors are strange combinations of observables, or if they are the observables of an accelerated observer over a finite time period. Note in particular what happens to the "postulates of special relativity" in the latter cases-- we find they make coordinate assumptions. What's more, any metric with that signature would successfully generate special relativity with the appropriate definitions (i.e., not using the Einstein simultaneity convention, or not requiring inertial observers for finite-time calculations). Thus if the "real heart of special relativity" does not make those assumptions, then the "postulates of special relativity", as they are normally taught, are not in fact the real heart of special relativity. Now we're getting somewhere.
 
Last edited by a moderator:
  • #78
DrGreg said:
The above three equations are not three different metrics. They all represent the same metric, viz. the 2D Euclidean metric, expressed in different coordinate systems.
Actually, I think your "skew" metric is indeed a different metric. You have just changed the metric when you changed the basis vectors, to make the new basis an orthonormal one under the new metric. I'm pretty sure that for a single metric, saying <x,y> = <Ox,Oy> is the definition of O being an orthonormal coordinate transformation under that metric.

I think part of the problem here is that metrics normally work on a single vector space, from which you select two vectors, but for them to transform in an invariant way you actually have to take one vector from the vector space and the other from the "dual space", so if you want the first vector space to be covariant vectors, you have to select a contravariant vector from its dual space. If you do that, you obtain complete coordinate independence, but that is not the normal way that metrics operate. Maybe we shouldn't be using a Minkowski "metric" at all.
But I am saying, if you do the maths, you will find that for low speeds the natural logarithm of the Doppler factor, viz. c \log_e k really does approximate to coordinate speed (at "everyday" terrestial speeds the two values would be indistinguishable), so you could use rapidity as a coordinate-independent measure of motion that is fully compatible with Newtonian (non-relativistic) speed.
It seems to me that "coordinate speed" and "coordinate-independent measure of motion" are having a little fight in that sentence.
The point I was alluding to is that to an inertial observer in GR, Special Relativity still appears to be approximately true in a small local region around himself/herself...In that sense, inertial observers are "different", even though, as you rightly say, all observers, inertial or not, can set up coordinate systems.
But if there is no difference between an observer whose accelerometer reads zero, and one whose reads something else (which is true on the scales you are describing), then there is still nothing "special" about the one who is inertial. The "specialness" in special relativity appears on finite times, where the physics comes in, and the accelerometer reading becomes important.
I am saying that "the metric" is an entity that exists independently of coordinates, that you can define physically in terms of proper time and proper distance, and whose mathematical properties can be formulated in terms of vector equations, not component equations.
It is my impression that your remark here would only be true if the vectors that the metric acts on were selected from dual spaces (one covariant and one contravariant), but normally metrics are defined with both vectors from the same space. When the latter is used, metrics are only invariant when acting with respect to orthonormal bases, so that is a coordinate constraint that does single out inertial observers observing vectors of finite (i.e., not infinitesmal) length. Nevertheless, as in the above exchange with Hurkyl, it does not appear that the invariance of the metric is a terribly crucial property, as it is actually its signature that determines the physics within any particular coordinate system.

Both equations are the Minkowski metric. The metric is an operator that maps a pair of vectors to a scalar.
I would say that both equations share the signature of the Minkowski metric, and generate Minkowski geometry, but they are not the same metric. Moreover, the way the Minkowski metric is usually taught is as a single metric, not as a class of metrics that all spawn the same geometry but differ in the values of the norms.
 
  • #79
DrGreg said:
To get back to the original question, is Einstein synchronisation arbitrary or is there some good reason for it? One good reason is the mathematical one that it makes the maths simpler, and it makes it easy to compare one frame against another and confirm that neither is "special" in any way.
I agree, there's certainly plenty of motivation from Occam to set up special relativity the way it is done. My issue, however, is when we "cover our tracks" and assert statements of our own choosing, to make the math simple, as though they were "truths about reality" (that's often how the postulates of relativity are taught, I've seen very few counterexamples). The place you'd see the difference is if we ever found evidence that those postulates were wrong, would we say "hey, but I thought we had observations to back them", and the answer would be "no, the observations only backed more general postulates, we added additional elements for no reason other than to simplify the math. We did the same thing with Newton's laws and look where that got us".
 
  • #80
Ken G said:
The key point I was making is, a metric is only invariant on mappings of the vector space into itself that constitute the "orthonormal transformations" under that metric. Ergo, one cannot say the "Minkowski metric is invariant" and "Minkowski geometry is coordinate independent" in the same breath, they are contradictory.
Change-of-basis transformations are not "mappings of the vector space into itself". (although, they are equivalent to "mappings of the {coordinate-representation of the vector space} into itself")

From this, and subsequent comments, it looks like you're still confusing "the metric" with "the coordinate representation of the metric". I'm quite serious when I say you should reconsider Euclidean geometry before you continue thinking about Minkowski geometry. (Since I assume you understand the Euclidean case)

The invariance of the metric under local Lorentz transformations means that if you change which direction you look, physics remains the same.

The coordinate-independence of the metric means that lengths and angles remain the same, no matter what chart you use to compute them.
 
Last edited:
  • #81
Hurkyl said:
Change-of-basis transformations are not "mappings of the vector space into itself".
Changing to a different observer is, and that's what we are ultimately talking about here.
I'm quite serious when I say you should reconsider Euclidean geometry before you continue thinking about Minkowski geometry.
The issue was never about Euclidean geometry vs. Minkowski geometry (we agree that is the crucial geometric difference). The issue was about the invariance of a metric, what the "Minksowki metric" means, and how that is different from "Minkowski geometry" (in virtually any textbook). Those are not the same questions, that's the point.
The invariance of the metric under local Lorentz transformations means that if you change which direction you look, physics remains the same.
That's incorrect, it means that if you change from one inertial observer to another, physics remains the same.
The coordinate-independence of the metric means that lengths and angles remain the same, no matter what chart you use to compute them.
Let's define a metric, and denote it by < , >. Now I tell you that <x,y> = <Ox,Oy> for some transformation O. We may imagine that O is how things look different when I change from one observer to another, and we are asserting that the metric remains invariant under that change. Question: what can we say about O?
 
Last edited:
  • #82
Aether said:
Wouldn't this method of "ultra slow clock transport" give exactly the same result as any other synchronization method if one assumes the same conventional isotropy/anisotropy of speeds as assumed for the other method? There is no unique connection between Einstein synchronization and slow clock transport.

Well I'm not exactly sure what you mean by "the same conventional isotropy/anisotropy of speeds". If you mean one-way coordinate-speed of light isotropy, then you are assuming Einstein synchronization.

Mansouri & Sexl (mentioned in this post) make some homogeneity and "Lorentzian" assumptions which amount to assuming Einstein's postulates are true when expressed in a suitable sync-convention-independent way.

You can also prove the equivalence of ultra slow clock transport and Einstein synchronization using Bondi's k-calculus and radar coordinates, which do not depend on any sync convention.

Assuming SR is true, as we can prove ultra slow clock transport and Einstein synchronization are equivalent, then ultra slow clock transport cannot be equivalent to anything that is not equivalent Einstein synchronization.

Any experimentally confirmed difference between ultra slow clock transport and Einstein synchronization would amount to a disproof of relativity. It hasn't happened yet.
 
  • #83
Ken G said:
Changing to a different observer is, and that's what we are ultimately talking about here.
If, by that, you don't mean a change of coordinates, then you need to explain.


The issue was never about Euclidean geometry vs. Minkowski geometry (we agree that is the crucial geometric difference).
Euclidean and Minkowski geometry are identical in all aspects relevant to this discussion. For example, they are both affine spaces equipped with a symmetric, nondegenerate bilinear form (that is compatable with the affine structure). I assume we both consider Euclidean geometry 'simpler', and so there is much to gain reviewing that case first.
 
Last edited:
  • #84
Hurkyl said:
If, by that, you don't mean a change of coordinates, then you need to explain.
When you change the observer, you will change the basis vectors used to label the events of spacetime in terms of the physical measurables "distance" and "time". That is both a change in coordinates, in that the labels are changing in a particular way, and a transformation of the vector space into itself, as all the events are now seen from a different perspective-- that of a new observer. The events are the same, but the vectors are different (indeed, nonlinear transformations would make them no longer even members of a vector space at all). If the transformation was Lorentzian, which happens when you are changing between inertial observers and are using the Einstein simultaneity convention, then the "Minkowski metric" connecting any two events, as it is normally defined, will be invariant. But in general, it will not. If you want to get something that is invariant to all linear transformations, you must choose one vector from the vector space and the other from its dual space, as I dimly understand the situation.

Euclidean and Minkowski geometry are identical in all aspects relevant to this discussion. For example, they are both affine spaces equipped with a symmetric, nondegenerate bilinear form (that is compatable with the affine structure). I assume we both consider Euclidean geometry 'simpler', and so there is much to gain reviewing that case first.
Well, if by "all aspects" you mean "in terms of the meaning of a transformation of a vector space, a coordinatization, and a dual space", then I suppose you are right, and those are all interesting and important but quite mathematical issues that I think we all have much to learn about.

But what I have in mind is a much more interesting physical issue, namely, "what are the minimal postulates required to describe the physics of special relativity." When that is the goal, then the different signatures (of the Euclidean and Minkowski metrics) are indeed important, and I am beginning to suspect that the minimal postulates say that the geometry of spacetime is described by a symmetric metric with a signature with three positive and one negative eigenvalue. Note this requires no simultaneity convention, nor the statement that the speed of light is isotropic, nor the requirement that physics look the same from all inertial reference frames. Interesting, is it not, that those are basically the "three pillars of special relativity" as it is normally taught, and my goal is to get to the bottom of this apparent flaw in the standard architecture.

The reason that's important, once again, is that when observations some day show that special relativity breaks down even in the absence of gravity (say, in quantum mechanics), we'll want to know what are the postulates that our observations really did back up, and what ones did we just imagine they backed up, in the process of mistaking Occam "simplicity" for something more akin to "computational convenience".
 
Last edited:
  • #85
Ken G said:
But what I have in mind is a much more interesting physical issue, namely, "what are the minimal postulates required to describe the physics of special relativity."
...
The reason that's important, once again, is that when observations some day show that special relativity breaks down even in the absence of gravity (say, in quantum mechanics), we'll want to know what are the postulates that our observations really did back up, and what ones did we just imagine they backed up
I have more to say, but no time this morning. But I did want to make one quick comment:
Mathematically speaking, at least, "minimal postulates" are not unique. There are many many different ways of formulating any theory.

In terms of your long-term goal, I think that guessing at the 'one true formulation of special relativity' is the wrong approach -- if you instead learn many different ways of formulating special relativity, you're much more likely to know one that can be tweaked to accommodate the new data.
 
  • #86
Hurkyl said:
Mathematically speaking, at least, "minimal postulates" are not unique. There are many many different ways of formulating any theory.
Right, you made that point earlier and that is a very valid one. It is not really the "minimum postulates" that count here, it is the minimal theory. By that I mean, the theory that unifies all the observations, without making unique predictions about what is outside the intended realm of explanation of the measurement set. Newtonian mechanics should have been done that way too, it would have saved us a lot of false surprise (surprise we had no real business being surprised about).
In terms of your long-term goal, I think that guessing at the 'one true formulation of special relativity' is the wrong approach -- if you instead learn many different ways of formulating special relativity, you're much more likely to know one that can be tweaked to accommodate the new data.
That's not the issue, the goal is not to find an equivalent formulation, but a less restrictive one. For example, uniting all metrics with the same signature is already a less restrictive form of dynamics than requiring invariance of a particular one.
 
  • #87
Ken G said:
By that I mean, the theory that unifies all the observations, without making unique predictions about what is outside the intended realm of explanation of the measurement set.
With the description you've given thus far, it appears that a database of all experimental data is precisely the "minimal theory" you seek. But it is not useful scientifically (it cannot be falsified), nor practically (it cannot make predictions).

That's not the issue, the goal is not to find an equivalent formulation, but a less restrictive one.
What exactly do you mean by "less restrictive"? My initial reaction is that that's a disadvantageous trait for a scientific theory -- the less restrictive a theory's predictions, the less the possibility for failure, and thus the less confidence we get by empirically testing it. Conversely, we gain a lot of confidence when a theory passes a test in which it makes very specific predictions.

e.g. if we are considering "space is globally Minkowski" versus "space is locally Minkowski" -- the former assertion is very specific. Every piece of experimental data consistent with the former assertion is, of course, also consistent with the latter assertion.

So, according to Bayesian statistical inference, given lots of experimental data confirming both of these assertions, it is correct to favor the stronger assertion, and so we conclude "space is globally Minkowski". (And, of course, being good statisticians, we are willing to drop that conclusion if later evidence contradicts it)


For example, uniting all metrics with the same signature is already a less restrictive form of dynamics than requiring invariance of a particular one.
They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2.

The "affine 4-space equipped with a compatable metric of signature +---" formulation does have pedagogical value due to its manifest coordinate-independence, but it is describing exactly the same theory as "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any coordinate chart is d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2."
 
Last edited:
  • #88
Hurkyl said:
With the description you've given thus far, it appears that a database of all experimental data is precisely the "minimal theory" you seek.
No, a "database" is not a theory at all because it is not unified. That's what an "explanation" means-- a way to see all that data as a consequence of a single theory.
But it is not useful scientifically (it cannot be falsified), nor practically (it cannot make predictions).
Correct, the minimal theory cannot be falsified, that is why it is such a useful springboard to making the kinds of "extensions" I specifically mentioned above. It is the extensions that make predictions, and are falsifiable. That way, you know what you are doing, and avoid the "scattershot" approach by which Newtonian mechanics was replaced by special relativity, and that same scattershot approach is how special relativity is still taught today.

You see, there is no point in making predictions of experiments you cannot do, so it makes more sense to look at the experiments you can, and tailor a theory that starts with fitting all the experiments you have done, and simply extends to make a prediction for the new experiment, without being weighed down with a host of other predictions that are not being tested and probably aren't right. That way, we avoid the continual mistake of "believing" in aspects of our theories that we failed to identify as being purely out of convenience. We could also avoid this annoying illusion that science undergoes "revolutions", rather than simply learns new stuff.
What exactly do you mean by "less restrictive"?
I mean the theory would come with fewer requirements on how we picture reality, and a broader understanding of the possibilities that work equally well. For special relativity, that means that inertial observers would not be singled out as special in any way, the speed of light would not need to be isotropic, and no one would need to claim "experiments show there is no ether". We would simply set up the mathematical machinery we need to get the dynamics right, and not bother to make claims about reality that we have no way to test. Because, when we later figure out a way to test them, more often than not we discover we were wrong, and science historians will make a big deal about the shocking revolution, when in fact we were simply pretending to know something we did not know.

Ironically, this is exactly what happened with the Michelson-Morely experiment, but we missed the full lesson there. The lesson was not "M-M showed us we made the wrong assumptions", as it is normally taught, but instead, "M-M showed us the danger in making assumptions that we simply don't need to unify the observations we have on hand". We should have simply gone into M-M with an open mind, realizing that we were entering a new regime and anything could happen. We could have come equipped with several possible extensions of our current theory, and used the experiment to distinguish them, but no one needed to act the least bit surprised when one extension worked better than another.

My initial reaction is that that's a disadvantageous trait for a scientific theory -- the less restrictive a theory's predictions, the less the possibility for failure, and thus the less confidence we get by empirically testing it. Conversely, we gain a lot of confidence when a theory passes a test in which it makes very specific predictions.
But what "confidence" do you mean? Confidence that the theory is indeed working for unifying a particular measurement set, and other measurements that fit into the same overall framework, or confidence that the theory will work when applied to some completely new measurement? The former kind of confidence is the confidence that builds bridges-- the latter is the one that makes fools of the best thinkers of all time.
e.g. if we are considering "space is globally Minkowski" versus "space is locally Minkowski" -- the former assertion is very specific. Every piece of experimental data consistent with the former assertion is, of course, also consistent with the latter assertion.
The former is the more restrictive theory, because it makes more assertions about reality, and has more ways to be false. So this is a good example of just what I'm talking about-- the latter unifies our current observations, the former is false (it breaks down either if there is gravity, or if the observer accelerates). The latter requires extensions to expand its usefulness into those realms, but that's just what it should need.

So, according to Bayesian statistical inference, given lots of experimental data confirming both of these assertions, it is correct to favor the stronger assertion, and so we conclude "space is globally Minkowski". (And, of course, being good statisticians, we are willing to drop that conclusion if later evidence contradicts it)
We already know that is false.
They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2.
I agree that +--- is equivalent to -+++, it didn't matter because we are comparing to +++ with absolute time.
The "affine 4-space equipped with a compatable metric of signature +---" formulation does have pedagogical value due to its manifest coordinate-independence, but it is describing exactly the same theory as "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any coordinate chart is d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2."
Again, that is not the coordinate representation of the Minkowski metric in any coordinate chart-- only any orthogonal coordinate chart. Furthermore, we need a finite concept of distance-- an infinitesmal one does not suffice to determine the dynamics, so the latter requires a special treatment of inertial observers, un awkward and unnecessary aspect of the theory that is often mistaken for a physical statement of some kind.
 
  • #89
A trivial theory is still a theory -- aesthetic grounds are not sufficient justification for rejecting it. And besides the 'database theory' is the only theory (up to equivalence) that makes no assertions beyond the experimental data. This is fairly easy to see: if you have a theory that is not equivalent to the database theory, then either it deals with things that are not experimental results, or it makes assertions that cannot be proven by the data.

And, of course, it is a trivial exercise to show that each piece of experimental data is a theorem of the database theory.


For special relativity, that means that Inertial Observers would not be singled out as special in any way,
But they can be singled out: an observer is inertial if and only if his worldline is straight,
the speed of light would not need to be isotropic
and it's an easy theorem that null vectors have 'speed' one in any orthonormal affine coordinate chart.

The theory of special relativity, like any other theory, is formulation independent: you get the same theory no matter how you formulate it. e.g. if you formualte it in terms of inertial observers and Poincaré-invariant coordinate metrics, you get exactly the same theory as if you formulate it in terms of a coordinate-independent metric with a specified signature.

Even Lorentz relativity is effectively the same as special relativity. LR includes an extra constant symbol denoting an orthonormal coordinate frame, but is otherwise exactly the same theory as special relativity. (mathematically speaking, at least)


We should have simply gone into M-M with an open mind, realizing that we were entering a new regime and anything could happen.
Tomorrow is a new regime too. :-p Yes, a closed mind is bad for science... but so is naïeveté. Scientific theories have been well-supported by empirical evidence, and that affords us confidence that they will continue to be correct. When going into a new experiment, we should have exactly as much confidence in our theories as they deserve... no more, and no less.


But what "confidence" do you mean?
The confidence afforded to us by the scientific method.


The former is the more restrictive theory, because it makes more assertions about reality, and has more ways to be false. So this is a good example of just what I'm talking about-- the latter unifies our current observations, the former is false (it breaks down either if there is gravity, or if the observer accelerates).
...
We already know that is false.
The point is, before we had evidence contradicting the former, it was scientifically correct to favor the "globally Minkowski" hypothesis over the "locally Minkowski" hypothesis. Why was that scientifically correct? Because the "globally Minkowski" hypothesis had stronger empirical support.

Of course, with the evidence we now have, "locally Minkowski" has stronger empirical support.


Hurkyl said:
They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2.
I agree that +--- is equivalent to -+++, it didn't matter because we are comparing to +++ with absolute time.
Huh? That has absolutely nothing to do with what I said in that quote.

Again, that is not the coordinate representation of the Minkowski metric in any coordinate chart-- only any orthogonal coordinate chart.
That was a typo, sorry. It was supposed to say "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any distinguished coordinate chart..."

Furthermore, we need a finite concept of distance-- an infinitesmal one does not suffice to determine the dynamics
That's what calculus is for.
 
  • #90
Ken G said:
When you change the observer, you will change the basis vectors used to label the events of spacetime in terms of the physical measurables "distance" and "time". That is both a change in coordinates, in that the labels are changing in a particular way, and a transformation of the vector space into itself, as all the events are now seen from a different perspective-- that of a new observer.
Looking at the same events from a different perspective -- that sounds exactly like you're leaving Minkowski space unchanged, but changing the coordinate chart you're using.

For a vivid (but Euclidean) example -- put a sheet of paper on the floor and look at it. Now, walk somewhere else and look at the paper again. Did the paper change?


The events are the same,
And since, physically speaking, events in 'reality' correspond to points in Minkowski space, we see that the operation you propose doesn't transform Minkowski space. (In fact, there is nothing physical that enacts a transformation of Minkowski space)

but the vectors are different (indeed, nonlinear transformations would make them no longer even members of a vector space at all).
Minkowski space is not a vector space; it is an affine space. You can view an affine space as a vector space by choosing an 'origin' and corresponding each point of the affine space with the vector given by subtracting off the origin. If you change the origin, then yes, that correspondence will change.

It looks like you're trying to make your observer correspond to an origin but that doesn't make sense -- the origin is a single point, whereas the observer occupies an entire worldline. (actually an entire 3+1-dimensional region -- we only get a worldline if we assume zero spatial extent)

If you want to get something that is invariant to all linear transformations, you must choose one vector from the vector space and the other from its dual space, as I dimly understand the situation.
If all linear transformations of interest act trivially, you get invariance automatically. :wink: That's what happens with a coordinate change -- the change-of-coordinates transformation doesn't do anything to Minkowski space; it only changes the coordinate functions, and the coordinate spaces.


Now, the fact that the symmetry group of Minkowski space is Poincaré group is interesting... and I suspect the thing you're really interested in; coordinate changes are just a red herring. And the key point is that Minkowski space is not symmetric under skew transformations, or a rescaling along a single axis; only Poincaré transformations preserve the Minkowski structure.
 

Similar threads

  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 32 ·
2
Replies
32
Views
4K
  • · Replies 51 ·
2
Replies
51
Views
5K
  • · Replies 127 ·
5
Replies
127
Views
9K
  • · Replies 34 ·
2
Replies
34
Views
2K
  • · Replies 6 ·
Replies
6
Views
931
  • · Replies 9 ·
Replies
9
Views
1K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 4 ·
Replies
4
Views
933
  • · Replies 57 ·
2
Replies
57
Views
7K