Einstein simultaneity: just a convention?

Ken G · May 10, 2008

Hurkyl said:

If that really is what you're saying, then I posit that you need to review elementary geometry before continuing to reflect upon physics.

Ken G said:

Oh, and drop the haughtiness, it's not being backed up.

Perhaps I should have instead said "I will if you will"!

Hurkyl · May 10, 2008

Ken G said:

DaleSpam said:

4) The fundamental and coordinate independent concept of SR is the Minkowski geometry of spacetime.

If that were true, there'd be no need for this thread.

So, you can see why he might have become exasperated. :tongue:

But it isn't. "Minkowski geometry", as it is generally used,

I have ever only seen "Minkowski geometry" used to refer to the notion of a coordinate-independent description of spacetime. (Conversely, those who reject Minkowski geometry prefer coordinate-dependence)

means a geometry spawned by an inner product that deviates from Euclidean by a -1 in one of the terms, using a particular choice of basis vectors

Everything can be described by a coordinate-based approach: that's why coordinates are useful. But that doesn't mean everything is coordinate-dependent.

In fact, I don't remember the last time I have ever heard of Minkowski space being described in a coordinate-dependent manner -- it's usually described in terms of its ,[/URL] which an intrinsic property of a metric, and is invariant under all coordinate transformations. In fact, it is effectively the only property of a metric that is invariant under all coordinate transformations.

DrGreg · May 12, 2008

Ken,

Sorry I haven't had chance to respond for a few days.

To echo what Hurkyl has been saying, the modern "geometrical" view of spacetime uses terminology slightly differently than the way you've been using it. It might help to forget relativity for a while and go back to 2D Euclidean geometry. The metric here is given by

[tex]ds^2 = dx^2 + dy^2[/tex]

where x and y are orthonormal Cartesian coordinates. However, that equation is not the metric; it is the equation for the metric in a particular coordinate system. It turns out that the same equation works for all other orthonormal Cartesian coordinates. But it doesn't work for other coordinates. For example, in "skew" coordinates, where the axes are at an angle of [itex]\alpha[/itex] to each other, the metric is given by

[tex]ds^2 = dx^2 + dy^2 - 2\,dx\,dy\,\cos \alpha[/tex]

And in polar coordinates the equation is

[tex]ds^2 = dr^2 + r^2d\theta^2[/tex]

The above three equations are not three different metrics. They all represent the same metric, viz. the 2D Euclidean metric, expressed in different coordinate systems. And the metric has a physical interpretation as "distance", which is invariant under any coordinate change.

In relativity, even though the physical interpretation of the metric is a little more complicated, the same principle applies.

Ken G said:

DrGreg said:

Note that if you rescale rapidity to be [itex] c \log_e k [/itex] then it approximates to coordinate-speed at low speeds.

True, but that's not really a speed, it's a Doppler factor. That's the thing we can measure, speed requires a coordinatization.

But I am saying, if you do the maths, you will find that for low speeds the natural logarithm of the Doppler factor, viz. [tex] c \log_e k [/tex] really does approximate to coordinate speed (at "everyday" terrestial speeds the two values would be indistinguishable), so you could use rapidity as a coordinate-independent measure of motion that is fully compatible with Newtonian (non-relativistic) speed.

Ken G said:

That had me thinking for awhile, but I don't think that would give a unique result. After all, there are infinitely many pairs of mutually stationary objects that could have one object at each event, all with different distances between them. If you further stipulate that the objects must be stationary with respect to the observer doing the measurement, it just means each such pair comes with their own observer, each finding a different "proper distance" between the events. If the events themselves don't have a concept of being "stationary", which they don't normally, then we still have no way to know which observer is getting the "proper" result.

Actually you are right here: what I said isn't enough to define the "interval" between two events. Every inertial observer can measure a different distance between events in the way I said. The "interval" is the shortest possible distance that any inertial observer might measure between those two events, assuming that minimum is not zero (otherwise your two events are timelike separated).

Ken G said:

They are not "different"-- everyone can measure something with an accelerometer. The inertial ones are simply defined as those who measure zero.

The point I was alluding to is that to an inertial observer in GR, Special Relativity still appears to be approximately true in a small local region around himself/herself. (The phrase "approximately true" can be made precise by means of calculus.) An inertial observer, in GR, can set up a local, Einstein-synced coordinate system in such a way that [itex]ds^2 = dt^2 - dx^2/c^2 - dy^2/c^2 - dz^2/c^2[/itex] is still true at the origin of the coordinate system (although it won't be true elsewhere). (And conversely, non-inertial observers can never set up a local Minkowski approximation.) In that sense, inertial observers are "different", even though, as you rightly say, all observers, inertial or not, can set up coordinate systems.

Ken G said:

DrGreg said:

The inner product, or "metric" is invariant, that is you always get the same answer for g(X,Y) no matter what coordinate system you use to carry out the calculation.

Not if you use "radio coordinates". This is part of the point-- the metric space has more general properties than the form of the metric.

No, this is a terminological issue. I think you are thinking of "the metric" as being the formula for ds in terms of the coordinates. I am saying that "the metric" is an entity that exists independently of coordinates, that you can define physically in terms of proper time and proper distance, and whose mathematical properties can be formulated in terms of vector equations, not component equations. So in spherical radar coordinates the equation

[tex]ds^2 = du \, dv - \frac{(u-v)^2}{4} ( d\theta^2 + \sin^2 \theta d\phi^2) [/tex]

represents exactly the same metric as

[tex]ds^2 = ds^2 = dt^2 - dx^2/c^2 - dy^2/c^2 - dz^2/c^2 [/tex]

expressed in Minkowski coordinates. Both equations are the Minkowski metric. The metric is an operator that maps a pair of vectors to a scalar.

DrGreg · May 12, 2008

To get back to the original question, is Einstein synchronisation arbitrary or is there some good reason for it? One good reason is the mathematical one that it makes the maths simpler, and it makes it easy to compare one frame against another and confirm that neither is "special" in any way.

For those that are not aware, there is another "natural" synchronisation method called "ultra slow clock transport". The obvious Newtonian way to sync 2 inertial clocks A and B at rest relative to each other is to put a 3rd clock C next to A, sync it to A, then put it next to B and sync B to C. We know that method is no good in relativity, for if you then moved C back to A you would find that C was no longer synced to A (the twin "paradox"). Syncing B to A gives a different result than syncing A to B, by this "fast clock transport method".

But what if we move C from A to B v-e-r-y s-l-o-w-l-y? The twin paradox discrepancy gets less the slower you go. Although you could never achieve zero speed in practice, you can consider, mathematically, what would happen in the limit. It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).

Histspec · May 12, 2008

DrGreg said:

For those that are not aware, there is another "natural" synchronisation method called "ultra slow clock transport". The obvious Newtonian way to sync 2 inertial clocks A and B at rest relative to each other is to put a 3rd clock C next to A, sync it to A, then put it next to B and sync B to C. We know that method is no good in relativity, for if you then moved C back to A you would find that C was no longer synced to A (the twin "paradox"). Syncing B to A gives a different result than syncing A to B, by this "fast clock transport method".

But what if we move C from A to B v-e-r-y s-l-o-w-l-y? The twin paradox discrepancy gets less the slower you go. Although you could never achieve zero speed in practice, you can consider, mathematically, what would happen in the limit. It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).

A nice description and comparison of Einstein synchronization and "slow clock transport" can be found in:

Mansouri R., Sexl R.U.: A test theory of special relativity. I: Simultaneity and clock synchronization. In: General. Relat. Gravit.. 8, Nr. 7, 1977, pp. 497–513.

Experiments, which confirmed the equivalence between those methods, were made by:

Wolf P. and Petit G., Satellite test of special relativity using the global positioning system, Phys. Rev. A56, 6, 4405, (1997).

See also:
en.wikipedia.org/wiki/Einstein_synchronisation

Aether · May 14, 2008

DrGreg said:

It turns out, when you do the maths, that this method of "ultra slow clock transport" synchronisation gives exactly the same result as Einstein synchronisation (and experiments have confirmed this).

Wouldn't this method of "ultra slow clock transport" give exactly the same result as any other synchronization method if one assumes the same conventional isotropy/anisotropy of speeds as assumed for the other method? There is no unique connection between Einstein synchronization and slow clock transport.

Ken G · May 14, 2008

Hurkyl said:

I have ever only seen "Minkowski geometry" used to refer to the notion of a coordinate-independent description of spacetime. (Conversely, those who reject Minkowski geometry prefer coordinate-dependence).

The key point I was making is, a metric is only invariant on mappings of the vector space into itself that constitute the "orthonormal transformations" under that metric. Ergo, one cannot say the "Minkowski metric is invariant" and "Minkowski geometry is coordinate independent" in the same breath, they are contradictory. They both have their separate meanings, it is true, but the meanings are different. If you want to count the latter as true, as is the conventional choice, then the former statement is not coordinate independent. That basic confusion is at the heart of what we are trying to get to the bottom of-- the contradiction between imagining that "Minkowski geometry" is coordinate independent, but it is generated by a "Minkowksi metric" (as in any textbook) that is not in general an invariant. If the latter requires assumptions not required in the former, then will the real "Minkowki metric" please stand up?

In fact, I don't remember the last time I have ever heard of Minkowski space being described in a coordinate-dependent manner -- it's usually described in terms of its ,[/URL] which an intrinsic property of a metric, and is invariant under all coordinate transformations. In fact, it is effectively the only property of a metric that is invariant under all coordinate transformations.

Right-- that's why it would normally be considered true that the Minkowski signature is really the heart of special relativity-- not invariants of the Minkowski metric. Yet I will bet you that if you pick up virtually any physics textbook, you will quickly find a confusion between what "coordinate independent" means and what "invariance under Lorentz transformations" means. They get enmeshed as if they were saying the same thing, and untangling that confusion is the progress we are making.

What you are now saying is that the sole "physical" aspect of the Minkowksi metric is that it is symmetric (in the sense <x_i,x_j> = <x_j,x_i>) and gives three positive and one negative norm on an any orthogonal basis. It will not give that on arbitrary bases, however, as if the basis vectors are strange combinations of observables, or if they are the observables of an accelerated observer over a finite time period. Note in particular what happens to the "postulates of special relativity" in the latter cases-- we find they make coordinate assumptions. What's more, any metric with that signature would successfully generate special relativity with the appropriate definitions (i.e., not using the Einstein simultaneity convention, or not requiring inertial observers for finite-time calculations). Thus if the "real heart of special relativity" does not make those assumptions, then the "postulates of special relativity", as they are normally taught, are not in fact the real heart of special relativity. Now we're getting somewhere.

Ken G · May 14, 2008

DrGreg said:

The above three equations are not three different metrics. They all represent the same metric, viz. the 2D Euclidean metric, expressed in different coordinate systems.

Actually, I think your "skew" metric is indeed a different metric. You have just changed the metric when you changed the basis vectors, to make the new basis an orthonormal one under the new metric. I'm pretty sure that for a single metric, saying <x,y> = <Ox,Oy> is the definition of O being an orthonormal coordinate transformation under that metric.

I think part of the problem here is that metrics normally work on a single vector space, from which you select two vectors, but for them to transform in an invariant way you actually have to take one vector from the vector space and the other from the "dual space", so if you want the first vector space to be covariant vectors, you have to select a contravariant vector from its dual space. If you do that, you obtain complete coordinate independence, but that is not the normal way that metrics operate. Maybe we shouldn't be using a Minkowski "metric" at all.

But I am saying, if you do the maths, you will find that for low speeds the natural logarithm of the Doppler factor, viz. [tex] c \log_e k [/tex] really does approximate to coordinate speed (at "everyday" terrestial speeds the two values would be indistinguishable), so you could use rapidity as a coordinate-independent measure of motion that is fully compatible with Newtonian (non-relativistic) speed.

It seems to me that "coordinate speed" and "coordinate-independent measure of motion" are having a little fight in that sentence.

The point I was alluding to is that to an inertial observer in GR, Special Relativity still appears to be approximately true in a small local region around himself/herself...In that sense, inertial observers are "different", even though, as you rightly say, all observers, inertial or not, can set up coordinate systems.

But if there is no difference between an observer whose accelerometer reads zero, and one whose reads something else (which is true on the scales you are describing), then there is still nothing "special" about the one who is inertial. The "specialness" in special relativity appears on finite times, where the physics comes in, and the accelerometer reading becomes important.

I am saying that "the metric" is an entity that exists independently of coordinates, that you can define physically in terms of proper time and proper distance, and whose mathematical properties can be formulated in terms of vector equations, not component equations.

It is my impression that your remark here would only be true if the vectors that the metric acts on were selected from dual spaces (one covariant and one contravariant), but normally metrics are defined with both vectors from the same space. When the latter is used, metrics are only invariant when acting with respect to orthonormal bases, so that is a coordinate constraint that does single out inertial observers observing vectors of finite (i.e., not infinitesmal) length. Nevertheless, as in the above exchange with Hurkyl, it does not appear that the invariance of the metric is a terribly crucial property, as it is actually its signature that determines the physics within any particular coordinate system.

Both equations are the Minkowski metric. The metric is an operator that maps a pair of vectors to a scalar.

I would say that both equations share the signature of the Minkowski metric, and generate Minkowski geometry, but they are not the same metric. Moreover, the way the Minkowski metric is usually taught is as a single metric, not as a class of metrics that all spawn the same geometry but differ in the values of the norms.

Ken G · May 14, 2008

DrGreg said:

To get back to the original question, is Einstein synchronisation arbitrary or is there some good reason for it? One good reason is the mathematical one that it makes the maths simpler, and it makes it easy to compare one frame against another and confirm that neither is "special" in any way.

I agree, there's certainly plenty of motivation from Occam to set up special relativity the way it is done. My issue, however, is when we "cover our tracks" and assert statements of our own choosing, to make the math simple, as though they were "truths about reality" (that's often how the postulates of relativity are taught, I've seen very few counterexamples). The place you'd see the difference is if we ever found evidence that those postulates were wrong, would we say "hey, but I thought we had observations to back them", and the answer would be "no, the observations only backed more general postulates, we added additional elements for no reason other than to simplify the math. We did the same thing with Newton's laws and look where that got us".

Hurkyl · May 15, 2008

Ken G said:

The key point I was making is, a metric is only invariant on mappings of the vector space into itself that constitute the "orthonormal transformations" under that metric. Ergo, one cannot say the "Minkowski metric is invariant" and "Minkowski geometry is coordinate independent" in the same breath, they are contradictory.

Change-of-basis transformations are not "mappings of the vector space into itself". (although, they are equivalent to "mappings of the {coordinate-representation of the vector space} into itself")

From this, and subsequent comments, it looks like you're still confusing "the metric" with "the coordinate representation of the metric". I'm quite serious when I say you should reconsider Euclidean geometry before you continue thinking about Minkowski geometry. (Since I assume you understand the Euclidean case)

The invariance of the metric under local Lorentz transformations means that if you change which direction you look, physics remains the same.

The coordinate-independence of the metric means that lengths and angles remain the same, no matter what chart you use to compute them.

Ken G · May 15, 2008

Hurkyl said:

Change-of-basis transformations are not "mappings of the vector space into itself".

Changing to a different observer is, and that's what we are ultimately talking about here.

I'm quite serious when I say you should reconsider Euclidean geometry before you continue thinking about Minkowski geometry.

The issue was never about Euclidean geometry vs. Minkowski geometry (we agree that is the crucial geometric difference). The issue was about the invariance of a metric, what the "Minksowki metric" means, and how that is different from "Minkowski geometry" (in virtually any textbook). Those are not the same questions, that's the point.

The invariance of the metric under local Lorentz transformations means that if you change which direction you look, physics remains the same.

That's incorrect, it means that if you change from one inertial observer to another, physics remains the same.

The coordinate-independence of the metric means that lengths and angles remain the same, no matter what chart you use to compute them.

Let's define a metric, and denote it by < , >. Now I tell you that <x,y> = <Ox,Oy> for some transformation O. We may imagine that O is how things look different when I change from one observer to another, and we are asserting that the metric remains invariant under that change. Question: what can we say about O?

DrGreg · May 15, 2008

Aether said:

Wouldn't this method of "ultra slow clock transport" give exactly the same result as any other synchronization method if one assumes the same conventional isotropy/anisotropy of speeds as assumed for the other method? There is no unique connection between Einstein synchronization and slow clock transport.

Well I'm not exactly sure what you mean by "the same conventional isotropy/anisotropy of speeds". If you mean one-way coordinate-speed of light isotropy, then you are assuming Einstein synchronization.

Mansouri & Sexl (mentioned in this post) make some homogeneity and "Lorentzian" assumptions which amount to assuming Einstein's postulates are true when expressed in a suitable sync-convention-independent way.

You can also prove the equivalence of ultra slow clock transport and Einstein synchronization using Bondi's k-calculus and radar coordinates, which do not depend on any sync convention.

Assuming SR is true, as we can prove ultra slow clock transport and Einstein synchronization are equivalent, then ultra slow clock transport cannot be equivalent to anything that is not equivalent Einstein synchronization.

Any experimentally confirmed difference between ultra slow clock transport and Einstein synchronization would amount to a disproof of relativity. It hasn't happened yet.

Hurkyl · May 15, 2008

Ken G said:

Changing to a different observer is, and that's what we are ultimately talking about here.

If, by that, you don't mean a change of coordinates, then you need to explain.

The issue was never about Euclidean geometry vs. Minkowski geometry (we agree that is the crucial geometric difference).

Euclidean and Minkowski geometry are identical in all aspects relevant to this discussion. For example, they are both affine spaces equipped with a symmetric, nondegenerate bilinear form (that is compatable with the affine structure). I assume we both consider Euclidean geometry 'simpler', and so there is much to gain reviewing that case first.

Ken G · May 15, 2008

Hurkyl said:

If, by that, you don't mean a change of coordinates, then you need to explain.

When you change the observer, you will change the basis vectors used to label the events of spacetime in terms of the physical measurables "distance" and "time". That is both a change in coordinates, in that the labels are changing in a particular way, and a transformation of the vector space into itself, as all the events are now seen from a different perspective-- that of a new observer. The events are the same, but the vectors are different (indeed, nonlinear transformations would make them no longer even members of a vector space at all). If the transformation was Lorentzian, which happens when you are changing between inertial observers and are using the Einstein simultaneity convention, then the "Minkowski metric" connecting any two events, as it is normally defined, will be invariant. But in general, it will not. If you want to get something that is invariant to all linear transformations, you must choose one vector from the vector space and the other from its dual space, as I dimly understand the situation.

Euclidean and Minkowski geometry are identical in all aspects relevant to this discussion. For example, they are both affine spaces equipped with a symmetric, nondegenerate bilinear form (that is compatable with the affine structure). I assume we both consider Euclidean geometry 'simpler', and so there is much to gain reviewing that case first.

Well, if by "all aspects" you mean "in terms of the meaning of a transformation of a vector space, a coordinatization, and a dual space", then I suppose you are right, and those are all interesting and important but quite mathematical issues that I think we all have much to learn about.

But what I have in mind is a much more interesting physical issue, namely, "what are the minimal postulates required to describe the physics of special relativity." When that is the goal, then the different signatures (of the Euclidean and Minkowski metrics) are indeed important, and I am beginning to suspect that the minimal postulates say that the geometry of spacetime is described by a symmetric metric with a signature with three positive and one negative eigenvalue. Note this requires no simultaneity convention, nor the statement that the speed of light is isotropic, nor the requirement that physics look the same from all inertial reference frames. Interesting, is it not, that those are basically the "three pillars of special relativity" as it is normally taught, and my goal is to get to the bottom of this apparent flaw in the standard architecture.

The reason that's important, once again, is that when observations some day show that special relativity breaks down even in the absence of gravity (say, in quantum mechanics), we'll want to know what are the postulates that our observations really did back up, and what ones did we just imagine they backed up, in the process of mistaking Occam "simplicity" for something more akin to "computational convenience".

Hurkyl · May 16, 2008

Ken G said:

But what I have in mind is a much more interesting physical issue, namely, "what are the minimal postulates required to describe the physics of special relativity."
...
The reason that's important, once again, is that when observations some day show that special relativity breaks down even in the absence of gravity (say, in quantum mechanics), we'll want to know what are the postulates that our observations really did back up, and what ones did we just imagine they backed up

I have more to say, but no time this morning. But I did want to make one quick comment:
Mathematically speaking, at least, "minimal postulates" are not unique. There are many many different ways of formulating any theory.

In terms of your long-term goal, I think that guessing at the 'one true formulation of special relativity' is the wrong approach -- if you instead learn many different ways of formulating special relativity, you're much more likely to know one that can be tweaked to accommodate the new data.

Ken G · May 16, 2008

Hurkyl said:

Mathematically speaking, at least, "minimal postulates" are not unique. There are many many different ways of formulating any theory.

Right, you made that point earlier and that is a very valid one. It is not really the "minimum postulates" that count here, it is the minimal theory. By that I mean, the theory that unifies all the observations, without making unique predictions about what is outside the intended realm of explanation of the measurement set. Newtonian mechanics should have been done that way too, it would have saved us a lot of false surprise (surprise we had no real business being surprised about).

In terms of your long-term goal, I think that guessing at the 'one true formulation of special relativity' is the wrong approach -- if you instead learn many different ways of formulating special relativity, you're much more likely to know one that can be tweaked to accommodate the new data.

That's not the issue, the goal is not to find an equivalent formulation, but a less restrictive one. For example, uniting all metrics with the same signature is already a less restrictive form of dynamics than requiring invariance of a particular one.

Hurkyl · May 16, 2008

Ken G said:

By that I mean, the theory that unifies all the observations, without making unique predictions about what is outside the intended realm of explanation of the measurement set.

With the description you've given thus far, it appears that a database of all experimental data is precisely the "minimal theory" you seek. But it is not useful scientifically (it cannot be falsified), nor practically (it cannot make predictions).

That's not the issue, the goal is not to find an equivalent formulation, but a less restrictive one.

What exactly do you mean by "less restrictive"? My initial reaction is that that's a disadvantageous trait for a scientific theory -- the less restrictive a theory's predictions, the less the possibility for failure, and thus the less confidence we get by empirically testing it. Conversely, we gain a lot of confidence when a theory passes a test in which it makes very specific predictions.

e.g. if we are considering "space is globally Minkowski" versus "space is locally Minkowski" -- the former assertion is very specific. Every piece of experimental data consistent with the former assertion is, of course, also consistent with the latter assertion.

So, according to Bayesian statistical inference, given lots of experimental data confirming both of these assertions, it is correct to favor the stronger assertion, and so we conclude "space is globally Minkowski". (And, of course, being good statisticians, we are willing to drop that conclusion if later evidence contradicts it)

For example, uniting all metrics with the same signature is already a less restrictive form of dynamics than requiring invariance of a particular one.

They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by [itex]d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2[/itex].

The "affine 4-space equipped with a compatable metric of signature +---" formulation does have pedagogical value due to its manifest coordinate-independence, but it is describing exactly the same theory as "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any coordinate chart is [itex]d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2[/itex]."

Ken G · May 16, 2008

Hurkyl said:

With the description you've given thus far, it appears that a database of all experimental data is precisely the "minimal theory" you seek.

No, a "database" is not a theory at all because it is not unified. That's what an "explanation" means-- a way to see all that data as a consequence of a single theory.

But it is not useful scientifically (it cannot be falsified), nor practically (it cannot make predictions).

Correct, the minimal theory cannot be falsified, that is why it is such a useful springboard to making the kinds of "extensions" I specifically mentioned above. It is the extensions that make predictions, and are falsifiable. That way, you know what you are doing, and avoid the "scattershot" approach by which Newtonian mechanics was replaced by special relativity, and that same scattershot approach is how special relativity is still taught today.

You see, there is no point in making predictions of experiments you cannot do, so it makes more sense to look at the experiments you can, and tailor a theory that starts with fitting all the experiments you have done, and simply extends to make a prediction for the new experiment, without being weighed down with a host of other predictions that are not being tested and probably aren't right. That way, we avoid the continual mistake of "believing" in aspects of our theories that we failed to identify as being purely out of convenience. We could also avoid this annoying illusion that science undergoes "revolutions", rather than simply learns new stuff.

What exactly do you mean by "less restrictive"?

I mean the theory would come with fewer requirements on how we picture reality, and a broader understanding of the possibilities that work equally well. For special relativity, that means that inertial observers would not be singled out as special in any way, the speed of light would not need to be isotropic, and no one would need to claim "experiments show there is no ether". We would simply set up the mathematical machinery we need to get the dynamics right, and not bother to make claims about reality that we have no way to test. Because, when we later figure out a way to test them, more often than not we discover we were wrong, and science historians will make a big deal about the shocking revolution, when in fact we were simply pretending to know something we did not know.

Ironically, this is exactly what happened with the Michelson-Morely experiment, but we missed the full lesson there. The lesson was not "M-M showed us we made the wrong assumptions", as it is normally taught, but instead, "M-M showed us the danger in making assumptions that we simply don't need to unify the observations we have on hand". We should have simply gone into M-M with an open mind, realizing that we were entering a new regime and anything could happen. We could have come equipped with several possible extensions of our current theory, and used the experiment to distinguish them, but no one needed to act the least bit surprised when one extension worked better than another.

My initial reaction is that that's a disadvantageous trait for a scientific theory -- the less restrictive a theory's predictions, the less the possibility for failure, and thus the less confidence we get by empirically testing it. Conversely, we gain a lot of confidence when a theory passes a test in which it makes very specific predictions.

But what "confidence" do you mean? Confidence that the theory is indeed working for unifying a particular measurement set, and other measurements that fit into the same overall framework, or confidence that the theory will work when applied to some completely new measurement? The former kind of confidence is the confidence that builds bridges-- the latter is the one that makes fools of the best thinkers of all time.

e.g. if we are considering "space is globally Minkowski" versus "space is locally Minkowski" -- the former assertion is very specific. Every piece of experimental data consistent with the former assertion is, of course, also consistent with the latter assertion.

The former is the more restrictive theory, because it makes more assertions about reality, and has more ways to be false. So this is a good example of just what I'm talking about-- the latter unifies our current observations, the former is false (it breaks down either if there is gravity, or if the observer accelerates). The latter requires extensions to expand its usefulness into those realms, but that's just what it should need.

So, according to Bayesian statistical inference, given lots of experimental data confirming both of these assertions, it is correct to favor the stronger assertion, and so we conclude "space is globally Minkowski". (And, of course, being good statisticians, we are willing to drop that conclusion if later evidence contradicts it)

We already know that is false.

They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by [itex]d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2[/itex].

I agree that +--- is equivalent to -+++, it didn't matter because we are comparing to +++ with absolute time.

The "affine 4-space equipped with a compatable metric of signature +---" formulation does have pedagogical value due to its manifest coordinate-independence, but it is describing exactly the same theory as "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any coordinate chart is [itex]d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2[/itex]."

Again, that is not the coordinate representation of the Minkowski metric in any coordinate chart-- only any orthogonal coordinate chart. Furthermore, we need a finite concept of distance-- an infinitesmal one does not suffice to determine the dynamics, so the latter requires a special treatment of inertial observers, un awkward and unnecessary aspect of the theory that is often mistaken for a physical statement of some kind.

Hurkyl · May 17, 2008

A trivial theory is still a theory -- aesthetic grounds are not sufficient justification for rejecting it. And besides the 'database theory' is the only theory (up to equivalence) that makes no assertions beyond the experimental data. This is fairly easy to see: if you have a theory that is not equivalent to the database theory, then either it deals with things that are not experimental results, or it makes assertions that cannot be proven by the data.

And, of course, it is a trivial exercise to show that each piece of experimental data is a theorem of the database theory.

For special relativity, that means that Inertial Observers would not be singled out as special in any way,

But they can be singled out: an observer is inertial if and only if his worldline is straight,

the speed of light would not need to be isotropic

and it's an easy theorem that null vectors have 'speed' one in any orthonormal affine coordinate chart.

The theory of special relativity, like any other theory, is formulation independent: you get the same theory no matter how you formulate it. e.g. if you formualte it in terms of inertial observers and Poincaré-invariant coordinate metrics, you get exactly the same theory as if you formulate it in terms of a coordinate-independent metric with a specified signature.

Even Lorentz relativity is effectively the same as special relativity. LR includes an extra constant symbol denoting an orthonormal coordinate frame, but is otherwise exactly the same theory as special relativity. (mathematically speaking, at least)

We should have simply gone into M-M with an open mind, realizing that we were entering a new regime and anything could happen.

Tomorrow is a new regime too. :tongue: Yes, a closed mind is bad for science... but so is naïeveté. Scientific theories have been well-supported by empirical evidence, and that affords us confidence that they will continue to be correct. When going into a new experiment, we should have exactly as much confidence in our theories as they deserve... no more, and no less.

But what "confidence" do you mean?

The confidence afforded to us by the scientific method.

The former is the more restrictive theory, because it makes more assertions about reality, and has more ways to be false. So this is a good example of just what I'm talking about-- the latter unifies our current observations, the former is false (it breaks down either if there is gravity, or if the observer accelerates).
...
We already know that is false.

The point is, before we had evidence contradicting the former, it was scientifically correct to favor the "globally Minkowski" hypothesis over the "locally Minkowski" hypothesis. Why was that scientifically correct? Because the "globally Minkowski" hypothesis had stronger empirical support.

Of course, with the evidence we now have, "locally Minkowski" has stronger empirical support.

Hurkyl said:

They look equivalent to me. Every metric of signature +--- determines a unique class of coordinate charts (related by Poincaré transformations) in which the coordinate representation of the metric is given by [itex]d\tau^2 = dt^2 - dx^2 - dy^2 - dz^2[/itex].

I agree that +--- is equivalent to -+++, it didn't matter because we are comparing to +++ with absolute time.

Huh? That has absolutely nothing to do with what I said in that quote.

Again, that is not the coordinate representation of the Minkowski metric in any coordinate chart-- only any orthogonal coordinate chart.

That was a typo, sorry. It was supposed to say "affine 4-space equipped with a distinguished class of coordinate charts, and a metric whose coordinate representation in any distinguished coordinate chart..."

Furthermore, we need a finite concept of distance-- an infinitesmal one does not suffice to determine the dynamics

That's what calculus is for.

Hurkyl · May 17, 2008

Ken G said:

When you change the observer, you will change the basis vectors used to label the events of spacetime in terms of the physical measurables "distance" and "time". That is both a change in coordinates, in that the labels are changing in a particular way, and a transformation of the vector space into itself, as all the events are now seen from a different perspective-- that of a new observer.

Looking at the same events from a different perspective -- that sounds exactly like you're leaving Minkowski space unchanged, but changing the coordinate chart you're using.

For a vivid (but Euclidean) example -- put a sheet of paper on the floor and look at it. Now, walk somewhere else and look at the paper again. Did the paper change?

The events are the same,

And since, physically speaking, events in 'reality' correspond to points in Minkowski space, we see that the operation you propose doesn't transform Minkowski space. (In fact, there is nothing physical that enacts a transformation of Minkowski space)

but the vectors are different (indeed, nonlinear transformations would make them no longer even members of a vector space at all).

Minkowski space is not a vector space; it is an affine space. You can view an affine space as a vector space by choosing an 'origin' and corresponding each point of the affine space with the vector given by subtracting off the origin. If you change the origin, then yes, that correspondence will change.

It looks like you're trying to make your observer correspond to an origin but that doesn't make sense -- the origin is a single point, whereas the observer occupies an entire worldline. (actually an entire 3+1-dimensional region -- we only get a worldline if we assume zero spatial extent)

If you want to get something that is invariant to all linear transformations, you must choose one vector from the vector space and the other from its dual space, as I dimly understand the situation.

If all linear transformations of interest act trivially, you get invariance automatically.

That's what happens with a coordinate change -- the change-of-coordinates transformation doesn't do anything to Minkowski space; it only changes the coordinate functions, and the coordinate spaces.

Now, the fact that the symmetry group of Minkowski space is Poincaré group is interesting... and I suspect the thing you're really interested in; coordinate changes are just a red herring. And the key point is that Minkowski space is not symmetric under skew transformations, or a rescaling along a single axis; only Poincaré transformations preserve the Minkowski structure.

Hurkyl · May 17, 2008

In fact, studying geometry by its symmetry group is the topic of the Erlangen program.

Ken G · May 17, 2008

Hurkyl said:

A trivial theory is still a theory -- aesthetic grounds are not sufficient justification for rejecting it.

Not aesthetic grounds-- the grounds would be the definition of what a theory is.

And besides the 'database theory' is the only theory (up to equivalence) that makes no assertions beyond the experimental data.

There have to be some defining assumptions that theories make, such as objectivity and repeatability. These can never be proven, only falsified. That is the important kind, the "bridge-building" kind, of predictions that theories must make. These are of the "weather prediction" kind, and are the useful predictions, the bridge-building predictions, that science makes. To make predictions of that nature, there is no need to pretend theories are things that they are not.

Nevertheless, that kind of prediction is often (unfortunately) viewed as a trivial aspect of a theory-- people sometimes treat theories as if their value (erroneously) is their ability to predict outside the box of the core assumptions that define what a theory is. Those latter kinds of "predictions" are really just guesses, a way to extend a theory that, once tested, form a means to create new theories, i.e., they become predictions of the important kind. One doesn't need a theory to form a hypothesis, though they can be a helpful guide if we need one. Unfortunately, the latter gets all the attention, despite being extraneous to the value of science, and results in all kinds of misconceptions about what science is and what you can use it for (not to mention a list of "revolutions" in scientific thinking-- rather than just big discoveries, which is all they really are).

But they can be singled out: an observer is inertial if and only if his worldline is straight,

That is circular reasoning, you simply define straight that way. All we can say is their accelerometers read zero, if we want to think of that as special that's up to us-- there's no need to go and build physics around it.

and it's an easy theorem that null vectors have 'speed' one in any orthonormal affine coordinate chart.

At last we see the appearance of the word "orthonormal", which I've been hammering for awhile now.

The theory of special relativity, like any other theory, is formulation independent: you get the same theory no matter how you formulate it. e.g. if you formualte it in terms of inertial observers and Poincaré-invariant coordinate metrics, you get exactly the same theory as if you formulate it in terms of a coordinate-independent metric with a specified signature.

I remain unconvinced of that, and this is an important purpose of the thread. The key thing I have maintained is not that SR makes false predictions for quantitative measurements within the regime where it has been tested, nor that it is unable to predict the dynamics of any particle with a known proper acceleration that satisfies certain other assumptions (as are necessary in either classical physics or Dirac's formulation of quantum mechanics). Rather, its problems are pedagogical, in that it may make unnecessary guesses that could prove to be false in future experiments outside the realm where it has been tested. Such false "predictions" are not an important part of any theory, just as it was not an important part of Newton's laws that they work to arbitrary speeds (and the fact that they don't has in no way compromised their use in situations where they are warranted).

The pedagogical problems of special relativity include the fact that its postulates cannot be applied from the reference frame of an accelerated observer. Also, they imply choices about how we picture reality that are not supported, they are merely assumed. As such, it generates explanations for "why things happen the way they do" that are inconsistent between observers. A classic example is, what is the cause of a blueshift between two rockets in free space. If we take Einstein's convention for "stationary" meaning the frame of any inertial observer describing their universe, then the cause of blueshift observed by an inertial observer is always the squeezing of the wavelength due to the motion of the source, coupled with time dilation of the source. However, a more flexible interpretation of the "cause" of that phenomenon is that the wave period simply depends on the proper time of any receiver on any path that connects the path between the absorption of the prior wavecrest and the following wavecrest (calculus could make that even more precise). That accounts for everything, we do not need either of the two "postulates of special relativity" to perform that calculation, we need only the signature of the metric and the conventions by which the observer measures time (i.e., they will ultimately ratio the period of a wave to the period of a clock).

The rest is pure language and arbitrary picture/coordinates, and does not belong as part of the postulates of a theory. Once again, where you will see the problem with the latter is when some observation contradicts those postulates, and we'll ask, "but why did we expect the postulates to hold, based on the database we already had?" The answer to that will be, "there was no reason, we were deluding ourselves".

Tomorrow is a new regime too. :tongue:

Yes, but all that goes right into the definition of a theory, as I alluded to above. We do not need to add special postulates to handle that, it is in all scientific theories from the start. This is my point, the importance of understanding what aspects of our theory are there because that's how we define scientific theories, what aspects are there because they unify existing observations, what parts are extensions that we are curious about testing and have no idea if they will work or not (like Newton and arbitrary speed), and what parts are just pure fantasy (like MWI) that we have no reason whatsoever to ever pass a falsifiable test.

The confidence afforded to us by the scientific method.

But I still don't know which of the two versions of "confidence" you mean. I would say the confidence afforded to us by the scientific method is of the first kind I listed, but you seem to be talking about the second situation.

The point is, before we had evidence contradicting the former, it was scientifically correct to favor the "globally Minkowski" hypothesis over the "locally Minkowski" hypothesis. Why was that scientifically correct?

It wasn't, any more than it was "scientifically correct" to think Newton's laws would extend to arbitrary speed, or that Ptolemy's model would hold up to more precise observations. The only things that are scientifically correct are to expect predictions "within the box" of the current dataset to work, that's like predicting the weather or building a bridge. Other types of predictions are called "guesses", and are not scientifically correct to expect to work (a point history has been rather clear on, especially once you bear in mind that "the winners write the history").

Because the "globally Minkowski" hypothesis had stronger empirical support.

No, it had no empirical support (even in the absence of gravity), as it was only formulated and tested for inertial observers. Indeed, it breaks down when you leave that observational regime, as is not untypical of phyical theories.

Of course, with the evidence we now have, "locally Minkowski" has stronger empirical support.

If by that you mean that "global Minkowski is known to be wrong", I agree.

Huh? That has absolutely nothing to do with what I said in that quote.

I thought you were pointing out that -+++ is the same as +---. What is written is merely a re-affirmation of what I've been saying all along-- that the Minkowski metric is invariant only under the transformations of the Poincare group (and is not invariant under arbitrary coordinate transformations or changes of observer, though its signature is).

That's what calculus is for.

If you want to use calculus to integrate the metric between events from the perspective of a constantly accelerated observer, you need to integrate the Rindler metric, not the Minkowski metric. The latter gives you the wrong answer, that's the point.

Ken G · May 17, 2008

Hurkyl said:

Looking at the same events from a different perspective -- that sounds exactly like you're leaving Minkowski space unchanged, but changing the coordinate chart you're using.

That depends on what you mean by "Minkowski space", this is very much the point here. What many (most) mean by that phrase is, "a metric space ruled by the Minkowksi metric", but if you take that meaning, your statement is wrong. It is only right if you take the more general meaning of a "space governed by Minkowski geometry, constrained by metrics of Minkowski signature and the tensorial transformation rules between them." More to the point, you seem to be imagining that events themselves are members of a vector space, but they are not-- that stucture must be imposed on them by choosing basis vectors, i.e., by the coordinates. Hence, changing observer/coordinates is indeed a mapping of the vector space into itself, a mapping that if it leaves the events unchanged it changes the vectors that are associated with them, or if it leaves the vectors unchanged then it changes the events associated with them. I think one is the covariant picture, the other the contravariant. I believe the "dual space" is what you get if you make the opposite choice.

For a vivid (but Euclidean) example -- put a sheet of paper on the floor and look at it. Now, walk somewhere else and look at the paper again. Did the paper change?

As I said, if the events are taken to be invariant, then the vectors have changed. As in your example.

And since, physically speaking, events in 'reality' correspond to points in Minkowski space, we see that the operation you propose doesn't transform Minkowski space.

I thought this is what you are imagining, but I think you are incorrect. The events are one thing, the points in Minkowksi space are another, and the connection is made via the coordinatization. You are imagining that they both stay the same when we change oberver/coordinates, but they do not-- one or the other must change.

Minkowski space is not a vector space; it is an affine space.

Translations of the origin are of no interest to me, we are talking about changing reference frame. In other words, we talking about the vectors that connect events, not the vectors that connect events to an origin.

It looks like you're trying to make your observer correspond to an origin but that doesn't make sense -- the origin is a single point, whereas the observer occupies an entire worldline.

Rest assured, that is not what I'm trying to do. The fact that an observer is on a worldline is very much my concern with the standard formulation of special relativity-- it not only requires the worldline be inertial, it further require the worldline has always been inertial and always will be. That's terribly over-restrictive, and there's just no need for it.

That's what happens with a coordinate change -- the change-of-coordinates transformation doesn't do anything to Minkowski space; it only changes the coordinate functions, and the coordinate spaces.

Again, this is not the standard formulation, involving the Minkowski metric.

Now, the fact that the symmetry group of Minkowski space is Poincaré group is interesting... and I suspect the thing you're really interested in; coordinate changes are just a red herring.

Yes, this is what I'm saying, except I'm saying that the red herring is the Minkowski metric. We don't need a symmetry group that limits our postulates, indeed general relativity figures out how to do it with no such limitation.

And the key point is that Minkowski space is not symmetric under skew transformations, or a rescaling along a single axis; only Poincaré transformations preserve the Minkowski structure.

That's what I've been saying, as I recall right about the time I was "boring" and "exasperating" DaleSpam right out of the thread.

Hurkyl · May 17, 2008

Ken G said:

Not aesthetic grounds-- the grounds would be the definition of what a theory is.

Mathematically, that definition is: (given an ambient formal logic and formal language)

A theory is a collection of statements made in the given formal language that is closed under logical deduction.

(And given any set S of statements in a formal language, they generate a theory -- in particular, the theory consisting of all logical statements that are provable from S)

We don't need a symmetry group that limits our postulates

There is always a symmetry group, whether you state it explicitly or not.

general relativity figures out how to do it with no such limitation.

e.g. general relativity is symmetric under any isometry of differential manifolds, and the frame bundle is symmetric under global and local Lorentz transformations.

I thought this is what you are imagining, but I think you are incorrect. The events are one thing, the points in Minkowksi space are another, and the connection is made via the coordinatization.

*shrug* I guess there's nothing left to say but "you're wrong". (And similarly for many of the points in your previous posts)

Translations of the origin are of no interest to me

Minkowski space does not have an origin. It is not a vector space. (Just like Euclidean space)

The fact that an observer is on a worldline is very much my concern with the standard formulation of special relativity-- it not only requires the worldline be inertial, it further require the worldline has always been inertial and always will be.

It wasn't, any more than it was "scientifically correct" to think Newton's laws would extend to arbitrary speed, or that Ptolemy's model would hold up to more precise observations.

If you don't have empirical evidence that Newton's laws shouldn't extend to arbitrary speeds, then you don't have any scientific grounds for expecting them to fail for arbitrary speeds. (I'm assuming you meant to make a sensical statement -- we knew even before special relativity that notions of absolute velocity had no physical meaning)

Ken G · May 17, 2008

Hurkyl said:

Mathematically, that definition is: (given an ambient formal logic and formal language)

A theory is a collection of statements made in the given formal language that is closed under logical deduction.

As this is a physics forum, I would have preferred the scientific definition. For that, I think Wiki (http://en.wikipedia.org/wiki/Theory) does fine:
"In science a theory is a testable model of the manner of interaction of a set of natural phenomena, capable of predicting future occurrences or observations of the same kind, and capable of being tested through experiment or otherwise verified through empirical observation. "

The key words there are "model" and "observations of the same kind". I would say Wiki is right on target here, even so far as underscoring the imporance of "inside the box" predictions over "outside" ones.

There is always a symmetry group, whether you state it explicitly or not.

My point is that the symmetry group is all there is-- there is nothing special about inertial observers simply because they exhibit that symmetry. Arbitrarily accelerated observers exhibit other symmetries, what we need is not a way to flag the inertial ones, but rather a prescription for matching the symmetry to the observer.

A good concrete example of this is the twin paradox. With the standard formulation of SR, the reduced aging of the traveling twin is explained as "due to time dilation", as if time dilation itself was something other than an arbitrary coordinatization. Or we can choose the accelerated frame, and then the standard SR formulation with the Einstein simultaneity convention allows us to shift between inertial frames, account for time dilation, and tack on the simultaneity shift due to switching frames. Then the "reason" for the younger twin comes out sounding like "it's all due to the simultaneity convention when you accelerate", as if that was something physical rather than yet another arbitrary coordinatization. Neither of those are decent physical explanations in my view, they are both simply mistaking a coordinate convenience for a statement about how reality works. The better statement of why the difference in ages occurs is simply that different time elapses on different spacetime paths connecting two events-- but that's not the explanation that stems directly from the postulates of SR (even if we know it is in fact correct).

*shrug* I guess there's nothing left to say but "you're wrong". (And similarly for many of the points in your previous posts)

You said that when you defended the idea that the Minkowski metric was invariant to any coordinate change, but now you are recognizing orthonormal transformations, and when you claimed that changing an observer was "just a coordinate change, not a mapping from spacetime into itself", a claim you have also apparently backed off on.

Minkowski space does not have an origin. It is not a vector space. (Just like Euclidean space)

Metrics apply to vector spaces, so once again, if "Minkowski space" takes on its usual meaning as "the spacetime vector space with the Minkowski metric", then it is a vector space.

I'm confused what confuses you about this perfectly natural statement about the normal formulation of special relativity (and I remind you of the John Baez quote I posted earlier in the thread).

If you don't have empirical evidence that Newton's laws shouldn't extend to arbitrary speeds, then you don't have any scientific grounds for expecting them to fail for arbitrary speeds. (I'm assuming you meant to make a sensical statement -- we knew even before special relativity that notions of absolute velocity had no physical meaning)

You may indeed assume I was making a sensible statement there. Furthermore, if you have no reason to expect they do extend to arbitrary (relative-- obviously) speeds, then why do you think it is "scientifically correct" to expect they will? This is precisely what I am saying is not scientifically correct, as history has shown many times. It is scientifically correct to form no opinion in advance of the observation.

Hurkyl · May 18, 2008

Ken G said:

The key words there are "model" and "observations of the same kind".

Be aware that many people (even technical people!) often make statements with an implicit assumption of nontriviality -- I would be extremely hesitant to take such an common-language heuristic explanation as being accurate on such a detail. (in fact, I would even expect experts to disagree on that detail)

That said, the database theory only needs a slight tweak to predict "observations of the same kind" -- it takes 'same kind' perfectly strictly, and only makes predictions of new experiments that are identical to previous ones.

I would say Wiki is right on target here, even so far as underscoring the imporance of "inside the box" predictions over "outside" ones.

You seem to read that statement much differently -- I would read it as, for example, a theory of fluid motion is not expected to make predictions about photonics.

That said, you are being very dogmatic about what's "inside the box". Every experiment ever performed was performed "before today". Expecting any of that data to give us information about "tomorrow" is clearly just an extrapolation -- no different in principle than extrapolating kinematics to high relative velocities, or assuming that the sun's core obeys same nuclear physics as we observe in the laboratory.

A good concrete example of this is the twin paradox.

The twin (pseudo)paradox is, by definition of 'paradox', an example of fallacious reasoning. The resolutions of the twin paradox are meant specifically to identify the flaw in the reasoning and explain why it is a flaw.

You said that when you defended the idea that the Minkowski metric was invariant to any coordinate change, but now you are recognizing orthonormal transformations, and when you claimed that changing an observer was "just a coordinate change, not a mapping from spacetime into itself", a claim you have also apparently backed off on.

Yes, the metric on Minkowski space is invariant under coordinate change. Yes, what you described as 'changing an observer' appeared to be nothing more than a coordinate change. Yes, for an actual (affine) transformation of Minkowski space itself to respect the metric, it must be Poincaré.

Metrics apply to vector spaces, so once again, if "Minkowski space" takes on its usual meaning as "the spacetime vector space with the Minkowski metric", then it is a vector space.

Yes, vector spaces may have metrics¹. So can affine spaces. And pseudo-Riemannian manifolds by definition have a metric. Minkowski space, like Euclidean space, is not a vector space.

1: The kind of metric we're talking about here. The metric on a metric space is another concept, and the two notions are not compatable in the case of interest here.

I'm confused what confuses you about this perfectly natural statement about the normal formulation of special relativity (and I remind you of the John Baez quote I posted earlier in the thread).

Your assertion that all worldlines are inertial is patently false. And note that the Baez quote doesn't say anything about observers or worldlines.

Furthermore, if you have no reason to expect they do extend to arbitrary (relative-- obviously) speeds, then why do you think it is "scientifically correct" to expect they will?

I wouldn't.

But we do have reasons to expect Newtonian mechanics to work for arbitrary relative speed: all of that pesky empirical evidence supporting Newtonian mechanics. :tongue: In fact those reasons are still applicable today.

Ken G · May 18, 2008

Hurkyl said:

In fact, studying geometry by its symmetry group is the topic of the Erlangen program.

Indeed, and I am suspecting that the right way to do special relatilvity is similar. By "right", I mean the way that carries no unnecessary concepts that are included purely for convenience and familiarity, but which ultimately replace the actual point of what has been discovered about reality with pictures (like the isotropic speed of light or the Einstein simultaneity convention) that are useful in practice but of deceptive physical content. They are fine for doing calculations, but may not be the best way to unify speical relativity with other advances in physics.

Ken G · May 18, 2008

Hurkyl said:

That said, the database theory only needs a slight tweak to predict "observations of the same kind" -- it takes 'same kind' perfectly strictly, and only makes predictions of new experiments that are identical to previous ones.

That's exactly why I would not count it a theory, nor the goal of science.

You seem to read that statement much differently -- I would read it as, for example, a theory of fluid motion is not expected to make predictions about photonics.

Correct, I read it differently. I read it that, for example, a theory of particle dynamics that explains ideal gases would not be expected to describe the motions of those same particles when confined to the scales inside atoms.

That said, you are being very dogmatic about what's "inside the box". Every experiment ever performed was performed "before today". Expecting any of that data to give us information about "tomorrow" is clearly just an extrapolation -- no different in principle than extrapolating kinematics to high relative velocities, or assuming that the sun's core obeys same nuclear physics as we observe in the laboratory.

I agree that it is quite difficult to say categorically what is a difference "in principle", but nevertheless this is the charge that is put to science-- when you are building a bridge, for example, you face that charge all the time.

The twin (pseudo)paradox is, by definition of 'paradox', an example of fallacious reasoning. The resolutions of the twin paradox are meant specifically to identify the flaw in the reasoning and explain why it is a flaw.

The only paradox there stems from the different sounding explanations. That problem would be avoided in the approach I'm advocating.

Yes, the metric on Minkowski space is invariant under coordinate change.

(shrug)-- that is simply wrong, what more can I say. We were making progress when we established that the invariant was only the signature of the metric, and that the manifold was Lorentzian as a result. Don't backslide now.

The metric on a metric space is another concept, and the two notions are not compatable in the case of interest here.

Equivocation. We have always been talking about the standard way special relativity is described and axiomatized, right from the start of the thread. As such, "the metric on a metric space" is just what we've been talking about. What we discovered of importance, in my view, is that it is not the metric at all that generates gravity-free dynamics, it is the Lorentzian geometry of the manifold, which is connected to the signature of the metric. If you want a true coordinate-free invariant, you must work with the covariant/contravariant dual spaces, as neither the covariant metric tensor, nor the contravariant metric tensor, is by itself invariant to the kinds of transformations of spacetime into itself that we need to do physics from the perspective of different observers.

Your assertion that all worldlines are inertial is patently false.

A pretty good indicator that I never made, or even thought, any such assertion.

And note that the Baez quote doesn't say anything about observers or worldlines.

Nevertheless, what it did say is something you have simply ignored.

But we do have reasons to expect Newtonian mechanics to work for arbitrary relative speed: all of that pesky empirical evidence supporting Newtonian mechanics. :tongue: In fact those reasons are still applicable today.

I haven't the vaguest idea what you are trying to say here, because taking the literal meaning is obviously "patently false".

Hurkyl · May 19, 2008

Ken G said:

The only paradox there stems from the different sounding explanations. That problem would be avoided in the approach I'm advocating.

Students have to learn coordinates -- they are so computationally useful that it would be harmful to deny them that knowledge. And while students learn coordinates, many will make mistakes, and some will rediscover the twin pseudoparadox. Showing them an unrelated derivation of the same quantity does not help -- it does not fix their misunderstanding of coordinates.

If we were considering it as a putative paradox, your approach is entirely useless: a paradox consists of two separate arguments that lead to contradictory results. Offering yet another argument does not repair the theory.

(shrug)-- that is simply wrong, what more can I say. We were making progress when we established that the invariant was only the signature of the metric, and that the manifold was Lorentzian as a result. Don't backslide now.[/quote]
You're equivocating. The metric is invariant under all coordinate changes. Amongst symmetries of Minkowski space, it is invariant only under Poincaré transformations. You need to stop confusing those two ideas.

Equivocation. We have always been talking about the standard way special relativity is described and axiomatized, right from the start of the thread. As such, "the metric on a metric space" is just what we've been talking about.

Wrong. See:
(pseudo)Metric tensor
metric (as in a metric space)

A pretty good indicator that I never made, or even thought, any such assertion.
Nevertheless, what it did say is something you have simply ignored.

So, why did you say:

The fact that an observer is on a worldline is very much my concern with the standard formulation of special relativity-- it not only requires the worldline be inertial, it further require the worldline has always been inertial and always will be.

?

I haven't the vaguest idea what you are trying to say here, because taking the literal meaning is obviously "patently false".

I meant exactly what I said -- this very day, we have reasons to believe Newtonian mechanics works for arbitrary relative velocities, and that is perfectly consistent with the scientific conclusion that Newtonian mechanics doesn't work for arbitrary relative velocities. I'm trying to provoke thought about how empiricism works. :tongue:

Ken G · May 19, 2008

Hurkyl said:

Students have to learn coordinates -- they are so computationally useful that it would be harmful to deny them that knowledge. And while students learn coordinates, many will make mistakes, and some will rediscover the twin pseudoparadox. Showing them an unrelated derivation of the same quantity does not help -- it does not fix their misunderstanding of coordinates.

But I'm not talking about the students who have made a mistake, I'm talking about those who did everything correctly and can't figure out why the two observers cannot agree on the reason that one is younger. That's a problem. The standard answer is "the reason is itself coordinate dependent", and I used to accept that answer. Now, I see it as flawed-- it misses the whole point of relativity. If the point of relativity is that different observers can use the same physics, then they should also be able to do it in a way that finds the same answer to the same question. It really isn't that hard to do relativity that way, it would just use coordinates as a convenience rather than as something seen as an integral part of the physics.

Perhaps an example would help-- the ficticious forces of a rotating reference frame. The standard way we teach that is to first show students why such forces are merely illusions, as they don't fit the postulates of Newton's laws and they emerge from the coordinates not the physics. Later, after we have convinced students such forces don't exist, we give them problems where they are terribly convenient to use, and most students end up not really caring if those forces are real or not, they work if you know how to use them. So we are in effect saying, in reality Newton's laws are true, but in practice, we can relax them if we know what we are doing. I'd say this story is a classic example of losing track of what Newton's laws are for, i.e., how to solve problems, not how to tell reality what is real.

We make the same mistake again in relativity, even as we pretend we are "fixing" Newton's laws by not making the mistake of thinking in absolute terms. We were on the right track, but should have gone all the way-- we should have found the way Newton's laws should have been formulated in the first place, where they would have made all the same predictions at low relative speeds and simply not known what they would do at high speeds nor even how high the speed needed to get to discover the breakdown. That would be the "honest" way to do science, and would have avoided any need for a "revolution" in 1905.

If we were considering it as a putative paradox, your approach is entirely useless: a paradox consists of two separate arguments that lead to contradictory results. Offering yet another argument does not repair the theory.

No, my approach is not at all useless. Let's look at how my approach applies to the centrifugal force example. I say, "why do I weigh less at the equator", and someone else says "because centrifugal force counteracts some of gravity". Someone else says "no, that's wrong, there's no such thing as centrifugal force to counteract gravity, it is because I simply require less force from the scale to move in a circle at the equator". Two different answers to the "why" question, which is the only sense to which the twin problem is a "paradox" (once we've recognized failures of our daily intuition does not count as paradoxical). In first-year physics, the first answer is "wrong" and the second is "right", but later when you learn general relativity, either answer is just as good.

What I'm saying is, let's not have either explanation, if we really want our best answer to the "why" question, let's penetrate a step deeper and find the most unifying answer, phrased in a coordinate-independent way that cares about neither the presence nor the absence of "centrifugal force". We can still look at the other answers, and the concept of centrifugal force if we want it, and tailor to the audience or the situation, but we don't imagine we are saying something physically true, we are just selecting a convenient picture to use because we know it gets the right answer.

You're equivocating. The metric is invariant under all coordinate changes.

I do not know what you mean by that statement. To me, if someone says "the metric is invariant", they mean the action of the metric on all vector pairs v_1,v_2, selected from a vector space, is invariant under any coordinate change (i.e, homeomorphisms) of the vector space. Since the action of the metric is not invariant under any homeomorphism that is not orthonormal, that contradicts your statement. Obviously you mean something different by it, but the real issue here is, if you pick up a relativity textbook and start reading about the "Minkowski metric", are you reading about the actual thing that defines the structure of Lorentzian manifolds, or are you reading about some specially coordinatized version of that true structure? I say the latter, and that's the problem with it.

Amongst symmetries of Minkowski space, it is invariant only under Poincaré transformations.

Then it is not invariant under arbitrary transformations. You have not established why there is anything to "confuse" here. Note that we are not discussing whether or not mathematicians know what a Minkowski space is, they invented it, nor are we discussing whether or not dynamics on a pseudo-Riemannian manifold are locally that of a Lorentzian manifold, we know that they are. We are talking about whether or not we should treat the Minkowski metric as something special, or as just one from a whole class of metrics with the same signature that are all equally physically "real" and equally selected from the class of metrics important for understanding spacetime from the perspective of any observer.

Wrong. See:
(pseudo)Metric tensor
metric (as in a metric space)
[/quote]There does appear to be a difference between a metric tensor and a metric space, the former being coordinate independent but taking on different forms in different coordinates. I'm still a bit confused on this point, as for example the "Rindler metric" and the "Minkowski metric" both apply to flat spacetime but for different observers. Whether or not that makes them different metrics or not is the confusing part. I was wrong about using them with the dual space-- the metric tensor is a way to choose both vectors from the same vector space without using the dual space. The connection between the mathematics and the physics, and the "specialness" of inertial coordinates, is murky yet.

So, why did you say:

The fact that an observer is on a worldline is very much my concern with the standard formulation of special relativity-- it not only requires the worldline be inertial, it further require the worldline has always been inertial and always will be.

That quote does not claim worldlines are inertial, it says that noninertial observers also have worldlines and should be able to coordinatize spacetime using constantly changing basis vectors. For example, Einstein's simultaneity convention is a lot different for such an observer (allowing time to be perceived as going backward, for example).

I meant exactly what I said -- this very day, we have reasons to believe Newtonian mechanics works for arbitrary relative velocities, and that is perfectly consistent with the scientific conclusion that Newtonian mechanics doesn't work for arbitrary relative velocities. I'm trying to provoke thought about how empiricism works. :tongue:

I'm not getting the connection, it just sounds like a contradiction. Why would we care if we have reasons to believe Newtonian physics works for arbitrary v if we know it doesn't?

dx · May 20, 2008

I've been following this discussion for a while and although I don't understand some of the details, I would like to offer my views on the following question - "what should we expect in an unfamiliar or untested regime?"

Ken G, if I've understood him, seems to be saying that we should not expect anything, and should not be surprised if our current theories don't extend to it. I think there are a few problems with this idea. Firstly, I think he is confusing the ideas of "expecting something" and "knowing something". It is true that we did not know whether Newtonian mechanics holds at arbitrary velocities, but given the information we had at the time, that was the correct thing to expect.

Take a coin toss for example. Say a superficial examination of the coin did not provide us with any information favoring one side to the other. Then the best thing to expect would be that it is equally likely to get a head or a tail. Of course that does not mean that we know what will happen. It only means that it is the best thing to expect given our current state of information. If we tossed the coin billions of times, and it turns out as we expect, we may start believing our "1/2 theory" very strongly. We expect that no matter how many times we toss it, it will be approximately half heads and half tails. We don't know, but we expect. But then an Einstein might come along and analyse the coin more carefully, and he may discover that there's a slight bias in the coin. According to him, the probability of heads is not .5, but say .5 + 10^(-100). So he says "the 1/2 theory is only approximate, and is valid at 'small tosses'. At 'high tosses' it must be replaced with the (1/2 + 10^(-100)) theory." There might be a physicist that said "I don't expect anything at high tosses", but clearly that position has no value. We must expect what our information leds us to expect.

Ken G said:

... where they would have made all the same predictions at low relative speeds and simply not known what they would do at high speeds nor even how high the speed needed to get to discover the breakdown.

A theory cannot be restricted in that way unless your theory is just the set of observations you have made. A theory by definition predicts the outcomes of experiments that you have not done. Expecting Newtonian mechanics to work at high speeds is the same as expecting Newtonian mechanics to work on Mars. Both these expectations could be wrong, but given the information at the time, that was the correct thing to expect. Remember that no one said that they know that it will work at high speeds.

Hurkyl · May 20, 2008

Ken G said:

Perhaps an example would help-- the ficticious forces of a rotating reference frame. The standard way we teach that is to first show students why such forces are merely illusions, as they don't fit the postulates of Newton's laws and they emerge from the coordinates not the physics.

A cute cartoon: http://www.xkcd.com/123/

I was going to bring this example up myself; things depend upon precisely how you formulate Newton's laws. e.g. consider Newton's first law:

A physical body will remain at rest, or continue to move at a constant velocity, unless an outside net force acts upon it.

Suppose we have a (possibly time-dependent) affine coordinate function on three dimensional Euclidean space; let [itex][P]_t[/itex] denote the coordinates of the point P at time t. This also gives coordinates on the vector space associated to Euclidean space; let [itex]_t[/itex] denote the coordinates of the vector u at time t.

Let P(t) denote the position at time t (as a point in Euclidean space) of an object at that experiences zero net force. The relevant question is: "What do you mean by constant velocity?" The expression [itex]d/dt P(t)[/itex] is, indeed, a vector that remains constant over time. However, the coordinate velocity [itex]d/dt [P(t)]_t[/itex] could be nonconstant if we use non-inertial coordinates.

I'd say this story is a classic example of losing track of what Newton's laws are for, i.e., how to solve problems, not how to tell reality what is real.

On the contrary, it's a classic example of understanding coordinate representations, but not what is being represented by those coordinates.

No, my approach is not at all useless. Let's look at how my approach applies to the centrifugal force example. ... Two different answers to the "why" question, which is the only sense to which the twin problem is a "paradox"

Your example wasn't a paradox -- it was two ways of deriving the same thing. A paradox is when you have two (valid) arguments that arrive at contradictory conclusions.

The (alledged) twin paradox makes two arguments and arrives at contradictory conclusions. It is merely a pseudoparadox because we can identify that one of the arguments is not a valid one.

if we really want our best answer to the "why" question

There is no such thing as a "best" answer. The critera for judging the 'goodness' an answer depend on the circumstances, and generally speaking, different answers will be better in different situations.

any coordinate change (i.e, homeomorphisms)

Right there -- that's your problem. You have confused the notion of a 'coordinate change' with the notion of a 'homeomorphism'.

Let X be a topological space.
Let R^n be a suitable space of coordinates

. A homeomorphism on a topological space is a function X --> X.
. Changing coordinates means switching from one coordinate function R^n --> X to a different coordinate function R^n --> X.

Let's assume for simplicity that X is a vector space, we use the vector space structure on R^n, and we only consider linear transformations.

Suppose we have a coordinate function R^n ---> X, and an automorphism of X. We can compute the 'coordinate representation' of that automorphism, which is an automorphism of R^n computed for a given tuple of coordinates by:
. Compute the point represented by those coordinates
. Transform that point by the given automorphism
. Compute the coordinates of the new point

Suppose we apply a 'change of coordinates', which entails switching which coordinate function R^n --> X we are using. We can compute the corresponding 'change of basis' transformation, which is an automorphism of R^n computed for a given tuple of coordinates by:
. Compute the point represented by those coordinates, according to the first function
. Compute the coordinates of that point, according to the second function

These are two very different ideas, but are both often represented by an automorphism of coordinate space. I think your specific error is that you only think of this automorphism of coordinate space, and so you have difficulty distinguishing the two very different underlying ideas.

That quote does not claim worldlines are inertial, it says that nonInertial Observers also have worldlines and should be able to coordinatize spacetime using constantly changing basis vectors.

Anyone can coordinatize spacetime in any way they please. (I don't think that 'constantly changing basis vectors' has any literal meaning, but I think I know what you mean) Coordinate charts have nothing to do with observers.

I'm not getting the connection, it just sounds like a contradiction. Why would we care if we have reasons to believe Newtonian physics works for arbitrary v if we know it doesn't?

Because we want to do science correctly -- in particular, we don't want to make actual mistakes, nor do we want to force science to conform to our a priori biases. :tongue: (There are ways to accommodate our a priori biases without hacking the philosophy of empiricism to pieces)

In what sense do we 'know' that Newtonian physics works? Certainly not by pure reason -- we 'know' it in the sense that if we consider all of the empirical data, the evidence of failure is stronger than the evidence of success. The evidence favoring Newtonian mechanics hasn't magically vanished! It has simply been outweighed.

I bring this up because you seem to be going about empiricism in entirely the wrong way -- you seem to be going through great lengths to avoid drawing conclusions that could be wrong. But that's wholly unnecessary, and simply not how empiricism works. (And, of course, it appears that your end goal would simply result in never making new predictions)

Ken G · May 21, 2008

dx said:

Ken G, if I've understood him, seems to be saying that we should not expect anything, and should not be surprised if our current theories don't extend to it. I think there are a few problems with this idea. Firstly, I think he is confusing the ideas of "expecting something" and "knowing something". It is true that we did not know whether Newtonian mechanics holds at arbitrary velocities, but given the information we had at the time, that was the correct thing to expect.

But what you haven't answered is, why should we need to "expect" anything at all? If we already have observations that we think are relevant to the new ones, then it is the behavior in the old observations, not any theories we built from them, that cause us to "expect" something in the new observations. That's natural-- if previous results relate to new ones, we use them, that's the bridge building kind of predictions that science makes-- the important kind. But again, in that situation it is the existing observations that creates the cause for new expectations-- theories are nothing but a way to unify the information in the existing observations. When we forget that, we land in all kinds of hot water, throughout history.

If for example, if you have a situation where there are no previous observations that seem relevant, if you are probing something that is really new, then why would you expect some theory we built from other observations to lend insight into the new ones? Is it the purpose of a theory to tell reality what to do, or the other way around? I think it is that former approach that has led to all these "revolutions" in scientific thought, that were never really revolutions at all-- just comeuppances when we made assumptions we had no business making in the first place. Revolution is not a natural part of science, it is an indication of something pathological in how we are going about the process.

The paradox is, if we really did have past observations that were relevant to the outcome of the new one, then that connection exists even in the absence of whatever unifying theory we generated to understand them, and we are not really "predicting" so much as "noticing a pattern". If, on the other hand, we are really making a prediction about something that we really knew nothing about in advance, except some theory we built, then on what basis do we logically form any expectation at all?

Take a coin toss for example. Say a superficial examination of the coin did not provide us with any information favoring one side to the other. Then the best thing to expect would be that it is equally likely to get a head or a tail.

But on what basis do you say that it is equally likely? On the basis of experience, of prior observations of symmetric objects. If you hand someone a shoe, should they also expect it is equally likely to end up on the sole or the top, or would it be natural to adopt no expectation at all until some experience was built up around objects of that shape? Aren't you simply using what you know about symmetries to build your expectation, not any kind of "null hypothesis"?

There might be a physicist that said "I don't expect anything at high tosses", but clearly that position has no value.

Why not? What if it was a shoe instead of a coin?

We must expect what our information leds us to expect.

When we have information, i.e., past experience that seems relevant, yes. It is a very difficult issue to decide what constitutes "relevant past experience", yet we have to do just that every time we build a new bridge or a new airplane, so it's not a new problem to identify when we are really probing a new regime. You might say "but the aprpropriate equations for plane flight are known", but that really isn't true-- there's no such thing as "the appropriate equations" in an absolute sense, paradigm choices always have to be made, based on experience.

A theory cannot be restricted in that way unless your theory is just the set of observations you have made. A theory by definition predicts the outcomes of experiments that you have not done.

But again, note that I am distinguishing two types of prediction here, one is the type that says a certain drug therapy might cure a disease in an individual even though no testing was ever done of that drug on that individual. That's the important kind of predictions that science makes, "inside the box" of what we have experience with, and we need to be able to expect them to be right to gain the value of science. But predictons made "outside the box" are something very different, and have a far spottier record in science-- such as predicting that the drug will also work on other diseases that bear some resemblance but which we have no data for that drug. More to the point, the usefulness of predictions like that is very different, they are only there to guide new hypotheses and new experiments-- there is no need to "expect" them to be right (and any practicing physician who "expected" such predictions to be right could make significant errors in judgement).

Expecting Newtonian mechanics to work at high speeds is the same as expecting Newtonian mechanics to work on Mars.

Well this is exactly "the rub", when can you tell if your theory should be expected to work or not. Some Earthlike physics works on Mars, and some doesn't, pure and simple. There's no reason to expect that it either will or it won't, except as guided by past experience around extrapolating a particular theory in that way.

Even in the case of gravity, we can say that "the physics of gravity on Earth" does not work on Mars! However, since we have experience already with gravity in various different situations in the solar system, we have already equipped it with a capacity to be applied on Earth or on Mars. We already put that into our theory, based on observation, it was never something that we just knew had to be right.

Remember that no one said that they know that it will work at high speeds.

We can certainly agree on that, the issue is, did we have any right to be surprised that it didn't? I say, no, that is false surprise, engendered by a fallacious idea about what physical theories really are-- a fallacious idea that we seem to be even more likely to fall into in modern areas like interpretations of quantum mechanics or the "landscape" in fundamental particle theory. Basically, Einstein got away with telling reality that it ought to bend starlight passing a massive body, and that made us forget, for the umpteenth time, that the winners write the history.

granpa · May 21, 2008

But on what basis do you say that it is equally likely?

on the basis that there are 2 sides. if you knew nothing else about it you would still expect to be able to predict the outcome 50% of the time just by choosing at random. but notice that 50% is not the probability. the probability is unknown. the probability could be 100% or 0% or anywhere in between. if its not the probability what is it? the logical thing to call it would be the expectation but that word is already taken. so its called the bayesian probability.

Ken G · May 21, 2008

Hurkyl said:

The relevant question is: "What do you mean by constant velocity?" The expression [itex]d/dt P(t)[/itex] is, indeed, a vector that remains constant over time. However, the coordinate velocity [itex]d/dt [P(t)]_t[/itex] could be nonconstant if we use non-inertial coordinates.

Right, that is the need for understanding tensor quantities, and how various quantities transform. Those are clear enough to mathematicians, but it is the physicist's problem to try to use that in their interpretation of reality. Are they doing a good job, or mistaking computational conveniences for statements of what is real? Is there any difference? Mathematics doesn't answer those questions, they are metaphysical.

On the contrary, it's a classic example of understanding coordinate representations, but not what is being represented by those coordinates.

Then you can answer, is the centrifugal force something real?

Your example wasn't a paradox -- it was two ways of deriving the same thing. A paradox is when you have two (valid) arguments that arrive at contradictory conclusions.

There is no such paradox in the twin "paradox", for anyone who can do relativity, just like there's no paradox in saying 1+1 is either 1 or 2 for someone who can add. So to call it a paradox is indeed to embrace the type I was talking about-- two different sounding answers to the same question, that are both right, yet seem like they contradict. We tolerate that situation in the current formulation of relativity, and that is what I am suggesting is a weak pedagogy to accept.

The (alledged) twin paradox makes two arguments and arrives at contradictory conclusions. It is merely a pseudoparadox because we can identify that one of the arguments is not a valid one.

But that's easy, it's no more interesting than someone learning to add for the first time, encountering "paradoxes" because they are simply doing something wrong. I'm talking about the aspects of that paradox that survive a fully correct treatment in some axiomatic system, yielding contradictory sounding descriptions of the reality of what happened.

There is no such thing as a "best" answer. The critera for judging the 'goodness' an answer depend on the circumstances, and generally speaking, different answers will be better in different situations.

Of course. Yet it falls to us to make that call anyway, constantly, both as teachers and as we ourselves try to obtain the most facile understanding of how reality works.

Right there -- that's your problem. You have confused the notion of a 'coordinate change' with the notion of a 'homeomorphism'.

You said that before, but I'm claiming that when the "coordinate change" corresponds to the way a different observer measures reality, we are indeed talking about a homeomorphism-- that the physical approach makes that the required picture. In mathematics, you have more freedom to decide if you want to imagine that a change of coordinates either simply relabeled the same vectors with new names, or if it mapped the old vector space onto a new one where the structure is the stucture of the names. You make that choice when, for example, you plot a trajectory in polar coordinates. Do you write rectangular axes labeled theta and r, and plot curvy paths on them to show inertial motion, or do you draw little circles cut by radial wedges, and plot straight lines? These are mathematically equivalent, so I don't see why you are saying that one must make a distinction and cannot see a coordinate change as a homeomorphism.

. A homeomorphism on a topological space is a function X --> X.
. Changing coordinates means switching from one coordinate function R^n --> X to a different coordinate function R^n --> X.

It is trivial to create an automorphism from that by inverting the first coordinate function (it is invertible), and applying the second coordinate function. That is a perfectly valid association of a coordinate change with an automorphism on X, it seems to me, and indeed it is just what is often done in physics (as in using centrifugal forces in finding a Roche lobe, for example, where certain equations are applied prior to the final mapping back to X).

These are two very different ideas, but are both often represented by an automorphism of coordinate space. I think your specific error is that you only think of this automorphism of coordinate space, and so you have difficulty distinguishing the two very different underlying ideas.

But they are not very different ideas, expressly because we are dealing entirely with coordinate homeomorphisms here. Hence, automorphisms of R^n extend trivially by the action of the coordinate function to automorphisms of X. This is crucial, the structure of X is preserved on R^n, so there is not the distinction you describe.

Anyone can coordinatize spacetime in any way they please. (I don't think that 'constantly changing basis vectors' has any literal meaning, but I think I know what you mean) Coordinate charts have nothing to do with observers.

If that were true, then they would have no physical meaning and would be purely abstract mathematical concepts. We must have a way to connect observations to coordinates. Saying I can coordinatize spacetime any way I want is like saying I can name events "Tom", "Dick", and "Harry" if I want-- but I'm not doing physics unless I can connect these names to a ruler and a clock somehow.

I bring this up because you seem to be going about empiricism in entirely the wrong way -- you seem to be going through great lengths to avoid drawing conclusions that could be wrong. But that's wholly unnecessary, and simply not how empiricism works.

I don't see your view that empiricism is some kind of "weighing" of pro and con evidence. It is generally accepted that no theory can be proven true by any number of successes, but can be proven to need modification by any single significant failure. That stems from the need for a theory to unify, not replace, data. Of course we use "false" theories all the time, but again that is based on our experience with using them, not any kind of fundamental theoretical stance about how reality must work. All you seem to be saying here is that if we find a regime where a theory breaks down, we will still use it in regimes where it does not, but that merely serves to underscore what I mean by the reliable type of predictions that science makes-- in contrast with the guesses masquerading as predictions.

(And, of course, it appears that your end goal would simply result in never making new predictions)

My end goal would be to not confuse predictions with hypotheses, to avoid mistaking the various purposes for which we have theories in the first place.

Einstein simultaneity: just a convention?

Similar threads

Hot Threads

Recent Insights