Question on linearity of Lorentz transformations

Ich · Jul 14, 2009

You're right - of course, since straight lines in spacetime represent uniform motion.
So this statement alone does not require linearity. We need homogeneity, too.

Fredrik · Jul 14, 2009

Al68 said:

Wasn't he referring to the same concept of "inertial frame" as that used for a couple hundred years in Newtonian physics?

i.e., a reference frame in which Newton's first law is valid, which according to Newton was valid in any reference frame "neither rotating nor accelerating relative to the fixed stars."

It seems obvious to me that Einstein was specifically referring to the same concept of an inertial frame in his postulates.

Would you also define the real numbers as "numbers with the properties that people who don't know math or physics expect distances to have"? The above is obviously not a definition. It doesn't tell us which functions are inertial frames.

Al68 · Jul 14, 2009

Fredrik said:

Would you also define the real numbers as "numbers with the properties that people who don't know math or physics expect distances to have"? The above is obviously not a definition. It doesn't tell us which functions are inertial frames.

I don't see what you're saying here. If F=ma, then the reference frame is inertial. Otherwise it's not. Sounds well defined to me.

Fredrik · Jul 14, 2009

It's not. If it was, then an inertial frame in SR would be the same thing as an inertial frame in Newtonian mechanics. It clearly isn't. A coordinate system is a function from (an open subset of) spacetime into \mathbb R^4. Inertial frames are coordinate systems. If x and y are coordinate systems, then x\circ y^{-1} represents a coordinate change. I'm not sure if it's a standard term, but I call these functions "transition functions". In Newtonian mechanics, the set of transition functions associated with inertial frames is the Galilei group. In SR, it's the Poincaré group. So the transition functions are clearly not the same in both theories, and therefore the inertial frames aren't either.
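
A minimal sketch of the difference, restricted to one space dimension and units where c = 1. The function names `galilei_boost`, `lorentz_boost` and `speed` are made up for this illustration and come from no library. Each boost is treated as a transition function on (t, x), and the code checks what it does to a signal moving at speed 1.

```python
import math

def galilei_boost(v):
    """Transition function (t, x) -> (t, x - v*t) of Newtonian mechanics."""
    return lambda t, x: (t, x - v * t)

def lorentz_boost(v, c=1.0):
    """Transition function of SR in 1+1 dimensions, valid for |v| < c."""
    gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return lambda t, x: (gamma * (t - v * x / c**2), gamma * (x - v * t))

def speed(transform, u, t=1.0):
    """Speed in the new frame of a particle that passes through the
    origin and moves with speed u in the old frame."""
    t_new, x_new = transform(t, u * t)
    return x_new / t_new

# A signal with speed 1 (light, in these units) seen from a frame moving at 0.5.
light_after_galilei = speed(galilei_boost(0.5), 1.0)
light_after_lorentz = speed(lorentz_boost(0.5), 1.0)
```

With v = 0.5 the Galilei boost turns a signal of speed 1 into one of speed 0.5, while the Lorentz boost leaves it at speed 1. That is the sense in which the two sets of transition functions cannot both be compatible with an invariant speed of light.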

facenian · Jul 14, 2009

Fredrik, I'm not sure that what you last said is right: the fact that the transition functions differ between the two theories doesn't mean that the concepts of inertial frame are necessarily different.
I think that from a physical standpoint an inertial frame may be defined as one where the law of inertia holds, or perhaps it can be defined more abstractly, as Landau does, as I mentioned in an earlier post.

Fredrik · Jul 14, 2009

What I said is definitely correct. In both theories you can take spacetime to be \mathbb R^4 and let the identity map on \mathbb R^4 (i.e. the function I defined by I(x)=x for all x) be one of the inertial frames. This choice makes the set of transition functions identical to the set of inertial frames.

facenian · Jul 14, 2009

Fredrik said:

What I said is definitely correct. In both theories you can take spacetime to be \mathbb R^4 and let the identity map on \mathbb R^4 (i.e. the function I defined by I(x)=x for all x) be one of the inertial frames. This choice makes the set of transition functions identical to the set of inertial frames.

I see that your definition is mathematically correct, and that it leads to different inertial frames depending on whether you use Lorentz transformations or Galilei transformations.
I don't know whether the physical content of an inertial frame is relevant in this context.

Al68 · Jul 15, 2009

Al68 said:

I don't see what you're saying here. If F=ma, then the reference frame is inertial. Otherwise it's not. Sounds well defined to me.

Fredrik said:

It's not. If it was, then an inertial frame in SR would be the same thing as an inertial frame in Newtonian mechanics. It clearly isn't. A coordinate system is a function from (an open subset of) spacetime into \mathbb R^4. Inertial frames are coordinate systems. If x and y are coordinate systems, then x\circ y^{-1} represents a coordinate change. I'm not sure if it's a standard term, but I call these functions "transition functions". In Newtonian mechanics, the set of transition functions associated with inertial frames is the Galilei group. In SR, it's the Poincaré group. So the transition functions are clearly not the same in both theories, and therefore the inertial frames aren't either.

I was referring to Newtonian mechanics, since the issue was how well defined an inertial frame was prior to Einstein's postulates in 1905. There was no SR inertial frame definition prior to 1905.

I didn't say the Newtonian definition was correct in SR, I said it seemed to be well defined in Newtonian physics.

Fredrik · Jul 15, 2009

Al68 said:

I was referring to Newtonian mechanics, since the issue was how well defined an inertial frame was prior to Einstein's postulates in 1905. There was no SR inertial frame definition prior to 1905.

I didn't say the Newtonian definition was correct in SR, I said it seemed to be well defined in Newtonian physics.

Huh. Why would it be relevant that there's an older theory in which the same term is used to mean something different?

quZz · Jul 16, 2009

This is very interesting, Fredrik! Though I don't quite understand you =) but never mind... The question: is there a homogeneity of spacetime in an inertial system in SR? In Newtonian physics?

Al68 · Jul 16, 2009

Fredrik said:

Huh. Why would it be relevant that there's an older theory in which the same term is used to mean something different?

Because Einstein was clearly referring to a Newtonian inertial frame in his 1905 paper.

Fredrik · Jul 16, 2009

Al68 said:

Because Einstein was clearly referring to a Newtonian inertial frame in his 1905 paper.

I really hope he wasn't, because his postulates are false if he was.

If he was referring to an undefined concept, the task of "deriving" something from the postulates can be interpreted as the task of finding out which definitions are consistent with other assumptions that also seem natural. I haven't been able to find any other way to make sense of Einstein's "postulates" and the attempts to "derive" things from them.

atyy · Jul 16, 2009

Fredrik said:

I really hope he wasn't, because his postulates are false if he was.

Yeah, he was. I've read that it's ok if one says an inertial frame is one in which Newtonian laws hold at low velocities - maybe an article by Ohanian in the American Journal of Physics - didn't study it, so can't reproduce the reasoning here.

atyy · Jul 16, 2009

I don't know what's up with so(4).

http://books.google.com/books?id=PE...oNXYDA&sa=X&oi=book_result&ct=result&resnum=3

Al68 · Jul 16, 2009

Fredrik said:

Al68 said:

Because Einstein was clearly referring to a Newtonian inertial frame in his 1905 paper.

I really hope he wasn't, because his postulates are false if he was.

Clearly, he was, although he didn't use the phrase "inertial frame", he used the phrase "frames of reference for which the equations of mechanics hold good" and "system of co-ordinates in which the equations of Newtonian mechanics hold good."

How does that make his postulates false?

Fredrik · Jul 17, 2009

Al68 said:

Clearly, he was, although he didn't use the phrase "inertial frame", he used the phrase "frames of reference for which the equations of mechanics hold good" and "system of co-ordinates in which the equations of Newtonian mechanics hold good."

How does that make his postulates false?

See #34 and #36. The set of inertial frames in Newtonian mechanics can be identified with the set of Galilei transformations, but not with the set of Poincaré transformations. The first postulate is consistent with both the Galilei group and the Poincaré group, but the second postulate rules out the former.

I don't doubt that you're right about what Einstein said, but he was incredibly sloppy.

Al68 · Jul 17, 2009

Fredrik said:

See #34 and #36. The set of inertial frames in Newtonian mechanics can be identified with the set of Galilei transformations, but not with the set of Poincaré transformations. The first postulate is consistent with both the Galilei group and the Poincaré group, but the second postulate rules out the former.

I think that was his point, that in Newtonian physics, the postulates were contradictory. His derivations were the result of modifying Newtonian physics to be consistent with the postulates.

Fredrik said:

I don't doubt that you're right about what Einstein said, but he was incredibly sloppy.

I think he preferred the phrase, "not too pedantic". After all, how exact should an inertial frame be defined in a paper that uses a rail car as an example of one?

Fredrik · Jul 18, 2009

Al68 said:

I think that was his point, that in Newtonian physics, the postulates were contradictory. His derivations were the result of modifying Newtonian physics to be consistent with the postulates.

And one of my points is that if you start with a set of assumptions and end up with a contradiction, you have only proved that your theory is inconsistent. You certainly haven't derived a new theory.

That's why I'm saying that the only way to make sense of the "derivation" is to interpret the "postulates" as ill-defined statements, and the "derivation" as finding out which of the corresponding well-defined statements are consistent with the other assumptions we want to make.

Al68 said:

I think he preferred the phrase, "not too pedantic". After all, how exact should an inertial frame be defined in a paper that uses a rail car as an example of one?

I don't have a problem with the fact that the first paper ever written on SR is "not too pedantic". I just don't think that's a good reason for us to do the same. It's not even too difficult to talk about SR in a way that makes sense, so we have no excuse. I think it's absurd that professors still give students the impression that SR is defined by Einstein's postulates, and that the rest of the theory can be "derived" from the "postulates". You really can't derive anything from them, and they can't be taken as the definition of SR.

dx · Jul 18, 2009

It is possible to have well defined postulates from which the linearity of Lorentz transformations follows.

Let V be a four dimensional vector space. An inertial frame is a map ψ from the set of events into V which satisfies the following postulates:

1. The world lines of free particles are straight lines.
2. Clock rates are uniform, i.e. intervals measured by clocks agree with the linear structure of V.

That there exist such frames is an experimental question and has nothing to do with the mathematical structure of SR. The ideas of 'free particle' and 'clock' are primitive notions which are not defined within the theory.

Given two such inertial frames ψ and ψ', it is easy to see that the transformation between them given by ψ' ∘ ψ^-1 : V → V is a linear transformation.

(Note that the Lorentz behavior of clocks represented by the Lorentz metric dt² - dx² - dy² - dz² is not the only one compatible with these postulates)
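
A rough numerical sketch of that last remark, in one space dimension with c = 1. The helper names `lorentz`, `galilei`, `is_additive` and `interval` are invented for this illustration. Both kinds of boost pass an additivity check at sample points, but only the Lorentz boost preserves the interval t² - x².

```python
import math

def lorentz(v):
    """Lorentz boost along x with c = 1, acting on an event e = (t, x)."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    return lambda e: (g * (e[0] - v * e[1]), g * (e[1] - v * e[0]))

def galilei(v):
    """Galilean boost along x, acting on an event e = (t, x)."""
    return lambda e: (e[0], e[1] - v * e[0])

def is_additive(f, a, b, tol=1e-12):
    """Check f(a + b) == f(a) + f(b) for one pair of events (a spot check,
    not a proof of linearity)."""
    s = (a[0] + b[0], a[1] + b[1])
    fa, fb, fs = f(a), f(b), f(s)
    return abs(fs[0] - fa[0] - fb[0]) < tol and abs(fs[1] - fa[1] - fb[1]) < tol

def interval(e):
    """The 1+1 dimensional Lorentz interval t^2 - x^2."""
    return e[0] ** 2 - e[1] ** 2

a, b = (1.0, 0.25), (2.0, -0.5)
event = (2.0, 1.0)
both_additive = is_additive(lorentz(0.6), a, b) and is_additive(galilei(0.6), a, b)
lorentz_keeps_interval = abs(interval(lorentz(0.6)(event)) - interval(event)) < 1e-9
galilei_keeps_interval = abs(interval(galilei(0.6)(event)) - interval(event)) < 1e-9
```

Here `both_additive` and `lorentz_keeps_interval` come out true and `galilei_keeps_interval` comes out false, so linearity alone does not single out the Lorentz metric.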

meopemuk · Jul 18, 2009

dx said:

1. The world lines of free particles are straight lines.
2. Clock rates are uniform, i.e. intervals measured by clocks agree with the linear structure of V.

These two postulates apply only to free particles. So, you need also a third postulate:

3. Events with interacting particles (e.g., their worldlines) transform by the same formulas as events with free particles.

Then, according to the Currie-Jordan-Sudarshan theorem, your theory must be interaction-free.

Al68 · Jul 18, 2009

Fredrik said:

And one of my points is that if you start with a set of assumptions and end up with a contradiction, you have only proved that your theory is inconsistent. You certainly haven't derived a new theory.

That's why I'm saying that the only way to make sense of the "derivation" is to interpret the "postulates" as ill-defined statements, and the "derivation" as finding out which of the corresponding well-defined statements are consistent with the other assumptions we want to make.

I don't have a problem with the fact that the first paper ever written on SR is "not too pedantic". I just don't think that's a good reason for us do the same. It's not even too difficult to talk about SR in a way that makes sense, so we have no excuse. I think it's absurd that professors still give students the impression that SR is defined by Einstein's postulates, and that the rest of the theory can be "derived" from the "postulates". You really can't derive anything from them, and they can't be taken as the definition of SR.

I haven't heard of the postulates being the definition of SR, certainly they aren't. But they marked the historical transition from Newtonian physics to SR.

As far as the postulates being contradictory in Newtonian physics, I think the 1905 paper showed that Newtonian physics was the one of the three assumptions that needed to be modified, not the other two (the postulates).

dx · Jul 19, 2009

meopemuk said:

These two postulates apply only to free particles. So, you need also a third postulate:

3. Events with interacting particles (e.g., their worldlines) transform by the same formulas as events with free particles.

Then, according to the Currie-Jordan-Sudarshan theorem, your theory must be interaction-free.

Why is this postulate needed? My first two postulates are enough to prove the linearity of transformations between inertial frames.

Also, I said that an inertial frame is a map from events to V, not a map from "free particle worldline events" to V. It doesn't matter what type of event it is. All events transform by the same formulas by definition.

meopemuk · Jul 19, 2009

dx said:

It doesn't matter what type of event it is. All events transform by the same formulas by definition.

I am not sure about that. This is your postulate (or definition), and I would like to know if you have any evidence to support it?

CJS theorem provides an example in which points (events) on worldlines of interacting and non-interacting particles transform by different formulas.

dx · Jul 19, 2009

meopemuk said:

I am not sure about that. This is your postulate (or definition), and I would like to know if you have any evidence to support it?

You mean experimental evidence? As far as I know, there are no mainstream theories where different types of events transform differently. The best theory we have about spacetime is general relativity, where spacetime is a manifold, and a coordinate system is a function from some patch of spacetime into R⁴. Given two coordinate systems, i.e. two functions φ, φ' : M → R⁴, it is easy to see that the transformation of coordinates for any event E from φ to φ' is given by φ' ∘ φ^-1.

meopemuk · Jul 19, 2009

dx said:

You mean experimental evidence? As far as I know, there are no mainstream theories where different types of events transform differently. The best theory we have about spacetime is general relativity, where spacetime is a manifold, and a coordinate system is a function from some patch of spacetime into R⁴. Given two coordinate systems, i.e. two functions φ, φ' : M → R⁴, it is easy to see that the transformation of coordinates for any event E from φ to φ' is given by φ' ∘ φ^-1.

Yes, I agree that both special and general relativity theories are based on your (rarely mentioned, but important) postulate that time-position transformations of events do not depend on the physical nature of the events and on interactions acting in the observed system. So, you are saying that the validity of your postulate is justified a posteriori by the fact that both SR and GR agree well with experiments? This leaves however the possibility that the postulate is not exactly true, and that there is some dependence of the time-position transformations on interactions between particles. If the effect is small, then it wouldn't contradict existing experiments.

Another important (though not appreciated) point is the logical consistency. Suppose that we accepted your postulate and assumed that time-position transformations between different moving frames do not depend on interactions. Then Lorentz transformations are guaranteed to be linear and universal, and all events can be represented as points in the Minkowski space-time. Suppose also that we constructed a dynamical interacting theory based on this principle. The Maxwell-Lorentz electrodynamics is a good example. Then it would be of interest to verify within our theory whether the initial postulate holds.

For example, we can consider a system of two interacting charges (e.g. an electron and a proton), calculate their trajectories, and find space-time coordinates of some localized event, e.g., when the two particles collide. Next, in our theory, we could repeat the same calculation in a moving reference frame. So, we would have space-time coordinates of the same event (collision of the two particles) in two reference frames. Will they be connected by Lorentz formulas?

If the answer is "yes", then our theory is logically consistent (the initial postulate has been confirmed by a direct dynamical calculation). However, can we be sure that the answer is "yes"? As far as I know, nobody has performed this kind of calculation in Maxwell-Lorentz electrodynamics (if you think I missed some relevant works, I would appreciate the reference). Moreover, I have a strong suspicion that a direct calculation of this sort will *not* yield the expected result.

Fredrik · Jul 19, 2009

dx said:

It is possible to have well defined postulates from which the linearity of Lorentz transformations follows.

Let V be a four dimensional vector space. An inertial frame is a map ψ from the set of events into V which satisfies the following postulates:

1. The world lines of free particles are straight lines.
2. Clock rates are uniform, i.e. intervals measured by clocks agree with the linear structure of V.

I agree with your opening statement, but I would drop your second postulate and add stuff to the first. I'd choose V=\mathbb R^4, and change #1 to

1. Each transition function* corresponding to two inertial frames is a smooth** bijection that takes straight lines to straight lines.

*) See my posts earlier in this thread for a definition.
**) All its partial derivatives up to arbitrary order exist.

Let T be a transition function. The axiom guarantees that it can be Taylor expanded.

T(x)=T(0)+x^\mu\partial_\mu T(0)+\frac 1 2 x^\mu x^\nu\partial_\mu\partial_\nu T(0)+\cdots

Let's call a transition function with T(0)=0 a "Lorentz transformation". (This will be our definition of a Lorentz transformation for the rest of this post). Note that a Lorentz transformation defined this way takes straight lines through the origin to straight lines through the origin.

Now let T be a Lorentz transformation, and let x and y be two points on a straight line through the origin. We must have y=kx. Postulate #1 and our definition of a Lorentz transformation imply that we also have T(y)=k'T(x), but T(y)=T(kx), so we have

T(kx)=k'T(x)

for all x. Let's Taylor expand both sides.

kx^\mu\partial_\mu T(0)+\frac 1 2 k^2 x^\mu x^\nu\partial_\mu\partial_\nu T(0)+\cdots=k'\Big(x^\mu\partial_\mu T(0)+\frac 1 2 x^\mu x^\nu\partial_\mu\partial_\nu T(0)+\cdots\Big)

These two expressions must match term by term, and that's only possible if k'=k and all the higher order terms are =0. So any "Lorentz transformation" must be linear.

I don't have any objections to this sort of argument, but one could point out that the axiom is extremely strong. I mean, we're assuming that transition functions take straight lines to straight lines, so it's not exactly a surprise that they turn out to be linear. So one could argue that we might as well have started by requiring linearity. The counter argument to that is that this approach is more intuitive and "natural" than the abstract requirement of linearity. It only expresses the idea that any inertial observer should be able to describe any other inertial observer as moving with constant velocity.
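
The term-by-term comparison in the argument can be spelled out as follows. This is only a sketch: the symbol T_n and the two assumptions marked in the comments (that k' is a constant that does not depend on x, and that the first-order term does not vanish) are additions for illustration and are not stated in the argument itself. If k' is allowed to vary with x, the degree-by-degree comparison no longer goes through.

```latex
% T_n denotes the degree-n part of the Taylor series of T around 0 (with T(0)=0):
T(x)=\sum_{n\ge 1}T_n(x),\qquad
T_n(x)=\frac{1}{n!}\,x^{\mu_1}\cdots x^{\mu_n}\,
       \partial_{\mu_1}\cdots\partial_{\mu_n}T(0),\qquad
T_n(kx)=k^n\,T_n(x).

% Insert this into T(kx)=k'T(x):
\sum_{n\ge 1}k^n\,T_n(x)=k'\sum_{n\ge 1}T_n(x).

% ASSUMPTION: k' does not depend on x. Then both sides are power series in x
% and can be compared degree by degree:
(k^n-k')\,T_n(x)=0\quad\text{for all } n\ge 1.

% ASSUMPTION: T_1\neq 0. Then n=1 gives k'=k, and for n\ge 2:
(k^n-k)\,T_n(x)=0\ \Longrightarrow\ T_n=0
\quad\text{whenever } k\notin\{-1,0,1\}.
```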

facenian · Jul 19, 2009

Fredrik said:

I don't have any objections to this sort of argument, but one could point out that the axiom is extremely strong. I mean, we're assuming that transition functions take straight lines to straight lines, so it's not exactly a surprise that they turn out to be linear. So one could argue that we might as well have started by requiring linearity. The counter argument to that is that this approach is more intuitive and "natural" than the abstract requirement of linearity. It only expresses the idea that any inertial observer should be able to describe any other inertial observer as moving with constant velocity.

Fredrik, this line of thought is what I was referring to from the beginning. I think what you call natural and intuitive may be translated as "physical", e.g., physical reasons instead of purely mathematical assumptions with no connection to physical reality.
However, I do have some concerns about your derivation, because the transformation I posted in #16 of this thread is not linear and seems to be a counterexample to your demonstration.
According to other posts in this thread the "homogeneity" hypothesis seems to be necessary, so there must be something wrong in your derivation, although I don't know what.

dx · Jul 19, 2009

Fredrik said:

I agree with your opening statement, but I would drop your second postulate and add stuff to the first. I'd choose V=\mathbb R^4, and change #1 to

1. Each transition function* corresponding to two inertial frames is a smooth** bijection that takes straight lines to straight lines.

Projective transformations are smooth and take straight lines to straight lines, but they are not linear.
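
A small illustration of this remark, in two dimensions and with exact rational arithmetic. The particular map p → p / (1 + c·p) and the names `projective` and `collinear` are choices made for this sketch. It is not the transformation from #16, which is not reproduced here.

```python
from fractions import Fraction as F

def projective(p, c=(F(1, 2), F(0))):
    """Map p -> p / (1 + c.p). It fixes the origin and is only defined
    where 1 + c.p != 0, so it is not a bijection of the whole plane."""
    denom = 1 + c[0] * p[0] + c[1] * p[1]
    return (p[0] / denom, p[1] / denom)

def collinear(p, q, r):
    """True if the three points lie on one straight line (zero cross product)."""
    return (q[0] - p[0]) * (r[1] - p[1]) == (q[1] - p[1]) * (r[0] - p[0])

# Three points on the straight line y = x + 1 and their images.
line = [(F(0), F(1)), (F(1), F(2)), (F(2), F(3))]
images = [projective(p) for p in line]
still_straight = collinear(*images)

# Linearity would require T(2x) = 2 T(x); compare both sides for x = (1, 0).
t_of_double = projective((F(2), F(0)))
double_of_t = tuple(2 * a for a in projective((F(1), F(0))))
```

In this example the images of the three collinear points are again collinear, and the origin is mapped to itself, yet `t_of_double` is (1, 0) while `double_of_t` is (4/3, 0). The factor relating T(2x) to T(x) is 3/2 rather than 2, so the map is not linear.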

The linearity of Lorentz transformations has more to do with our idea of an inertial frame than it does with any specific property of the Lorentz transformation. Like I said before, once we characterize inertial frames by the two simple postulates from my previous post, it follows that any transformation between inertial frames must be linear. The Lorentz behavior of clocks is just one type that is compatible with these postulates, another being the Galilean/Newtonian.

DrGreg · Jul 19, 2009

facenian said:

However, I do have some concerns about your derivation, because the transformation I posted in #16 of this thread is not linear and seems to be a counterexample to your demonstration.
According to other posts in this thread the "homogeneity" hypothesis seems to be necessary, so there must be something wrong in your derivation, although I don't know what.

The projective transformation you quoted in #16 does not satisfy T(0) = 0, as Fredrik's argument assumes.

facenian · Jul 19, 2009

DrGreg said:

The projective transformation you quoted in #16 does not satisfy T(0) = 0, as Fredrik's argument assumes.

It does when b_i=0 for i=0,1,2,3
