Basic Lorentz transformation derivation

Click For Summary

Discussion Overview

The discussion centers on the derivation of the Lorentz transformations, exploring the mathematical and conceptual foundations of special relativity. Participants examine various approaches to deriving the transformations, the assumptions involved, and the implications of time dilation and length contraction.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant expresses frustration with the complexity of existing proofs of the Lorentz transformations and attempts their own derivation based on geometry and standard SR conditions.
  • Another participant suggests that deriving the Lorentz transformations from length contraction and time dilation is suspect, as these concepts are typically derived from the transformations themselves.
  • Some participants argue that the correctness of the Lorentz transformations can be demonstrated by showing that they preserve the speed of light in both frames, while others question whether this constitutes circular reasoning.
  • Concerns are raised about the clarity of notation and the assumptions made in the derivation, particularly regarding the treatment of the Lorentz factor and the synchronization of clocks.
  • Several participants reference external sources, including Wikipedia and academic papers, to support their points or clarify their understanding of the transformations.
  • There is a discussion about the symmetry of length contraction and time dilation, with one participant emphasizing that these effects are reciprocal between reference frames.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the validity of the proposed derivation or the assumptions made. Multiple competing views regarding the derivation and interpretation of the Lorentz transformations remain present throughout the discussion.

Contextual Notes

Participants highlight limitations in the derivations, including unclear notation, the dependence on specific assumptions, and the potential for circular reasoning in proving the transformations. The discussion also reflects varying interpretations of the role of light signals in the context of the transformations.

jeremyfiennes
Messages
323
Reaction score
17
The Lorentz transformations are mathematically simple. I had always imagined they could be easily derived. I however just found out from another PF thread that this is not so. Their originators Lorentz and Poincaré simply stated them without derivation. And the "proofs" I have seen to date have been complex and unconvincing. Since only simple geometry and the standard SR conditions (length contraction, time dilation and the constancy of the speed of light) are apparently involved, I attempted a derivation.
Consider single time and space dimensions. Let observer A see an event at point X=(xa,tx), Fig. 0‑1a. The question is: how does an observer B, moving at steady speed v relative to A, perceive the same event, Fig. 0‑1b?
photo.php

Each observer has a clock. There is also one at point X. Define the time origin t=0 as the instant when frames A and B coincide. Synchronize all three clocks via a signal from the mid-point (xa/2).
In frame A, an event photon takes time xa/c to reach the observer A, who then sees it occurring at:
(xa, tx+xa/c) (eq.1)
For observer B, frame A moves at steady speed v, meaning that its lengths are foreshortened by the Lorentz factor γ. Observer B's space point xb is then:
xb=(xavtx)/γ (eq.2)
where:
γ=1/√(1–β2); β=v/c (eq.3)
Clock X also moves for observer B, and for him runs slow by the factor γ. Including again the event photon travel time, B's time point tb:
tb = tx/γ + xb/c (eq.4)
Substituting and rearranging, and using 'prime' notation, B's points in terms of A's
x' = [(1+β)xvt]/g; t' = [(1–β)t + vx/c2]/γ (eq.5)
These expressions however differ radically from the standard:
x' = γ[xvt]; t' = γ[(tvx/c2] (eq.6)
So something has gone wrong somewhere. But what and where? I checked the derivations with a numerical example, but could find no inconsistencies.
 
Physics news on Phys.org
jeremyfiennes said:
And the "proofs" I have seen to date have been complex and unconvincing.
You don't need to derive it to prove it; whether you've derived it or you found it on a piece of paper in the gutter, you can check it for correctness. A convincing (note: "convincing" and "mathematically rigorous" are not always quite the same thing) demonstration of the correctness of the Lorentz transformation is fairly straightforward: show that the lines ##x\pm{ct}=0## transform to ##x'\pm{ct'}=0## and you have your proof that the Lorentz transforms preserve the speed of light in both frames.
 
Last edited:
jeremyfiennes said:
Since only simple geometry and the standard SR conditions (length contraction, time dilation and the constancy of the speed of light) are apparently involved, I attempted a derivation.
Going about it the derivation this way is vaguely suspect because both length contraction and time dilation are usually derived from the Lorentz transforms and not the other way around; it has to be possible to derive the transforms just from the postulate of the invariant speed of light.

However, you can use that postulate to come up with very convincing arguments for time dilation and length contraction without going through the Lorentz transforms so you can legitimately assume time dilation and length contraction in your derivation if you prefer. Apparently you do, because that's the approach that you've taken... But given that, what's wrong with the derivation at https://en.wikipedia.org/wiki/Deriv...rmations#Time_dilation_and_length_contraction?
 
Last edited:
I can't see your diagram - it appears to require a Facebook account to view. Use the upload button below the post editor to embed images.

Hopefully the diagram completes the explanation, because with just your maths I am completely unable to understand what you think you are doing. Changing notation half way through without saying what is what (is ##t'## supposed to replace ##t_a##, ##t_b## or ##t_x##?) doesn't help at all.
 
jeremyfiennes said:
For observer B, frame A moves at steady speed v, meaning that its lengths are foreshortened by the Lorentz factor γ.
Seems like you should derive the Lorentz factor, not just assume it. I find this approach, sort of like just listing the thing you want to prove as one of the assumptions.

I like this one best:
https://arxiv.org/abs/physics/0302045

And I find the geometrical concept of Bondi’s k-calculus approach appealing.
 
  • Like
Likes   Reactions: m4r35n357
Nugatory said:
show that the lines ##x\pm{ct}=0## transform to ##x'\pm{ct'}=0## and you have your proof that the Lorentz transforms preserve the speed of light in both frames.
Isn't that circle reasoning? You already assume c'=c in your equation. I'd say c=c' is a postulate, and only after assuming that postulate you show that your line is indeed preserved by Lorentz transformations.
 
Nugatory said:
Thanks. The "From physical principles. Time dilation and length contraction" is evidently the bit I want.
Observer F sees the point X as (x,t), Fig,(a). F' sees it as (x',t'), Fig,(b). But due to foreshortening, a length x' in frame F' corresponds to γx' in frame F, Fig.(c). Giving x=vt+γx', and not the Wiki value x=vt+x'/γ: And I don't see the time a light photon takes to get from the event to an observer considered.

lorentz-wiki.jpg
 

Attachments

  • lorentz-wiki.jpg
    lorentz-wiki.jpg
    13.5 KB · Views: 454
  • lorentz-wiki.jpg
    lorentz-wiki.jpg
    13.6 KB · Views: 926
  • lorentz-wiki.jpg
    lorentz-wiki.jpg
    13.6 KB · Views: 456
jeremyfiennes said:
Thanks. The "From physical principles. Time dilation and length contraction" is evidently the bit I want.
Observer F sees the point X as (x,t), Fig,(a). F' sees it as (x',t'), Fig,(b). But due to foreshortening, a length x' in frame F' corresponds to γx' in frame F, Fig.(c). Giving x=vt+γx', and not the Wiki value x=vt+x'/γ: And I don't see the time a light photon takes to get from the event to an observer considered.

View attachment 234849

Wiki is correct on that first point.

The time it takes a light signal (or a sound signal, or a telegram) to get to the origin of the coordinate system is irrelevant. The event could take place in a closed room from which no light escapes, but it would still have coodinates ##(t', x')##.

The "eye" on the y-axis is misleading. If I did those digrams I would have the eye at the point of the event. That's where the local observation is made or recorded.

PS Technically, the Lorentz Transformation is not about observers but about reference frames.

If you want to take light signals into account you'd have to do that for classical, Gallilean relativity as well. (And don't say that Newton thought the speed of light was infinite! He had a good estimate of the speed of light.)
 
  • #10
jeremyfiennes said:
Observer F sees the point X as (x,t), Fig,(a). F' sees it as (x',t'), Fig,(b). But due to foreshortening, a length x' in frame F' corresponds to γx' in frame F, Fig.(c).
Except that length contraction, like time dilation, is symmetrical: A distance the measures ##\Delta{L}## in frame F will measure ##\Delta{L}/\gamma## in in frame F' but also a distance the measures ##\Delta{L}## in frame F' will measure ##\Delta{L}/\gamma## in in frame F - not ##\gamma\Delta{L}##.

Relativity of simultaneity is at work here, in a rather unobvious way. Are the vertical lines representing the T axes the same lines in diagrams (a) and (b)? In frame F the distance between O and O' is ##vt##, and in frame F' the distance between O and O' is ##vt'##, but ##t## and ##t'## are not equal.

And I don't see the time a light photon takes to get from the event to an observer considered.
That's not needed in any derivation of the Lorentz transformations. Instead of having one observer receiving light signals from remote events and subtracting out the light travel time to assign a time coordinate to the remote event, imagine that you have multiple observers with synchronized clocks, all at rest relative relative to one another and strategically positioned so that one of them is present at each interesting event. They make timestamped notes of whatever happens where they are (no light travel time correction needed), and after the fact we gather up all their notes and work out what happened.
 
Last edited:
  • #11
Nugatory said:
You don't need to derive it to prove it."
Physics is based on 1) experimental results, 2) rational deduction. Length contraction and time dilation can be considered experimental results (muons, etc.). So it should be possible to derive the Lorentz transformations from these and logical deduction.
 
  • #12
jeremyfiennes said:
Physics is based on 1) experimental results, 2) rational deduction. Length contraction and time dilation can be considered experimental results (muons, etc.). So it should be possible to derive the Lorentz transformations from these and logical deduction.

Here's an idea. If an event happens at location ##x'## in the primed frame, that means (by definition) that the event happened at a length of ##x'## from the origin. You could imagine a measuring stick of proper length ##x'## protruding from the origin. The events happens at the end of that stick. That's what gives it the ##x'## coordinate, by definition.

What does that look like in the unprimed frame? First, the origin is at ##x = vt##. Second, the measuring sticking is contracted to ##x'/ \gamma##, so we have:

##x = vt + x'/ \gamma##

And, if you rearrange that you get the LT for the x'-coordinate.

For time, you could imagine a clock at the end of the measuring stick. The time on that clock records the time of the event, ##t'##.

Again, switch to what this look like in the unprimed frame. In particular, clocks in the primed frame:

A clock at origin of the primed frame differs from the clock at the event by ##vx'/c^2##. The origin clock reads ##t' + vx'/c^2## at the time of the event.

But, this clock was synchronised with the clocks in the unprimed frame at ##t = 0## and has been dilated ever since. So:

##t = \gamma (t' + vx'/c^2)##

And, in fact, that gives you directly the inverse transformation.

The rest can be done either by a symmetry argument, or by manipulating these two equations.
 
  • #13
jeremyfiennes said:
So it should be possible to derive the Lorentz transformations from these and logical deduction.
Of course it is (although there is an intermediate inductive step: getting from the experimental results to the postulate that light speed is constant so that you have something to deduce from). The point that I'm making is that derivation and validation are not the same thing. "Is this the right answer?" and "Can I improve on the way I came up with this answer?" are different questions to be answered separately.
 
  • #14
Nugatory said:
Of course it is (although there is an intermediate inductive step: getting from the experimental results to the postulate that light speed is constant so that you have something to deduce from). The point that I'm making is that derivation and validation are not the same thing. "Is this the right answer?" and "Can I improve on the way I came up with this answer?" are different questions to be answered separately.

Yes, and personally I wouldn't be satisfied simply to derive the LT from arguments of isotropy and homogeneity etc and not double check that time dilation, length contraction, RoS produce the same thing. The LT is way too important not to double check it!
 
  • Like
Likes   Reactions: jeremyfiennes
  • #15
jeremyfiennes said:
The Lorentz transformations are mathematically simple. I had always imagined they could be easily derived. I however just found out from another PF thread that this is not so. Their originators Lorentz and Poincaré simply stated them without derivation. And the "proofs" I have seen to date have been complex and unconvincing. Since only simple geometry and the standard SR conditions (length contraction, time dilation and the constancy of the speed of light) are apparently involved, I attempted a derivation.

I'm not sure I understood your derivation, but the Lorentz transformation can be thought of (although I'm not sure if this is the best way to think about it) as the combination of three effects:
  1. Length contraction
  2. Time dilation
  3. Relativity of simultaneity: clocks that are synchronized in one frame are not synchronized in another frame.
#3 is necessary if you want to have light have speed c in every inertial coordinate system. But starting with 1 & 2 is kind of unsatisfying, since you have to assume the mathematical form of length contraction and time dilation, when in other derivations, that is derivable.

Anyway, I can sketch how you can get the Lorentz transformations from these three.

Let's make a coordinate system discrete, to make it more concrete. In the moving frame, let's assume that we have a line of clocks at rest along the x-axis, with a distance ##L## between them. The location of clock number ##j## is just:

##x'_j = j L##

In the stationary frame, there are two differences: the length between clocks is shrunk to ##\frac{L}{\gamma}##, and the clocks are moving at speed ##v##. So in this frame, the location of clock number ##j## is:

##x_j = \frac{jL}{\gamma} + v t##

What about the time on clock ##j##? Let ##\Delta T_j## be the offset of clock number ##j## at time ##t=0## (according to the stationary frame). By time dilation, each clock is running slower than a stationary clock by a factor of ##\gamma##. So the time on clock ##j## is given by:

##t'_j = \frac{t}{\gamma} + \Delta T_j##

We're almost done, except for figuring out the offset, ##\Delta T_j##. Let's assume that clock number 0 is started at ##t=0##, so the offset is 0. For ##j > 0##, let's assume that at ##t=0##, a light signal is sent from clock 0 to clock ##j##. We can compute the time that it arrives in the two different frames.

In the moving frame, the moving observers measure the distance between clock number 0 and clock number ##j## is ##j L##. So if clocks are synchronized so that light has speed ##c##, then the signal arrives when:

##t'_j = \frac{j L}{c}##

In the stationary frame, the distance between the clocks is contracted, to just ##\frac{j L}{\gamma}##. Also, in this frame, clock number ##j## is moving away from the light signal, so it takes longer for the light signal to reach clock number ##j##. That time is given by:

##t = \frac{jL}{\gamma (c-v)}##

Putting the facts about ##t## and ##t'_j## together:
  • ##t = \frac{jL}{\gamma (c-v)}##
  • ##t'_j = \frac{jL}{c}##
  • ##t'_j = \frac{t}{\gamma} + \Delta T_j##
So we conclude:

##\frac{jL}{c} = \frac{1}{\gamma} \cdot \frac{jL}{\gamma (c-v)} + \Delta T_j##

##\Delta T_j = -jL (\frac{1}{\gamma^2 (c-v)} - \frac{1}{c}) = -jL (\frac{1-\frac{v^2}{c^2}}{c-v} - \frac{1}{c}) = -\frac{jL v}{c^2}##
(using ##\frac{1}{\gamma^2} = 1-\frac{v^2}{c^2}##)

So we have our transform:

##x'_j = j L##
##t'_j = \frac{t}{\gamma} - \frac{jL v}{c^2}##

To get rid of ##L##, remember that ##x_j = \frac{j L}{\gamma} + vt##, so ##jL = \gamma(x_j - v t)##. So in terms of ##x_j##:

##x'_j = \gamma (x_j - vt)##
##t'_j = \frac{t}{\gamma} - \gamma (\frac{x_j v}{c^2} - \frac{v^2}{c^2} t)##
##= \gamma (t (\frac{v^2}{c^2} + \frac{1}{\gamma^2}) - \frac{v x_j}{c^2}) ##
##= \gamma (t - \frac{v x_j}{c^2})##
(again using ##\frac{1}{\gamma^2} = 1-\frac{v^2}{c^2}##)
 
  • #16
Ibix said:
I can't see your diagram. With just your maths I am completely unable to understand what you think you are doing. Changing notation half way through without saying what it is supposed to replace doesn't help at all.
Sorry, I was trying to use the "image" icon. Ref the notation, I chose ta and tb to differentiate from tx, in fact not necessary. Correcting for these, my original post becomes:
"Consider a universe comprising single time and space dimensions, and two observers A and B moving at steady relative speed v. Each observer has a clock, and there is also one at X. Define the time origin t=0 as the moment when frames A and B coincide. Synchronize all three clocks via a signal from the mid-point (x/2) at this instant.
lorentz.jpg

Consider an event (x,tx) occurring at point X in frame A, Fig. 0‑1a. tx is the time on the clock at X. The question is: what is observer B's perception of this event in terms of A's, Fig. 0‑1b?
In frame A, an event photon takes time x/c to reach observer A, who sees it occurring at:
(x,t) where t = tx+x/c) ) (eq.1)
For observer B, frame A moves at speed v, meaning that its lengths are foreshortened by the Lorentz factor γ. Observer B then perceives the event as ocurring at distance x':
x' = (x vtx)/γ (eq.2)
where:
γ=1/√(1–β2); β=v/c (eq.3)
Relative to observer A's clock, B's runs slow by the factor γ. Including the photon travel time, B perceives the event occurring at time t':
t' = tx/γ + x'/c (eq.4)
Substituting and rearranging, B's points in terms of A's
x' = [(1+β)xvt]/γ; t' = [(1–β)t + vx/c2]/γ (eq.5)
Because these expressions differ radically from the standard:
x' = γ[xvt]; t' = γ[(tvx/c2] (eq.6)
something has gone wrong somewhere. But what and where?
 

Attachments

  • lorentz.jpg
    lorentz.jpg
    8.4 KB · Views: 553
  • #17
Dale said:
Seems like you should derive the Lorentz factor, not just assume it.
The Lorentz factor follows from Einstein's station and traveling observers thought experiement.
 
  • #18
jeremyfiennes said:
something has gone wrong somewhere. But what and where?
At the point where your derivation diverges from the one in the wikipedia section linked to above.
 
  • #19
stevendaryl said:
I'm not sure I understood your derivation,
It is based on 1) length contraction, time dilation and c constant as experimental facts; 2) geometrical reasoning. But this doesn't give the standard result, and I want to know why. I reposted my original derivation in standard notation.
 
  • #20
jeremyfiennes said:
In frame A, an event photon takes time x/c to reach observer A, who sees it occurring at:

No, no, no, no, no. The event takes place in a darkened room. There are no photons.

What you should do is go back to classical physics and introduce the finite speed of light and see all your equations fall apart. An event takes place at time ##t## regardless of who observes it, how and when.
 
  • #21
PeroK said:
No, no, no, no, no. The event takes place in a darkened room. There are no photons.

What you should do is go back to classical physics and introduce the finite speed of light and see all your equations fall apart. An event takes place at time ##t## regardless of who observes it, how and when.

For example, consider an object moving along the x-axis at speed ##v##. Each local observer sees it pass at speed ##v##. But, at time ##t## it is a distance ##vt## from the origin, so an observer at the origin sees that position event at ##t + vt/c##.

At time ##t + 1##, the object is at position ##vt + v##, and the observer at the origin sees that event at ##t + 1 + v(t+1)/c##.

So, for an observer at the origin we have ##\Delta x = v## and ##\Delta t = 1 + v/c##

So, the speed of that object to an observer at the origin is ##v/(1 + v/c) \ne v##.

In other words, if you factor in the time for different observers to receive light signals from events, then all of the equations of classical mechanics fall apart. You no longer have a well-defined speed of an object in an inertial reference frame.
 
  • #22
Nugatory said:
At the point where your derivation diverges from the one in the wikipedia section linked to above.
It diverges where Wiki gets x=vt+x'/γ and I get x=vt+γx'. Lengths are contracted in the moving F frame that F' sees (Fig.(b)). The F-frame correlate of x' (Fig.(c)) must therefore be greater than the F' value. Meaning that the Lorentz factor γ (always >1) must be in the numerator (where I put it) and not in the denominator (where Wiki puts it).
 
  • #23
jeremyfiennes said:
It diverges where Wiki gets x=vt+x'/γ and I get x=vt+γx'. Lengths are contracted in the moving F frame that F' sees (Fig.(b)). The F-frame correlate of x' (Fig.(c)) must therefore be greater than the F' value. Meaning that the Lorentz factor γ (always >1) must be in the numerator (where I put it) and not in the denominator (where Wiki puts it).

It depends on which frame you are in. ##x'## is a length in primed frame: the distance from the origin of the primed frame to the event. This distance is contracted when measured in the unprimed frame.

Again, imagine a stick of length ##x'## from the origin to the event. You want to dilate that stick in the unprimed frame to a length of ##\gamma x'##. That would be length dilation.
 
  • #24
jeremyfiennes said:
It diverges where Wiki gets x=vt+x'/γ and I get x=vt+γx'. Lengths are contracted in the moving F frame that F' sees (Fig.(b)). The F-frame correlate of x' (Fig.(c)) must therefore be greater than the F' value. Meaning that the Lorentz factor γ (always >1) must be in the numerator (where I put it) and not in the denominator (where Wiki puts it).
And you have it wrong. As we pointed out above, length contraction is symmetrical and you have fallen into a somewhat unobvious relativity of simultaneity trap.

You could take advantage of the fact that you know that the Lorentz transforms are correct even though you cannot (yet) derive them, and use them to transform all of the relevant events, and you'll likely see the error in your analysis.
 
Last edited:
  • #25
PeroK said:
The time it takes a light signal (or a sound signal, or a telegram) to get to the origin of the coordinate system is irrelevant.
We are talking about observers perceiving events in their reference frames. Distant events are perceived later than corresponding close events.
 
  • #26
jeremyfiennes said:
We are talking about observers perceiving events in their reference frames. Distant events are perceived later than corresponding close events.

No, no, no, no, no, no, no!

We are talking about events as measured in a reference frame.

The speed of signals has nothing to do with this. When the Titanic sank, no one in London knew until the next day. But, that doesn't mean that the ship sank a day later in a London reference frame.

And, certainly,no one in London actually saw it sink.
 
  • #27
PeroK said:
It depends on which frame you are in. x′x′x' is a length in primed frame: the distance from the origin of the primed frame to the event. This distance is contracted when measured in the unprimed frame.
This is the crux of it. Unprimed distances move, and are contracted, in the primed frame (Fig.(b)). And not in the unprimed frame as you maintain.
 
  • #28
PeroK said:
No, no, no, no, no, no, no!
Explain. Does light not have a finite velocity?
 
  • #29
jeremyfiennes said:
Explain. Does light not have a finite velocity?

It also has had. Einstein was not the first to discover that. Why light? Bats navigate by echo location. So, does SR for bats need to take the speed of sound into account?
 
  • #30
PeroK said:
The Lorentz Transformation is not about observers but about reference frames.
Is not a reference frame the viewpoint of an observer?
 

Similar threads

  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 101 ·
4
Replies
101
Views
7K
  • · Replies 24 ·
Replies
24
Views
3K
Replies
3
Views
1K
  • · Replies 54 ·
2
Replies
54
Views
4K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 22 ·
Replies
22
Views
3K
Replies
8
Views
1K