Derivation of Lorentz transformations

JT7 · Mar 12, 2010

It seems that the common approach to obtain the equations for the Lorentz transformations is to guess at its form and then, by considering four separate situations, determining the values for the constants. From these equations, things like time dilation and length contraction can be worked out. Now, my goal was to go the other way around: starting from time dilation and length contraction, arrive at the Lorentz equations.

Suppose that in the reference frame O, the reference frame O' is moving at a speed v in the x-direction, with their origins coinciding at t = 0. An event E occurs at (x, y, z, t) in the O frame. It's straightforward to show that y' = y and z' = z. Next I considered x'. In the O frame, the distance between O' and E is x - vt. Because the ruler O' uses is shortened by a factor of γ, she will then measure the distance x - vt as being greater and O measures it, by a factor of γ. Thus, x' = γ(x - vt) (if this approach is incorrect, let me know!).

However, I'm having trouble with t'. I know that t' = γ(t - vx/c2). I assume that the -vx/c2 term comes from that, because O' believes that she's at rest, when the light emitted from the event reaches her, she doesn't treat herself as moving into the light, and thus there's a discrepancy as to how long before the light reaches O' did the event actually occur. Unfortunately, I can't arrive algebraically at this term. Finally, the gamma factor. I assume this comes from time dilation, but the wouldn't O' 's clock be running slower? So wouldn't the term have to be 1/γ (because less time transpires on her clock)? Can someone please tell me how to get the final Lorentz term this way? I know there are probably other, easier routes, but for personal reasons I would like to know how to do it this way. Thanks a lot!

JesseM · Mar 13, 2010

Remember that you have to include the possibility that clocks which are synchronized in one frame will be out-of-sync in another (relativity of simultaneity). In post #14 of this thread I derived the Lorentz transformation using an approach like the one you're suggesting, by putting in variables for length contraction, time dilation and the amount by which two clocks would be out-of-sync, and then solving for these variables:

https://www.physicsforums.com/showthread.php?t=180578

JT7 · Mar 13, 2010

Hey thanks a lot for the reply. I read through it, and I seem to understand it (later tonight I'll go through it more thoroughly). Some questions: first, if instead of considering there to be out-of-sync clocks in the O' frame, would taking into account the fact that, in the O frame, light hitting O' will take less time than O' thinks it should (because she's moving into the beam of light in the O' frame) be the same thing (as that, I believe, is the cause of loss of simultaneity)? Because if that's true, then (for personal tastes; I believe this to be more pleasing a route) how would you be able to account for this difference? It should turn out to be vx/c^2, but I can't seem to get the algebra to work. I know that the many clocks method is similar, but I would like to be able to do it this way too. Thanks!

JM · Mar 13, 2010

JT7, Look at Einsteins original derivation in his 1905 paper 'On the electrodynamics of moving bodies' ( in the Dover book The Principle of Relativity). There is no guesswork and the use of the Postulate of Constant Light Speed is shown.
JM

JesseM · Mar 13, 2010

JT7 said:

Hey thanks a lot for the reply. I read through it, and I seem to understand it (later tonight I'll go through it more thoroughly). Some questions: first, if instead of considering there to be out-of-sync clocks in the O' frame, would taking into account the fact that, in the O frame, light hitting O' will take less time than O' thinks it should (because she's moving into the beam of light in the O' frame) be the same thing (as that, I believe, is the cause of loss of simultaneity)?

When you say "take less time", time between what two events? Is it between the event of the light being emitted from one end of the ruler and the event of the light hitting the observer O' at the other end? If so how does the observer O' measure this time? Are you saying that instead of O' having synchronized clocks at either end of the ruler and measuring the time of each event locally, O' could instead just note the length L of the ruler in her frame, and then based on the assumption that she is at rest and that the light moves at c, conclude that the time between the events must have been L/c? That would be fine.

By the way, I realized in retrospect that what I proved there was not quite the same as what you were asking--I showed that starting from the two basic postulates of SR, one could derive the familiar formulas for length contraction and time dilation. But you're asking how, given length contraction and time dilation, we can the derive the Lorentz transformation equations. OK, suppose an event E happens at (x₁,t₁) in the O frame. Since the ruler used to mark position in the O' frame is moving at speed v, any point on the O' ruler will move a distance of vt₁ between t=0 and t=t₁. So, the marking M on the O' ruler that's at position x=x₁ at time t=t₁ in the O frame must have been at position x₁ - vt₁ at time t=0 in the O frame. And we know that at time t=0 the marking x'=0 on the O' ruler coincided with position x=0 in the O frame, so in the O frame the distance between M and the x'=0 mark on the O' ruler must be x₁ - vt₁. But since we know that in the O frame the O' ruler is shrunk by a factor of sqrt(1 - v²/c²), that means that in the O' frame the distance between M and x'=0 is larger than this by a factor of 1/sqrt(1 - v²/c²) = gamma, so the mark M must have a reading of gamma*(x₁ - vt₁) on the O' ruler.

Now let's say the observer O' sits at position x'=0 on the O' ruler, and at time t=0 in the O frame this was at x=0, after which she moved in the positive x direction with speed v. Suppose that when the event E happens at x=x₁ and t=t₁ in the O frame, it sends a beam of light towards O'. In the O frame, O' will be at position x=vt₁ at time t₁, so the initial distance between O' and the event E is x₁ - vt₁ if E happened further in the +x direction than O' was at that point, or vt₁ - x₁ if O' was further in the +x direction at that time. If E happened further in the +x direction, then O' is moving at v in the +x direction while the light is moving at c in the -x direction, so the distance between them is shrinking at a speed of (c + v), meaning the light will take a time of (x₁ - vt₁)/(c + v) to reach O' after being emitted at E. And since it was emitted at t₁, the time t that it reaches O' as seen in the O frame will be:

[t₁] + [(x₁ - vt₁)/(c + v)]
= [(ct₁ + vt₁)/(c + v)] + [(x₁ - vt₁)/(c + v)]
= (x₁ + ct₁)/(c + v)
= (c - v)*(x₁ + ct₁)/(c² - v²)
= (c - v)*(x₁ + ct₁)/((c²)*(1 - v²/c²))

On the other hand, if O' was further in the +x direction, then O' is moving at v in the +x direction while the light is moving at c in the +x direction too trying to catch up with O' from behind, so in this case the distance between them is only shrinking at a speed of (c - v), so the light will take a time of (vt₁ - x₁)/(c - v) to reach O' after being emitted at E. And again, it was emitted at t₁, so the time t it reaches O' as seen in the O frame will be:

[t₁] + [(vt₁ - x₁)/(c - v)]
= [(ct₁ - vt₁)/(c - v)] + [(vt₁ - x₁)/(c - v)]
= (ct₁ - x₁)/(c - v)
= (c + v)*(ct₁ - x₁)/(c² - v²)
= (c + v)*(ct₁ - x₁)/((c²)*(1 - v²/c²))

Now the clock of O' is slowed down by a factor of sqrt(1 - v²/c²) in the O frame, and her clock read t'=0 at time t=0 in the O frame, so at any later time t in the O frame her clock reads t*sqrt(c² - v²)/c. So in the first scenario where E was further in the +x direction, the light reached O' when her own clock read:

sqrt(1 - v²/c²)*(c - v)*(x₁ + ct₁)/((c²)*(1 - v²/c²))
= (c - v)*(x₁ + ct₁)/((c²)*sqrt(1 - v²/c²))

In the second scenario where O' was further in the +x direction, the light reached O' when her own clock read:

sqrt(1 - v²/c²)*(c + v)*(ct₁ - x₁)/((c²)*(1 - v²/c²))
= (c + v)*(ct₁ - x₁)/((c²)*sqrt(1 - v²/c²))

In both cases, if the event E happened at the x' = gamma*(x₁ - vt₁) mark on her ruler, she must subtract a time of gamma*(x₁ - vt₁)/c from the time she observed light from E to get the actual time of E in her frame (that's if the mark is at a positive value of her x' ruler, if it's at a negative value then she would subtract a time of -gamma*(x₁ - vt₁)/c. It will be at a positive value in the first scenario where the event E happened further in the +x direction, and at a negative value in the second scenario where O' was further in the +x direction when E occurred). And gamma*(x₁ - vt₁)/c = c*(x₁ - vt₁)/(c²*sqrt(1 - v²/c²)). So if we subtract this from the observed time in the first scenario to get the actual time of E in the O' frame, we get:

[(c - v)*(x₁ + ct₁)/((c²)*sqrt(1 - v²/c²))] - [c*(x₁ - vt₁)/(c²*sqrt(1 - v²/c²))] =
(cx₁ + c²*t₁ - vx₁ - cvt₁ - cx₁ + cvt₁)/(c²*sqrt(1 - v²/c²)) =
(c²*t₁ - vx₁)/(c²*sqrt(1 - v²/c²)) =
(t₁ - vx₁/c²)/sqrt(1 - v²/c²)

And if we subtract -c*(x₁ - vt₁)/(c²*sqrt(1 - v²/c²)) from the observed time in the second scenario to get the actual time of E in the O' frame, we get:

[(c + v)*(ct₁ - x₁)/((c²)*sqrt(1 - v²/c²))] - [-c*(x₁ - vt₁)/(c²*sqrt(1 - v²/c²))] =
(c²*t₁ - cx₁ + cvt₁ - vx₁ + cx₁ - cvt₁)/(c²*sqrt(1 - v²/c²)) =
(c²*t₁ - vx₁)/(c²*sqrt(1 - v²/c²)) =
(t₁ - vx₁/c²)/sqrt(1 - v²/c²)

So, in both scenarios we find that the time of E in the O' frame is gamma*(t₁ - vx₁/c²).

JesseM · Mar 14, 2010

Incidentally, there was a lot of algebra in the above derivation, but it does become easier if you think in terms of synchronized clocks rather than in terms of a light signal sent to O'. I showed in the derivation on the other thread that:

This means that if there is another clock at the center of the moving ruler, and it reads a time of 0 years at the moment it's next to the light bulb and the light bulb turns on, then in my frame the clock at the back end will read TLv/(c^2 - v^2) at the moment the bulb is turned on, while the clock at the front end will read -TLv/(c^2 - v^2) at the moment the bulb is turned on.

This was for a ruler of length 2L as seen in the observer's frame (not the ruler's own rest frame), so this implies that the difference in time between clocks at either end of a ruler of length L would be TLv/(c^2 - v^2), with the clock at the back being ahead by this much. Then I also derived the fact that the time dilation factor T would be equal to sqrt(1 - v^2/c^2). So, plugging that in, clocks at either end of a ruler of length L will be out-of-sync by sqrt(1 - v^2/c^2)*Lv/(c^2 - v^2) = Lv/(c^2*sqrt(1 - v^2/c^2)) = gamma*Lv/c^2. And the length contraction equation implies that a ruler of length L as seen by the observer who sees it moving at v will have a greater length L' = L/sqrt(1 - v^2/c^2) in its own rest frame, so that means if two clocks are synchronized at either end of a ruler of length L' in their rest frame, in the frame of an observer who sees them moving at speed v the clock at the back end will be ahead of the clock at the front end by vL'/c^2.

So suppose we have some event E that occurs at x₁ and t₁ in the frame of O, and that frame O has a ruler of length x₁ with one end at x=0 and the other at x=x₁, with two synchronized clocks at either end. So, the end of this ruler at x=x₁ is at the position of E when it happens, and the clock there reads t=t₁. Now consider how things must look in the the frame of O', who sees this ruler moving at speed v in the -x' direction. If x₁ is a positive number, then x₁ is the length of the ruler in its own frame, and the end of the ruler at x=x₁ must be the "back end" as seen in O', so if the clock at this end shows a time of t₁ the clock the the "front end", x=0, must be behind by vx₁/c^2 (using the equation at the end of the previous paragraph), showing a time of t₁ - vx₁/c^2. On the other hand, if x₁ is a negative number then the length of the ruler in its own frame is -x₁, and the end of the ruler at x=x₁ must be the "front end", so if the clock at this end shows a time of t₁ the clock at the "back end", x=0, must be ahead by v*(-x₁)/c^2, showing a time of t₁ + v*(-x₁)/c^2 = t₁ - vx₁/c^2. So either way, in the O' frame at the moment the event E is happening the clock at the spatial origin of O reads a time of t₁ - vx₁/c^2.

We also know that the clocks at the origins of each frame both read 0 at time t=0 and t'=0 in each frame, and that in frame O' the clock at the origin of O is running slow by a factor of 1/gamma, so the at the moment the clock at the origin of O reads t=t₁ - vx₁/c^2, the time in O' must be t'=gamma*(t₁ - vx₁/c^2). And we know that the event of the clock at the origin of O shows this reading at the same time as event E in the O' frame! So, the event E must also have a time coordinate of t'=gamma*(t₁ - vx₁/c^2) in the O' frame.

JT7 · Mar 14, 2010

Thanks for the detailed response, I understand now. Cheers!

Derivation of Lorentz transformations

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Undergrad Why is gravity a fictitious force?

Undergrad Relativistic Space Travel: Optimizing Proper Time [Project Hail Mary]

Undergrad KE of rotating disc

Undergrad Why is the Lorentz Force always perpendicular to velocity?

Graduate How valid is the Block Universe theory?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect