Deriving Lorentz transformations

Click For Summary
SUMMARY

The discussion centers on the symmetry of relative speeds in the context of Lorentz transformations in special relativity. Participants argue that if one observer measures an object moving at velocity v, the object must measure the observer's velocity as equal due to the symmetry principle. The conversation references specific assumptions necessary for deriving Lorentz transformations, including spatial isotropy and the behavior of velocity boost transformations. The conclusion emphasizes that the inverse transformation's parameter must be the negative of the original, reinforcing the symmetry of relative motion.

PREREQUISITES
  • Understanding of Lorentz transformations in special relativity
  • Familiarity with the symmetry principle in physics
  • Knowledge of spatial isotropy and its implications
  • Basic concepts of velocity boost transformations
NEXT STEPS
  • Study the derivation of Lorentz transformations in detail
  • Explore the implications of spatial isotropy in physical theories
  • Learn about the symmetry principle and its applications in relativity
  • Investigate the mathematical framework of Lie groups and semigroups in physics
USEFUL FOR

Physicists, students of relativity, and anyone interested in the mathematical foundations of motion and symmetry in the universe.

  • #61
facenian said:
First if \mathbf{a}\neq 0 the transformation is still linear, but that never mind, I think you just missed that.
Consider the T defined by ##T(x)=x+(1,0,0,0)##. It's not linear, since
\begin{align*}
&T(2(1,1,1,1))=T(2,2,2,2)=(2,2,2,2)+(1,0,0,0)=(3,2,2,2),\\
&2T(1,1,1,1)=2((1,1,1,1)+(1,0,0,0))=2(2,1,1,1) =(4,2,2,2).
\end{align*}
facenian said:
On the other hand the problem here is not the existence of a linear transformation, the problem is to conclude that the transformation must be linear.
Right, and you can do that if you add the assumption that T(0)=0. Without that assumption, the correct conclusion is that T-T(0) is linear.
 
Physics news on Phys.org
  • #62
Fredrik said:
Consider the T defined by T(x)=x+(1,0,0,0)T(x)=x+(1,0,0,0). It's not linear, since
Yes,I'm sorry you're right, I was thinking of another kind of linearity. The linearity I was thinking about allows for "linear and not homogeneous"
I don't know if your approach is relevant to our discussion, may be is too advanced for me.
 
  • #63
facenian said:
I don't know if your approach is relevant to our discussion, may be is too advanced for me.
The theorem is relevant to any approach to SR that says "takes straight lines to straight lines" instead of "is linear". (Edit: The last sentence in this post explains why).

Unfortunately the proof is very long. I will only mention a few things from the notes I made a few years ago.

There's a version of this theorem that deals with affine spaces rather than vector spaces. It's called "the fundamental theorem of affine geometry". I studied a proof of that theorem in a book on affine spaces, and sort of "translated" it into a proof about vector spaces.

Let T be a permutation of ##\mathbb R^4## that takes straight lines to straight lines. This assumption is not sufficient to ensure that T is linear, but it is sufficient to ensure that T is affine, i.e. that there's a linear bijection ##\Lambda:\mathbb R^4\to\mathbb R^4## and a vector ##a## such that ##T(x)=\Lambda x+a## for all ##x\in\mathbb R^4##. The key steps of the proof are as follows:

1. Define ##\Lambda=T-T(0)## and prove that ##\Lambda## is a bijection that takes straight lines to straight lines.
2. Prove that for all x,y such that {x,y} is linearly independent, we have ##\Lambda(x+y)=\Lambda(x)+\Lambda(y)##.
3. Prove that for all x and all real numbers k, we have ##\Lambda(kx)=k\Lambda(x)##.
4. Prove that for all x,y such that {x,y} is linearly dependent, we have ##\Lambda(x+y)=\Lambda(x)+\Lambda(y)##.

Step 3 breaks up into a trivial case and a difficult case. If x=0, the proof is trivial. If x≠0, the strategy is to prove that there's a function ##f:\mathbb R\to\mathbb R## such that:
(a) ##\Lambda(kx)=f(k)\Lambda(x)##.
(b) f is bijective.
(c) f is a field homomorphism.

Statements (b) and (c) say that f is a field automorphism. This result is useful because it's possible to prove that the only field automorphism on ℝ is the identity map.

It's a trivial corollary of this very non-trivial theorem that a permutation that takes straight lines to straight lines and 0 to 0 is linear.
 
Last edited:
  • Like
Likes   Reactions: facenian
  • #64
facenian said:
The principle of Relativity only implies the conformal group [...]
That's incorrect. The principle of relativity implies the group of fractional-linear transformations.

If one also invokes the light principle, and applies it by finding the largest group that preserves the (vacuum) Maxwell eqns, one finds the conformal group.

Taken together, the common subgroup consists of linear transformations.

Ref: Fock & Kemmer, "Space, Time & Gravitation", 2nd ed. 1964.
 
  • Like
Likes   Reactions: facenian
  • #65
The 2D Lorentz transformation can be derived from the following mathematical assumptions, which all have physical motivations.

It can be proved that there is a unique one parameter family of transformations ##L_v: \Bbb R^2\to \Bbb R^2##, defined for all ##v\in (-1,1)##, satisfying:

1. For each fixed ##(x,t)\in \Bbb R^2##, the mapping ##H:(-1,1)\to \Bbb R^2## given by ##H(v)=L_v(x,t)## is continuous.
2. Each ##L_v## (##v\in (-1,1)##) is a bijection, and its inverse is ##L_w## for some ##w\in (-1,1)##.
3. For each ##v\in(-1,1)##: Each line ##(t,x)=(t_0,x_0)+s(1,a)## (##s\in \Bbb R##), with ##a\in (-1,1)## and ##t_0,x_0\in\Bbb R##, is mapped by ##L_v## to a line ##(t',x')=(t'_0,x'_0)+r(1,b)## (##r\in \Bbb R##), for some ##b\in (-1,1)## and ##t'_0,x'_0\in\Bbb R##.
4. ##L_v(0,0)=(0,0)##, for all ##v\in (-1,1)##.
5. ##L_0## is the identity transformation on ##\Bbb R^2##.
6. For each ##v\in (-1,1)##: ##L_v(1,v)=(t',0)##, for some ##t'\in \Bbb R##.
7. For each ##v\in (-1,1)##: ##L_v(1,1)=(r,\pm r)## and ##L_v(1,-1)=(s,\pm s)## for some ##r,s\in \Bbb R##.
8. For each ##v\in (-1,1)##: If ##L_v(t,x)=(t',x')##, for some ##t,x,t',x'\in \Bbb R##, then either ##L_{-v}(t,-x)=(t',x') ## or ##L_{-v}(t,-x)=(t',-x')##.

Each ##L_v## (##v\in (-1,1) ##) is then given by ##L_v(t,x)=(1/\sqrt{1-v^2})(t-vx, -vt+x)## for all ##(t,x)\in\Bbb R^2##.

One needs not assume that ##L_v## is linear, for this follows, which was proved by micromass, strangerep and Fredrik in an in an old thread
https://www.physicsforums.com/threa...formations-are-the-only-ones-possible.651640/
for the general case when all lines are mapped onto lines, and it can be proved that it suffices to look at "timelike" lines.

Physical motivations:

1. It is possible to accelerate an object continuously to any speed less than light speed, through intertial frames.
2. The two frames are interchangeable. A consequence of the special principle of relativity.
3. An an object which is not being acted upon by a force, and hence moves with uniform rectilinear (timelike) motion, w.r.t one frame, is moving as freely w.r.t. the other frame. A consequence of the special principle of relativity.
4. Just an arbitrary practical convention about how we put marks on our rods and synchronize our clocks.
5. If ##v=0##, the frames coincide.
6. The relative velocity of Frame 2 w.r.t Frame 1 is ##v##.
7. A consequence of the invariance of the light speed.
8. This is about spatial isotropy. The transformation should still be valid if we change the directions of the spatial axes. See an earlier post by strangerep in this thread.

But it becomes more complex in 4D spacetime...
 
Last edited:
  • #66
strangerep said:
That's incorrect. The principle of relativity implies the group of fractional-linear transformations.
What assumptions are made about the domains of these transformations in this approach? (Apologies if you have already told me. In my defense, the thread the Erland linked to above is 3 years old). If we assume (as I do) that these transformations are permutations of ##\mathbb R^4## (and that they take 0 to 0), we get the stronger result that they are linear.
 
  • #67
Fredrik said:
If we assume (as I do) that these transformations are permutations of ##\mathbb R^4## (and that they take 0 to 0), we get the stronger result that they are linear.
Yes.

Afaict, it's possible to make sense of the FL transformations if one restricts the domain to the interiors of the null bicone of each observer. But that's beyond the scope of this thread.
 
  • #68
In §3 of Einstein's first paper on special relativity,
,ON THE ELECTRODYNAMICS OF MOVING BODIES
By A. Einstein, June 30, 1905
https://www.fourmilab.ch/etexts/einstein/specrel/www/,
he deals with -v. I should appreciate it if someone could explain how Einstein deduced it.
He describes "Since the relations between x', y', z' and x, y, z do not contain the time t, the systems K and
img42.gif
are at rest with respect to one another, and it is clear that the transformation from K to
img42.gif
must be the identical transformation."
 
Last edited:
  • #69
strangerep said:
That's incorrect. The principle of relativity implies the group of fractional-linear transformations.
Yes, I was assuming that the PR includes the LP but that is not a good convention
Thanks for the reference.
 
  • #70
Fredrik said:
The theorem is relevant to any approach to SR that says "takes straight lines to straight lines" instead of "is linear". (Edit: The last sentence in this post explains why).
Very interesting and yes it is relevant to this discussion
 
  • #71
  1. Erland said:
    The 2D Lorentz transformation can be derived from the following mathematical assumptions, which all have physical motivations.
I think it's a good warm up to attack the real important case in 4D.
In that respect I think that strangerep and Fredrik gave the answers
 
  • #72
facenian said:
I think it's a good warm up to attack the real important case in 4D.
The main diificulty in going to 4D is my point 8, about isotropy. How to formultate this mathematically in a sufficiently simple manner?
 
  • #73
facenian said:
Very interesting post. However I would modify the demonstration because it has one flaw. The problem I see is the assertion that since straight lines must transform into straight lines, the transformation must be linear. I think this is a common mistake(for an example see, for instance, "The Special Theory of Relativity" by Aharoni)
The term “linear” in that thread is not the same as linearity in the algebraic sense: f(ax + by) = af(x) + bf(y). Linearity there means polynomial of degree one in the coordinates, i.e., solutions to the system of 2-order PDE’s
\frac{\partial^{2}F^{\sigma}}{\partial x^{\mu}\partial x^{\nu}} = 0 .
This should have been clear to you because the inhomogeneous relation F(x)=Ax + b is not linear in the algebraic sense, but it is a linear relation in the sense used in analytic geometry. Having said this, one can still speak of linear Poincare’ transformations: with every element (\Lambda , a) of the Poincare’ group, we can associate a 5 \times 5 matrix \Gamma defined by
\Gamma = \begin{pmatrix} \Lambda^{\mu}{}_{\nu} &amp; a^{\mu} \\<br /> 0_{4\times 4} &amp; 1<br /> \end{pmatrix} . \ \ \ \ \ \ \ \ (1)<br />
Then the multiplication law in the Poincare’ group (\Lambda_{2},a_{2})\cdot (\Lambda_{1},a_{1}) = (\Lambda_{2}\Lambda_{1} , a_{2} + \Lambda_{2}a_{1}) shows that the correspondence (\Lambda , a) \to \Gamma is an isomorphism of the Poincare’ group on the subgroup of GL(5,\mathbb{R}) consisting of matrices of the form (1), where \Lambda satisfies \Lambda^{T}\eta \Lambda = \eta and a is an arbitrary 4-vector. The Poincare group can, therefore, be identified with this matrix Lie group. And Minkowski spacetime M^{4} can be identified with the hyperplane x^{4}=1 in \mathbb{R}^{5} with coordinates (x^{0}, x^{1}, … , x^{4}). Then the linear operator (1) acts on this hyperplane as the corresponding Poincare transformation (\Lambda , a).

The principle of Relativity only implies the conformal group and this means
This is incorrect. The conformal group does not even act on Minkowski space M^{4}. It acts on the (conformally) compactified version of Minkowski space \bar{M}^{4}\cong S^{3}\times S^{1} / Z_{2}.

An argument like the one given in Landau and Lifshitz Volume 2 now proves \alpha=1

Did you read the whole post? I used argument similar to Landau’s argument to show \alpha = 1.

Finally it can be shown(see, for instance, "Gravitation and Cosmology" by S. Weingberg) that the only transformations that leave ds^2 invariant are linear tranformations.
The theorem you are talking about is the following
“The coordinate transformation from one Minkowski chart to another is a Poincare’ transformation” , or equivalently stated “The Poincare’ group is the maximal symmetry group of Minkowski spacetime”.
Again, you should read my posts in that thread, because I proved the infinitesimal version of this theorem in there.
 
  • #74
Erland said:
The main diificulty in going to 4D is my point 8, about isotropy. How to formultate this mathematically in a sufficiently simple manner?
Well, here is a sketch to get you started...

Pick an arbitrary (fixed) direction in 3-space, denoted by the unit vector ##\widehat {\bf v}##. We consider a transformation to a frame with 3-velocity ##{\bf v} \equiv v \widehat {\bf v}##, where the nonbold ##v## is a real number.

Write the linear transformations in the form: $$t' ~=~ a(v)t + {\bf b}({\bf v})\cdot {\bf x} ~,~~~~ {\bf x'} ~=~ {\bf d}({\bf v})t + {\bf E}({\bf v}) {\bf x} ~,~~~~~~~ (1)$$ where ##{\bf b}, {\bf d}## are 3-vector-valued functions and ##{\bf E}## is a ##3\times 3## matrix-valued function.

Since ##{\bf b}({\bf v})## is 3-vector-valued and depends only on ##{\bf v}##, it is necessarily of the form ##b(v){\bf v}##, where the nonbold ##b## is a new function, now scalar-valued. A similar argument applies to ##{\bf d}({\bf v})##. So we can rewrite the transformation equations (1) as: $$t' ~=~ a(v)t + b(v) {\bf v}\cdot {\bf x} ~,~~~~ {\bf x'} ~=~ d(v){\bf v}t + {\bf E}({\bf v}) {\bf x} ~.~~~~~~~ (2)$$
Now decompose ##{\bf x}## wrt ##\widehat {\bf v}## as $${\bf x} ~=~ {\bf x}_\| + {\bf x}_\perp$$ into parts parallel and perpendicular to ##\widehat {\bf v}##. I.e., ##{\bf x}_\| = x_\| \widehat {\bf v}## and ##\widehat {\bf v} \cdot {\bf x}_\perp = 0##. Also decompose ##{\bf x}'## similarly.

I invite you (or any other readers of this thread) to continue the above via the following exercises.

Exercise 1 (Easy): Using ##\widehat {\bf v} \cdot {\bf x} = v x_\|##, what form do the transformation equations now take?

Exercise 2 (Harder): Contracting both sides of the ##{\bf x'}## equations with ##\widehat {\bf v}##, and using the definition of spatial isotropy I gave earlier, deduce 2 distinct equations governing the transformations for ##x'_\|## and ##{\bf x}'_\perp## separately.

[Edit: Heh, who am I kidding? No one's going to do those.]
 
Last edited:
  • #75
samalkhaiat said:
The term “linear” in that thread is not the same as linearity in the algebraic sense: f(ax+by)=af(x)+bf(y) f(ax + by) = af(x) + bf(y). Linearity there means polynomial of degree one in the coordinates, i.e., solutions to the system of 2-order PDE’s
∂ 2 F σx μx ν =0.​
Yes, and this is precisely the kind of linearity I was referring to. However I reaffirm my only objection to your demonstration, i.e. that straight lines transforming into straight lines requires that the transformation be linear.(this is the only argument you should attack)
samalkhaiat said:
This is incorrect. The conformal group does not even act on Minkowski space M 4 M^{4}. It acts on the (conformally) compactified version of Minkowski space M ¯ 4 ≅S 3 ×S 1 /Z 2 \bar{M}^{4}\cong S^{3}\times S^{1} / Z_{2}.
You are right, I should have said that the light principle(LP) only implies that the transformation is conformal
samalkhaiat said:
Did you read the whole post? I used argument similar to Landau’s argument to show α=1 \alpha = 1.
As for the rest of your comments ,I was only sketching a demonstration and citing known authors because some other people may read the post. Yes I read the post and I noticed you used a similar agument.
 
  • #76
facenian said:
However I reaffirm my only objection to your demonstration, i.e. that straight lines transforming into straight lines requires that the transformation be linear.(this is the only argument you should attack)
.
How about you give me a counter-example?
Okay, I give you a transition functions F_{21}: x \to \bar{x} between two Minkowski charts. It ia assumed that F_{21} is a \mathscr{C}^{3} homeomorphism, i.e., continuosly thirce differentiable (smooth and regular). Furthermore, we demand that F_{21} (or its inverse) maps straight (world) lines onto straight lines. Now, you show me one such homeomorphism that does not correspond to linear tranformation?
 
  • #77
samalkhaiat said:
How about you give me a counter-example?
Okay, I give you a transition functions F21:x→x¯F_{21}: x \to \bar{x} between two Minkowski charts. It ia assumed that F21F_{21} is a C3\mathscr{C}^{3} homeomorphism, i.e., continuosly thirce differentiable (smooth and regular). Furthermore, we demand that F21F_{21} (or its inverse) maps straight (world) lines onto straight lines. Now, you show me one such homeomorphsm that does not correspond to linear tranformation?
Well, the discussion, as I understood it, is not at the mathematical level you are using. I' m sorry if I misunderstood that. At the level I took it there is no place for homeomorphisims, diffeomorphisms, bijective maps, etc. May be I will look like an ignorant fool to you but I'm not the only one, for instance, "Einstein Gravity in a Nutshell" by Zee was written at the mathematical level I'm using in this discussion. Please don't take this the wrong way, I respect the mathematics and I'm not saying it in a demeaning way, on the contrary, I think it's way over my head.
Having clarified this point, what I meant is that many authors working at the mathematical level to which I am referring (Einstein included), attribute the linearity either to the homogeneity and isotropy of space and time or to thecondition that straight lines must be transformed in straight lines, without further explanation, which I find inappropriate even for this level of rigor. At this level of rigor, a fractional linear transformation can work as a counter example, of course even I Know that this is not an homeormorphism for R4
 
  • #78
facenian said:
the discussion, as I understood it, is not at the mathematical level you are using.

This thread is at an "I" level, which means undergraduate level. However, the subthread you are participating in is probably on the borderline between "I" and "A" (which is graduate level). That's probably unavoidable given the nature of the topic; to talk about "derivation" of something you have to have enough rigor to be able to precisely specify the starting point and the conclusion.
 
  • #79
PeterDonis said:
his thread is at an "I" level, which means undergraduate level. However, the subthread you are participating in is probably on the borderline between "I" and "A" (which is graduate level). That's probably unavoidable given the nature of the topic; to talk about "derivation" of something you have to have enough rigor to be able to precisely specify the starting point and the conclusion.
This thread started with an innocent question and I believe by now it went too far, however I must confess I really enjoyed it and learned a lot from it.
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 101 ·
4
Replies
101
Views
7K
Replies
8
Views
819
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 32 ·
2
Replies
32
Views
4K
  • · Replies 33 ·
2
Replies
33
Views
3K