Deriving General Lorentz Boost Equation

Click For Summary

Discussion Overview

The discussion revolves around deriving the general Lorentz boost equation for velocity in an arbitrary direction. Participants explore the mathematical foundations of Lorentz transformations, particularly focusing on the application of boosts in multiple dimensions and the implications of combining these boosts.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant presents a method of deriving the general Lorentz boost by replacing velocities in the 1D Lorentz transformation matrices for each direction and multiplying them, but finds discrepancies with established forms.
  • Another participant suggests that the product of two non-collinear boosts does not yield a boost, proposing instead to use a rotation matrix to align the boost direction with one of the axes before applying the Lorentz transformation.
  • A further reply emphasizes the importance of the order of operations in applying the rotation and boost matrices, indicating that the correct sequence is to first apply the rotation, then the boost, followed by the inverse rotation.
  • Some participants discuss the mathematical structure of the transformations, noting that combining non-parallel boosts results in both a boost and a rotation, referencing the Lorentz algebra to support this claim.
  • Questions arise regarding the necessity of certain orthogonality conditions for the rotation matrix and how these conditions relate to the validity of the transformation process.
  • Participants express confusion about the derivation process and seek clarification on why certain mathematical relationships must hold true for the transformation to be valid.

Areas of Agreement / Disagreement

There is no consensus on the derivation method for the general Lorentz boost. Participants present competing views on the validity of combining boosts directly versus using rotation matrices, and the discussion remains unresolved regarding the best approach.

Contextual Notes

Participants highlight limitations in their understanding of matrix operations and the implications of orthogonality in the context of Lorentz transformations. There are unresolved questions about the mathematical relationships that must be satisfied for the transformations to hold.

Who May Find This Useful

This discussion may be useful for those interested in advanced topics in special relativity, particularly in understanding the mathematical framework of Lorentz transformations and the implications of combining boosts in different directions.

HJ Farnsworth
Messages
126
Reaction score
1
Greetings,

I have been having trouble deriving the equation for the general Lorentz boost for velocity in an arbitrary direction. It seems to me that given the 1D Lorentz transformations...

matrix for Lorentz transformation in x-direction, X:
{{1/sqrt(1-v^2), -v/sqrt(1-v^2), 0, 0},
{-v/sqrt(1-v^2), 1/sqrt(1-v^2), 0, 0},
{0, 0, 1, 0},
{0, 0, 0, 1}}

matrix for Lorentz transformation in y-direction, Y:
{{1/sqrt(1-v^2), 0, -v/sqrt(1-v^2), 0},
{0, 1, 0, 0},
{-v/sqrt(1-v^2), 0, 1/sqrt(1-v^2), 0},
{0, 0, 0, 1}}

matrix for Lorentz transformation in z-direction, Z:
{{1/sqrt(1-v^2), 0, 0, -v/sqrt(1-v^2)},
{0, 1, 0, 0},
{0,0,1,0},
{-v/sqrt(1-v^2), 0, 0, 1/sqrt(1-v^2)}}

...all I should have to do is replace each v in X with vx, each v in Y with vy, and each v in Z with vz, and then multiply the matrices to get ZYX as the general Lorentz boost equation. This should sequentially compute the Lorentz transformation in the x-direction with the x-velocity, then the y-direction with the y-velocity, then the z-direction with the z-velocity. However, when I do that, the answer I get only vaguely resembles the general Lorentz boost equation found on Wikipedia (http://en.wikipedia.org/wiki/Lorentz_transformation#Matrix_form).

I can't find the derivation for the general boost matrix anywhere, and more importantly, I have no idea why the simple math I used is wrong - it seems conceptually correct to me.

Please tell me what the flaw in my logic is, and how to correct it.

Thanks for any help you can give.

-HJ Farnsworth
 
Physics news on Phys.org
By the way, in case this wasn't clear in my previous post, the vx, vy, and vz in the previous post were for 3-velocity vector v = vxex + vyey + vzez, where the e's are orthonormal basis vectors in the x-, y-, and z-directions.
 
HJ Farnsworth said:
Please tell me what the flaw in my logic is, and how to correct it.

In general, the product of two non-colinear boosts is not a boost. To find a boost in an arbitrary direction from a boost B in the x direction, compute
R B R^{-1},
where R is the rotation matrix that rotates the x direction into the arbitrary direction.
 
Offhand, I'd say that what you should do to get the general boost is to perform some rotation R, so that the rotated coorinates xr, yr, zr have the boost going along one of the axes - say the x-axis for specificity.

Then you do the boost.

Then you perform the inverse rotation R^(-1) such that R(-1)R = a unit matrix, and you should have the general boost along an arbitary direction.

It's important to get the right order, and I'm not totally sure I've gotten that right in my text description. Basically, I expect it to go like this:

<br /> R^{-1} \Lambda R<br />

where R is the rotation matrix. The composition process works from right to left, first you apply R, then do the boost \Lambda, then you do the inverse of R, see http://en.wikipedia.org/w/index.php?title=Transformation_matrix&oldid=431791983

"Composing and inverting transformations"There's probably an easier way.

[add] I see George snuck an answer in ahead of me, they seem to be equivalent except that the matrix R is defined diffrently.
 
Hey,

Thanks for the quick responses . Your answers make perfect sense - in fact I can't believe I didn't think of that. Ah well, we all have off days I guess.

I still have one question though. I understand why RBR-1 works - it rotates the coordinate axes to be in the easiest direction, does the standard Lorentz transformation, then rotates the coordinate axes back so we have the transformed coordinates in our original coordinate system. However, I am still not quite clear on why a series of orthogonal boosts does not work. How come the product of two non-colinear boosts is not a boost? Please be explicit/detailed in telling me why my original thought process in my first post was wrong.

Again, thanks very much.

-HJ Farnsworth
 
More precisely, the product of two non-parallel boosts is not purely a boost. Generically, the result will be a boost and a rotation. For example, if you boost in the X direction and then boost in the Y direction, the result will be a boost in some direction in the XY plane, combined with an additional rotation about the Z axis.

This can be seen from the Lorentz algebra:

\begin{gather*}[J_a, J_b] = \varepsilon_{abc} J_c \\ [K_a, K_b] = -\varepsilon_{abc} J_c \\ [J_a, K_b] = \varepsilon_{abc} K_c \end{gather*}

where J_a are the rotations, and K_a are the boosts. In particular, two boosts do not generically commute (as per second line).
 
HJ Farnsworth said:
Hey,

Thanks for the quick responses . Your answers make perfect sense - in fact I can't believe I didn't think of that. Ah well, we all have off days I guess.

I still have one question though. I understand why RBR-1 works - it rotates the coordinate axes to be in the easiest direction, does the standard Lorentz transformation, then rotates the coordinate axes back so we have the transformed coordinates in our original coordinate system. However, I am still not quite clear on why a series of orthogonal boosts does not work. How come the product of two non-colinear boosts is not a boost? Please be explicit/detailed in telling me why my original thought process in my first post was wrong.

Again, thanks very much.

-HJ Farnsworth
Sorry for digging this thread up, but why does RBR-1 work?

Is B the vx boost matrix? If so, why does it cause it go in the 'easiest direction' when it acts on the rotation matrix. I can see why you then have to do the inverse to go back to the original coordinate basis, but why is the resulting matrix now in a more general form? As you can guess, my understanding of matrices is pretty shaky.
EDIT: Using that method I can't seem to get the right result anyway... Is the rotation matrix supposed to be: {{coshx, -sinhx, 0, 0}, {-sinhx, coshx, 0, 0}, {0, 0, 1, 0}, {0, 0, 0, 1}} ?
 
Last edited:
Given a 3D velocity v = (vx, vy, vz) with v = |v|, <br /> \textbf{B} = \begin{bmatrix}<br /> \gamma &amp; -\gamma v &amp; 0 &amp; 0\\ <br /> -\gamma v &amp; \gamma &amp; 0 &amp; 0\\ <br /> 0 &amp; 0 &amp; 1 &amp; 0\\ <br /> 0 &amp; 0 &amp; 0 &amp; 1<br /> \end{bmatrix}<br />
Choose two unit vectors f and g so that v, f and g are all orthogonal to each other.
<br /> \textbf{R} = \begin{bmatrix}<br /> 1 &amp; 0 &amp; 0 &amp; 0\\ <br /> 0 &amp; v_x/v &amp; f_x &amp; g_x\\ <br /> 0 &amp; v_y/v &amp; f_y &amp; g_y\\ <br /> 0 &amp; v_z/v &amp; f_z &amp; g_z<br /> \end{bmatrix}<br />

Now work out RBRT
 
DrGreg said:
Now work out RBRT

Thanks for the help. I did what you said but without understanding why it works.

Comparing it to the required result, I get something very similar but which requires equations such as the following to be true:

vx2/v2 + fx2 + gx2 = 1

and

-(vxvy)/v2 = fxfy + gxgy
Why are these true? (And why does the method even work at all?)
 
Last edited:
  • #10
3nTr0pY said:
Thanks for the help. I did what you said but without understanding why it works.

Comparing it to the required result, I get something very similar but which requires equations such as the following to be true:

vx2/v2 + fx2 + gx2 = 1

and

-(vxvy)/v2 = fxfy + gxgy


Why are these true?
The way I defined R ensures that its columns are orthonormal. It is a fact about square matrices, maybe not immediately obvious but nevertheless true, that orthonormal columns imply orthonormal rows. Or, to put it another way, \mathbf{R}^T\mathbf{R} = \mathbf{I} implies \mathbf{RR}^T = \mathbf{I}. Your equations above follow from this.
3nTr0pY said:
(And why does the method even work at all?)
The matrix R was chosen as an orthogonal matrix satisfying<br /> \textbf{R}^T \, \begin{bmatrix} 0 \\ v_x \\ v_y \\ v_z \end{bmatrix}<br /> = \begin{bmatrix} 0 \\ v \\ 0\\ 0 \end{bmatrix}<br />or equivalently
<br /> \textbf{R} \, \begin{bmatrix} 0 \\ v \\ 0\\ 0 \end{bmatrix} = \begin{bmatrix} 0 \\ v_x \\ v_y \\ v_z \end{bmatrix}<br />(and, of course, leaving the time dimension unmodified).
 
Last edited:
  • #11
Great. Thanks again for your help.
 

Similar threads

Replies
3
Views
1K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 22 ·
Replies
22
Views
3K
  • · Replies 31 ·
2
Replies
31
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K