I think that we can narrow down to and organize the discussion around three issues:
1)
if in order to jump:
- from what is obvious, i.e. that the spatial path traversed by light in a given frame, as composed by the Pythagorean combination of the three spatial dimensions, is equal to the time elapsed in such frame times the speed of light
c\Delta t = \sqrt {\Delta {x^2} + \Delta {y^2} + \Delta {x^2}}
-
to what is already the ST interval in the form of
{(c\Delta t)^2} = \Delta {x^2} + \Delta {y^2} + \Delta {z^2}
the argument "I have squared both sides" is sufficient justification or you need "something else", some "added justification"
2)
what this "else" would be, which in my opinion is (being very generic) hypothesizing that the new dimension, cT, is related to the old ones, X-Y-Z, in a manner that precisely justifies squaring both sides and which also has to do with the Pythagorean Theorem, but in a different manner, in order to account for the different sign
3)
if in particular this "else" can be simply saying that the transformation is going to be a hyperbolic rotation, so I identify the ST interval with the equation of the hyperbola, the unit hyperbola
To me 1) is very clear, but I would not like to look stubborn. At a given time, we could agree that the matter has been discussed enough and leave the thread (without need to close it!) until someone comes with a new comment. But as of now I don't lose the hope of convincing you. Some arguments:
Dale said:
This is incorrect. ##\Delta t## can be negative, but this expression cannot have a negative ##\Delta t##.
As already stated, if you want to get rid of negative time you can always stipulate that what goes into the ST interval is the absolute value of cT or either square both sides but immediately squareroot them, which leads to this other (wrong) interval:
\sqrt {{{(c\Delta t)}^2}} - \sqrt {(\Delta {x^2} + \Delta {y^2} + \Delta {z^2})} = \sqrt {{{(c\Delta t')}^2}} - \sqrt {(\Delta x{'^2} + \Delta y{'^2} + \Delta z{'^2})}
Of course, to avoid this ugly expression you must square both sides, but the subtlety that I favor is just admitting that you do it for a reason that goes beyond the pure algebraic trick.
Dale said:
This is a perfectly valid algebraic operation and your restriction doesn’t make sense. If an equation is true then it remains true under correct algebraic manipulations regardless of why you are doing them. That is indeed the point of algebra.
Sure. The equation remains true under the algebraic manipulation of squaring both sides. But the equation would also remain true under the algebraic manipulation of raising both sides to the 11th power, which would lead to a ST that is false. That is because what is valid for the algebraic purpose (solving for unknowns) may not be valid to guess what remains invariant under transformations. We have a set of operations that would be valid for the first purpose (squaring both sides, squaring and squarerooting to get absolute values, raising both sides to any other power...), but only the first is apt for the second purpose and that is due to "something else".
As to 2),
PeterDonis said:
If this argument were valid, it would prove too much: it would prove that the Pythagorean theorem is not correct in Euclidean space.
I would suggest thinking carefully about what justifies the Pythagorean theorem (i.e., summing squares of coordinate deltas) in Euclidean space. The justification for using squares of coordinate deltas in Minkowski spacetime will be the same.
Well, it is clear to me that we can add up vectors through the Pythagorean theorem ("PT") when they are orthogonal to each other. That is what you do for X-Y-Z with the expression:
c\Delta t = \sqrt {\Delta {x^2} + \Delta {y^2} + \Delta {x^2}}
The question is how and why the new dimension cT can join the team of dimensions "somehow" related by the PT. I have thought of two routes:
a) The resultant of combining X-Y-Z through the PT in Euclidean manner combines with cT through the hyperbola equation. In this case, cT is orthogonal to X-Y-Z in the Minkowski way (dot product with negative sign is zero).
b) As reflected in the Loedel diagram, what is (in another sense) perpendicular is cT with (simplifying) X' and cT' with X, so that the ST interval is the height of the right triangle.
How the two things match is, though, a subtle question which I find beyond my reach as of now. Comments are welcome, but really the OP was not meant to discuss this.
In any case, the fact that the orthogonality between cT and X-Y-Z is implicit or embedded in the ST interval itself seems clear to me. As a proof: look at how the Wikipedia derivation continues in the second section "Linearity". At a given point they find a term that is remininscent of the last term of the Law of Cosines and they equate it to 0 after "comparing coefficients" and noting that the term of comparison does not contain a counterpart. To be expected, because the ST interval is built by assuming orthogonality.
As to 3), what was puzzling me was that in the ST diagram the null vector remains intact, how could it be dilated by the eigenvalue = the Doppler factor?
I have now realized that the Doppler factor is also the scale at which the second frame is drawn in the Minkowski diagram, so the null vector "is" dilated in this diagram even if you dont see it on the page, right? Thanks for guiding me to this is insight (if it is correct at all).
Said this, does this mean that you are unfavoring now any derivation of the ST interval that takes as illustration a null vector? Well, the problem with deriving the ST interval is that there are three possible displays, depending on whether you choose a timelike, lightlike or spacelike vector... You have to choose one reference and then generalize the result to the others or you can repeat it with lightlike (what we have done so far) and timelike (which is the other derivation of the light clock though experiment, which Iike, although it seems not be in fashion nowadays), I would not know how to express it with spacelike, since we would present an impossible scenario of something traveling FTL... In any case, I don't see a problem in saying that we are in face of a hyperbolic rotation even if the reference is precisely the vector that acts as eigenvector of the rotation and therefore gets only dilated, without changing direction.
Ultimately, this is a question of semantics: I think that you can perfectly say that the ST interval is what remains invariant under the "hyperbolic rotation" of a vector, no matter what kind of vector we are talking about and regardless whether the effect of the rotation is change of direction or dilation. Take the example of a right triangle where the height is overlapping with the Y axis and you reflect it precisely over the Y axis. You would say that the figure as a whole has been "reflected", even if a particular side (the height) has remained untouched (not even dilated in this case) in the course of the reflection.