On the invariant speed of light being the upper speed limit

Joker93 · Oct 30, 2016

Hello! I have a question that has been bothering me since I first started learning about Special Relativity:
Given only the Minskowskian metric and/OR the spacetime interval, how can one reach the conclusion that the speed of light is invariant for every observer and how can one conclude that it is also the upper speed limit? Or, more generally, how can one conclude that there exists an upper speed limit?
Thanks!

andrewkirk · Oct 30, 2016

It is a speed limit in the sense that a particle with mass, whose velocity is initially less than the speed of light in an inertial frame (strictly - has a timelike velocity), cannot be accelerated to have a velocity that exceeds the speed of light in that frame (has a spacelike velocity). The reason is that, because the object's mass in that frame increases without limit as its velocity in the frame nears c, it takes infinite energy to make its speed reach c.

To my knowledge, there is no prohibition on particles having spacelike velocities for their entire existence. Such hypothetical particles are called tachyons. If we want to rule out their existence we need to introduce a postulate that explicitly does that. The prohibition cannot be deduced from the other postulates.

Joker93 · Oct 30, 2016

andrewkirk said:

It is a speed limit in the sense that a particle with mass, whose velocity is initially less than the speed of light in an inertial frame (strictly - has a timelike velocity), cannot be accelerated to have a velocity that exceeds the speed of light in that frame (has a spacelike velocity). The reason is that, because the object's mass in that frame increases without limit as its velocity in the frame nears c, it takes infinite energy to make its speed reach c.

To my knowledge, there is no prohibition on particles having spacelike velocities for their entire existence. Such hypothetical particles are called tachyons. If we want to rule out their existence we need to introduce a postulate that explicitly does that. The prohibition cannot be deduced from the other postulates.

So, in reality, we study special relativity postulating that everything follows timelike trajectories? If so, the question remains: what part of the math involved in special relativity shows this property? Is it the Minkowskian metric that implies an upper speed limit? Is it the spacetime interval? On a second thought, we obtain the spacetime iterval through the Minkowskian metric, so the strictly timelike trajectories(or lightlike if we are talking about things that travel with c) must be implied by the metric.
But I can't see this though.

andrewkirk · Oct 30, 2016

It is Newton's third law: Force = Mass times acceleration. Since mass approaches infinity, an infinite force is needed to accelerate a massive particle to the speed of light, which in turn will require an infinite amount of energy.

Joker93 · Oct 30, 2016

andrewkirk said:

It is Newton's third law: Force = Mass times acceleration. Since mass approaches infinity, an infinite force is needed to accelerate a massive particle to the speed of light, which in turn will require an infinite amount of energy.

But since all special relativity can be derived from the metric(or so I read), shouldn't what you are saying be explained with the metric?

strangerep · Oct 30, 2016

Joker93 said:

what part of the math involved in special relativity shows this property?

It's from the derivation of the symmetry group of transformations which preserves the Minkowski metric; -- more specifically, the subgroup corresponding to velocity boosts in a given direction. After crunching through the math, one finds the (now well-known) velocity addition formula: $$ v'' ~=~ \frac{v + v'}{1 + (vv'/c^2)} ~,$$ governing the composition of 2 Lorentz boosts where the respective relative speeds are ##v, v'## respectively. Provided both ##v## and ##v'## are ##\le c##, the result ##v''## can never exceed ##c##.

This is explained in every textbook on special relativity.

Dale · Oct 30, 2016

Joker93 said:

Given only the Minskowskian metric and/OR the spacetime interval, how can one reach the conclusion that the speed of light is invariant for every observer

This part at least is easy. Since ##ds^2=-c^2 dt^2 + dx^2 + dy^2 + dz^2## is invariant and when ##ds=0## the equation is a sphere expanding at the speed of light, that means that anything which is a sphere expanding at c in one frame is also a sphere expanding at c in all other frames.

robphy · Oct 30, 2016

Joker93 said:

Given only the Minskowskian metric and/OR the spacetime interval, how can one reach the conclusion that the speed of light is invariant for every observer

Find the eigenvectors of the boost transformations that preserve the Minkowskian metric. (Do the (1+1)-dimensional case for simplicity.)
(Bonus: what are the corresponding eigenvalues?)

PeterDonis · Oct 30, 2016

Joker93 said:

Given only the Minskowskian metric and/OR the spacetime interval, how can one reach the conclusion that the speed of light is invariant for every observer and how can one conclude that it is also the upper speed limit?

By noting that these are already contained in the Minkowski metric. In other words, by assuming the Minkowski metric, you are assuming the conclusion you want to draw.

A better question would be, what is the minimum you can assume in order to get to the conclusion you describe? AFAIK the principle of relativity plus the assumption that the laws of physics are invariant under space and time translations and ordinary rotations is sufficient to narrow down the possibilities to two: (1) no invariant speed and no upper limit (Newtonian physics, Galilean invariance), or (2) a single invariant speed which is also an upper limit (SR, Lorentz invariance). I don't have links handy to papers where this is shown, I'll see if I can find some references.

Orodruin · Oct 31, 2016

andrewkirk said:

The reason is that, because the object's mass in that frame increases without limit as its velocity in the frame nears c, it takes infinite energy to make its speed reach c.

I would here interject that "relativistic mass" is a largely obsolete concept that is prone to misunderstanding. I and many others have spent a considerable amount of time in pulling the concept out of people who come here with these. In modern relativity, "mass" only refers to the invariant mass and all actual uses of relativistic mass for a particle can be exchanged for total energy. In this case, the explanation would be: "The classical mechanics expression for total energy is changed in relativity, in relativity the total energy approaches infinity as v goes to c."

greswd · Oct 31, 2016

Joker93 said:

Hello! I have a question that has been bothering me since I first started learning about Special Relativity:
Given only the Minskowskian metric and/OR the spacetime interval, how can one reach the conclusion that the speed of light is invariant for every observer and how can one conclude that it is also the upper speed limit? Or, more generally, how can one conclude that there exists an upper speed limit?
Thanks!

The Minskowskian metric is built entirely around the notion that the speed of light is invariant.

As for your second question, try applying the Minskowskian metric to a tachyon's frame of reference.

In this paper, they end up with imaginary transverse (y & z) space. which is entirely incomprehensible.
http://dinamico2.unibg.it/recami/erasmo%20docs/SomeOld/RevisitingSLTsLNC1982.pdf

robphy · Oct 31, 2016

PeterDonis said:

By noting that these are already contained in the Minkowski metric. In other words, by assuming the Minkowski metric, you are assuming the conclusion you want to draw.

greswd said:

The Minskowskian metric is built entirely around the notion that the speed of light is invariant.

Theoretically, yes.
But suppose that one is somehow handed the metric (in some form) without the full theory behind it, the question seems to be how to then obtain the other features.

For example, one could conduct an experiment where folks with wristwatches start at a event and, after traveling with various velocities, stop [or somehow mark] when their wristwatches reach (say) 10 seconds. [Suppose muons [with a finite mean lifetime in its rest frame] could be produced with various velocities in the lab frame... we somehow measure the observed lifetime in the lab frame as a function of their velocity in the lab frame.] The locus of those stop-events would trace out a hyperbola on a position-vs-time graph, providing the metric by an experiment. From that, what can one deduce?

greswd · Oct 31, 2016

robphy said:

The locus of those stop-events would trace out a hyperbola on a position-vs-time graph, providing the metric by an experiment. From that, what can one deduce?

That the speed asymptotically reaches but never hits c?

I wish you could get your light clock spinning again too.

robphy · Oct 31, 2016

greswd said:

That the speed asymptotically reaches but never hits c?

I wish you could get your light clock spinning again too.

Right... one would deduce that [or at least suspect it]. Some extrapolation/curve fitting on the experimental data would probably suggest a hyperbola analytically.
But then go further and uncover the rest of special relativity from that. (Maybe one should do the same experiments in the lab moving with different velocities, etc...)

If you click on my username to see my profile, you'll see my light clock avatar animated there. For some reason, it doesn't seem to work in the threads.

PeterDonis · Oct 31, 2016

robphy said:

The locus of those stop-events would trace out a hyperbola on a position-vs-time graph, providing the metric by an experiment.

Providing a portion of the metric. But not all of it; to do that you would need to do experiments involving null and spacelike intervals as well as timelike intervals. In other words, you would have to, for example, do experiments with light clocks moving at various speeds, do experiments with lightning strikes hitting both ends of trains, etc. All of those experiments together (plus, for example, experiments confirming Maxwell's Equations and their prediction that electromagnetic waves always travel at the same speed in vacuum relative to all observers) would give you the full Minkowski metric, including the properties the OP asked about.

greswd · Oct 31, 2016

PeterDonis said:

Providing a portion of the metric. But not all of it; to do that you would need to do experiments involving null and spacelike intervals as well as timelike intervals. In other words, you would have to, for example, do experiments with light clocks moving at various speeds, do experiments with lightning strikes hitting both ends of trains, etc. All of those experiments together (plus, for example, experiments confirming Maxwell's Equations and their prediction that electromagnetic waves always travel at the same speed in vacuum relative to all observers) would give you the full Minkowski metric, including the properties the OP asked about.

Have all of these experiments (or similar experiments) been conducted before? For example the lightning train one.

robphy · Oct 31, 2016

PeterDonis said:

Providing a portion of the metric. But not all of it; to do that you would need to do experiments involving null and spacelike intervals as well as timelike intervals. In other words, you would have to, for example, do experiments with light clocks moving at various speeds, do experiments with lightning strikes hitting both ends of trains, etc. All of those experiments together (plus, for example, experiments confirming Maxwell's Equations and their prediction that electromagnetic waves always travel at the same speed in vacuum relative to all observers) would give you the full Minkowski metric, including the properties the OP asked about.

Certainly, one would have to do a variety of experiments to be exhaustive and to check predictions made as the theory is built up ...
However, if I have the unit circle [the unit hyperbola], that gives me the unit timelike vectors...
as well a definition of orthogonality... the tangent to the "circle" is "orthogonal" to the [radial] unit timelike vector.
(This works in Euclidean space and in the Galilean spacetime [a PHY 101 position-vs-time-graph].)
By comparing diagrams from labs on inertially moving ships doing the same experiment, I can deduce that this orthogonality condition is preserved,
as well as uncover invariant directions [associated with the null eigenvectors] in the diagram. (In the Galilean case, the null eigenvector is along the spatial direction.)
By deducing the transformations that preserve this, I can get the Minkowskian signature.
[Of course, we're making the usual assumptions of isotropy and homogeneity, etc...]

PeterDonis · Oct 31, 2016

greswd said:

Have all of these experiments (or similar experiments) been conducted before? For example the lightning train one.

Einstein did that one in his head.

I don't know that the exact experiments I described have been done in the exact form I described them; but we certainly have mountains of experimental evidence for the Minkowski metric.

PeterDonis · Oct 31, 2016

robphy said:

By comparing diagrams from labs on inertially moving ships doing the same experiment, I can deduce that this orthogonality condition is preserved,
as well as uncover invariant directions [associated with the null eigenvectors] in the diagram. (In the Galilean case, the null eigenvector is along the spatial direction.)

Given only this data, how would you distinguish the Galilean from the Minkowskian case?

robphy · Oct 31, 2016

PeterDonis said:

Given only this data, how would you distinguish the Galilean from the Minkowskian case?

The story I tell is that [figuratively] Galileo did this wristwatch experiment with observers spanning the range of velocities available by the technology of his day... say the upper speed was the speed of a racehorse. From the collected data, Galileo would extrapolate from the small segment on his position-vs-time diagram to a vertical line ranging to all velocities.

With the range of velocities was large enough to approach the speed of light or with high-precision timepieces, Minkowski [figuratively] could repeat the experiment and distinguish his circle from Galileo's circle. In addition, whereas Galileo would have found that the tangent-lines to his unit circle are parallel [implying absolute simultaneity], Minkowski would see that the tangent-lines associated with different unit radii are in fact not parallel... simultaneity is relative. (If one formulated the corresponding transformation laws ["rotations"], one would find that their eigenvectors are different... a little more analysis would uncover additivity of angles [arc lengths on the unit circles] and the non-additivity of slopes in Special Relativity, as one has in Euclidean geometry.)

Here is my desmos diagram again (from physicsforums.com/threads/general-questions-about-special-relativity.878810/page-5#post-5538694 )
https://www.desmos.com/calculator/ti58l2sair [time is along the horizontal axis]
You can tune "E" to go between Minkowski, Galilean, and Euclidean.

proxy.php?image=https%3A%2F%2Fs3.amazonaws.com%2Fgrapher%2Fexports%2Fti58l2sair.png

If you want to see an animation of the "stop when your wristwatch reads 1 sec experiment",
visit items 1. and 2. of my GeoGebra files [time is along the vertical axis]
https://www.geogebra.org/m/HYD7hB9v#

m4r35n357 · Oct 31, 2016

PeterDonis said:

A better question would be, what is the minimum you can assume in order to get to the conclusion you describe? AFAIK the principle of relativity plus the assumption that the laws of physics are invariant under space and time translations and ordinary rotations is sufficient to narrow down the possibilities to two: (1) no invariant speed and no upper limit (Newtonian physics, Galilean invariance), or (2) a single invariant speed which is also an upper limit (SR, Lorentz invariance). I don't have links handy to papers where this is shown, I'll see if I can find some references.

Probably mentioned this once or twice before, but I think this article is not a bad place to start.

Joker93 · Nov 2, 2016

Guys, upon reading your replies, I must state that, although the information about how one could discover the metric from experiments are more than helpful, I was really interested in the mathematics behind it. Let me be more clear. I am now learning about GR and SR for the first time. Our professor just gave us the spacetime interval saying that it's a quantity which was found to be invariant under Lorentz transformations and then we calculated some interesting things. But, without knowing anything else except from the metric or the invariant interval could one deduce mathematically that there is a speed limit in the universe and that it is that of the speed of light? Or, put differently, how is the upper limit speed information encoded into the metric and the spacetime interval?

SiennaTheGr8 · Nov 2, 2016

Joker93 said:

Or, put differently, how is the upper limit speed information encoded into the metric and the spacetime interval?

That's the better way to ask the question, I think. It's the upper speed limit that comes first. Everything else follows from that.

Are you familiar with the light-clock thought experiment? It shows how the invariance of the speed of light implies the invariance of the spacetime interval. The math is simple. See here: https://books.google.com/books?id=PDA8YcvMc_QC&pg=PA67

Same for the Lorentz transformation, which differs from the Galilean transformation precisely because it accounts for the existence of a universal speed limit. The math is simple here, too: http://www.princeton.edu/~kdiab/relativity.pdf (PDF)

I don't know enough GR to speak to the metric.

robphy · Nov 2, 2016

Mathematically, the answer is in @strangerep 's reply.
From a more spacetime-viewpoint,...
Take any future unit-timelike vector (representing the 4-velocity of an observer).
Apply a nontrivial Lorentz boost transformation (i.e. not the identity).
The result is another unit-timelike vector.
Use the same boost on this result... and keep going.
The resulting unit-timelike vector approaches (but never reaches) the lightcone.
The sequence of boosts can be combined into a single boost (essentially, the composition of velocities).
The velocity-parameter v labeling the boosts approaches the speed of light.
Alternatively, the [additive] rapidity-angle parameter approaches infinity.

[Why the Lorentz transformation?... because that is the transformation that preserves the metric [preserves the dot-product].)

Joker93 · Nov 2, 2016

robphy said:

Mathematically, the answer is in @strangerep 's reply.
From a more spacetime-viewpoint,...
Take any future unit-timelike vector (representing the 4-velocity of an observer).
Apply a nontrivial Lorentz boost transformation (i.e. not the identity).
The result is another unit-timelike vector.
Use the same boost on this result... and keep going.
The resulting unit-timelike vector approaches (but never reaches) the lightcone.
The sequence of boosts can be combined into a single boost (essentially, the composition of velocities).
The velocity-parameter v labeling the boosts approaches the speed of light.
Alternatively, the [additive] rapidity-angle parameter approaches infinity.

[Why the Lorentz transformation?... because that is the transformation that preserves the metric [preserves the dot-product].)

But, in your answer and also strangerep's answer, you are assuming that we know what the Lorentz transformations are, right?

robphy · Nov 2, 2016

Joker93 said:

But, in your answer and also strangerep's answer, you are assuming that we know what the Lorentz transformations are, right?

So, the prerequisite problem is this:
Given a metric (a dot-product), find the group of transformations that preserve that metric.
(This is Felix Klein's view... geometry is encoded in the group of transformations...)

Edit: we do this because we are effectively requiring that there is a symmetry among inertial observers [the principle of relativity]. That there are no timelike-eigenvectors of the transformations says that no inertial observer is preferred over any other inertial observer.

Edit: is there some theorem from linear algebra that says something to the effect that ##B^n\vec v ## approaches an eigenvector (the dominant eigenvector) of B?

Joker93 · Nov 2, 2016

robphy said:

So, the prerequisite problem is this:
Given a metric (a dot-product), find the group of transformations that preserve that metric.
(This is Felix Klein's view... geometry is encoded in the group of transformations...)

So, in reality, given just the metric, someone can find the transformations that preserve it and then everything just comes out of knowing the metric and the transformations, right? Well, this seems just about right, but then how can one find those transformations?

Also,consider the other way around. Given the transformations, how can you find the metric(the invariant interval follows from this)?

Please, forgive me if I am asking elementary questions, it's just that I am just now learning SR and GR(at the same time; it's a long story).

robphy · Nov 2, 2016

Joker93 said:

So, in reality, given just the metric, someone can find the transformations that preserve it and then everything just comes out of knowing the metric and the transformations, right? Well, this seems just about right, but then how can one find those transformations?
Please, forgive me if I am asking elementary questions, it's just that I am just now learning SR and GR(at the same time; it's a long story).

For the (1+1)-case, setup an arbitrary 2-by-2 matrix with entries ##\left(\begin{array}{cc} a&b\\c&d \end{array}\right)##.
Then there are certain group properties that have to satisfied. This puts constraints on the entries... until you are led to the boost transformation.
(You can try this for a rotation in Euclidean space... as a warmup.)

Nugatory · Nov 2, 2016

Joker93 said:

But, in your answer and also strangerep's answer, you are assuming that we know what the Lorentz transformations are, right?

It's easy enough to show that the (flat) metric is preserved under the Lorentz transformations and only the Lorentz transformations, so the form of the metric does logically imply the Lorentz transformations - you do not need to assume their truth separately and without proof.

It is true that it's a lot easier to find the Lorentz transformations in the metric if you already know what you're looking for, but that's true of most derivations of non-trivial results.

robphy · Nov 2, 2016

As a followup to my earlier post...
look at this section for an example
https://en.wikipedia.org/wiki/Deriv...rmations#Galilean_and_Einstein.27s_relativity

In the more abstract case, when you don't necessarily know what to expect,
you are led to a differential equation for one of the last matrix elements
[the rest of the elements have already been constrained by the group properties].
Depending on the dot product, your matrix entry will be a hyperbolic trig function for Special Relativity
and a circular trig function to Eucldean space.

On the invariant speed of light being the upper speed limit

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Undergrad Why is gravity a fictitious force?

Undergrad Relativistic Space Travel: Optimizing Proper Time [Project Hail Mary]

Undergrad KE of rotating disc

Undergrad Why is the Lorentz Force always perpendicular to velocity?

Graduate How valid is the Block Universe theory?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect