I Derivation of E=pc & E=MC2: Which Came First?

  • I
  • Thread starter Thread starter imsmooth
  • Start date Start date
  • Tags Tags
    Derivation E=mc2
  • #51
jbergman said:
Especially, when we can show that more directly from physics and the postulates of special relativity.
Sure, I agree. Using the principle of least action is more universal, nobody stops you to do that. See my post #50.
But keep in mind that postulates of special relativity are still the same. I didn't use anything extra in my replies.

Before you said:
jbergman said:
but it doesn't say anything why the zeroth component of 4-momentum is energy.
I think I provided you several arguments in posts #44 and #49 that leads to a conclusion, "it must be a energy". So I find the words "it doesn't say anything" a bit unfair :smile:
 
  • Like
Likes jbergman
Physics news on Phys.org
  • #52
jbergman said:
A more physical way to prove this, IMO, is to first prove ##E=mc^2##.
I think you refer to the "relativistic mass" ##m_R##. That is avoided today. Therefore I replaced it by ##E/c^2##, which is the same anyway.
 
  • #53
lomidrevo said:
Sure, I agree. Using the principle of least action is more universal, nobody stops you to do that. See my post #50.
But keep in mind that postulates of special relativity are still the same. I didn't use anything extra in my replies.

Before you said:

I think I provided you several arguments in posts #44 and #49 that leads to a conclusion, "it must be a energy". So I find the words "it doesn't say anything" a bit unfair :smile:
You have a point. I guess each of has to decide what is a compelling proof. I am only chiming in on this thread because I worked through the same confusion several weeks ago. At the end of the day, the final proof is that it agrees with experiment. But there are other paths there. You provided strong motivation as to why it would be true. I, personally, found Feynman's proof of in his lecture notes based on the conservation of momentum nice.

It is obviously a complex issue, see Ohanian, who claims, Einstein never successfully proved this.

This site, https://plato.stanford.edu/entries/equivME/, has a great article on the history of this relation and attempts to prove it including disputes as to whether or not various proofs were correct.
 
  • Like
Likes lomidrevo
  • #54
I think the whole point here is how we want to define "energy".
One easy way is just to define ##E=p^0## there's nothing more to add. #49 has proved that this definition is consistent with the definition of "energy" used in Newtonian mechanics, so that's good.
So now, one can say that "energy" is defined in another way. And that would be also OK, then we would just need to prove that the two definitions are equivalent.
At the end of the day, if you think that proving ##E=p^0## can only be proved by "experimental agreement" is because your notion of energy requires some experiment to be defined.
 
  • Like
Likes jbergman and lomidrevo
  • #55
Demystifier said:
According to the book
https://www.amazon.com/dp/0393337685/?tag=pfamazon01-20
(which is a serious book, not a crackpot one), Einstein made 7 different proofs of ##E=mc^2## and all 7 proofs had a mistake.
It is not crank, but most claims in this book remain disputed by other experts
 
Last edited:
  • Like
Likes jbergman
  • #56
I’m not an expert in physics by any means, but if you take ##E## as fixed, it follows that ##\gamma=\frac{E}{mc^2}## and ##p=\gamma mv=\frac{Ev}{c^2}##. Since ##\frac{v^2}{c^2}=1-\gamma^{-2}=1-\frac{(mc^2)^2}{E^2}##, taking ##m=0## yields ##v=c## and ##p=\frac{E}{c}##.

Whether or not this makes physical sense, I wouldn’t know, but for example the photon has a well-defined energy from Maxwell’s equations, no? So taking the limit as ##m## approaches zero is similar to approximating a photon as a point mass with a fixed energy and a very, very low mass.
 
  • #57
suremarc said:
it follows that ##\gamma=\frac{E}{mc^2}## and ##p=\gamma mv=\frac{Ev}{c^2}##.

But these formulas are only valid for ##m > 0##, so you can't use them to derive anything that applies to ##m = 0##. Or, if the "take the limit as ##m \rightarrow 0##" idea occurs to you, it's still not valid; formulas involving ##\gamma## are only valid for timelike vectors, not null vectors, and there is no continuous way to go from one to the other.

suremarc said:
the photon has a well-defined energy from Maxwell’s equations

Remember that we are talking classical physics, not quantum physics, so there is no such thing as a "photon". When books on classical relativity use the term "photon", what they really mean is "a pulse of light that lasts a very short time, so it can be approximated by a single null worldline in spacetime". In terms of Maxwell's Equations, this would be modeled as a wave packet with a very small "spread" in spacetime, modeled as a single null worldline using the geometric optics approximation. Such a wave packet does have a well-defined energy, yes, but it is frame-dependent.

suremarc said:
taking the limit as ##m## approaches zero is similar to approximating a photon as a point mass with a fixed energy and a very, very low mass.

No, it isn't, because, as I said above, timelike vectors and null vectors are fundamentally different, and you can't just wave your hands and "take a limit" to go from one to the other.
 
  • Like
Likes suremarc
  • #58
Or put another way, a null vector is as much the limit of a ruler as it is of trajectory; both are invalid, and there is no reason to favor one as an approximation over the other.
 
  • Like
Likes suremarc
  • #59
With regards to limits,
the diagram below from this old post ( https://www.physicsforums.com/threads/massless-photon.900960/post-5842652 ) might be useful

1611209344163.png


(an extension of this diagram is here:
https://physics.stackexchange.com/a/551250/148184 )
 
  • Like
Likes suremarc
  • #60
I'd say, if you are at this level of sophistication to ask "why do the physical laws look as they do", the best answer, found in connection with Einstein's works, are symmetry principles. The most general symmetry principles are related with the space-time models used in the different theories (Galilei-Newton, Einstein-Minkowski, Einstein(-Cartan)). The latter case is a bit more complicated, because GR is after all a gauge theory "gauging" the Lorentz group.

Galilei-Newton and Einstein-Minkowski are pretty much at the same level, i.e., given the space-time structure you derive the most generally valid continuous Lie-symmetry group for these space-time models and write down possible dynamical laws for the natural mathematical objects occurring in these space-time models.

For point mechanics (very natural and simple for Newtonian physics but on the edge of being inconsistent in special relativistic physics) and assuming a closed system of one particle (i.e., automatically a free particle, because there's nothing which the particle can interact with), using the analysis of Noether's theorem within the Lagrangian formulation of the action principle. The common subgroups of the Galilei (homogeneity of space and time, isotropy of space) you end up with the most general form of the Lagrangian (in Cartesian coordinates)
$$L=L(\dot{\vec{x}}^2).$$
The only difference is then in the boosts.

For an infinitesimal Galilei boost,
$$t'=t, \quad \vec{x}'=\vec{x}-\delta v \vec{n} t$$
The "symmetry condition" reads
$$-\vec{n} \frac{\partial L}{\partial \dot{\vec{x}}}+\frac{\mathrm{d}}{\mathrm{d} t} \Omega(\vec{x},t)=0.$$
From this you get
$$-2 \vec{n} \cdot \dot{\vec{x}} L'(\dot{\vec{x}}^2)+\dot{\vec{x}} \cdot \vec{\nabla} \Omega + \partial_t \Omega=0.$$
This is fulfilled for
$$\Omega=m \vec{n} \cdot \vec{x}$$
and
$$L'(\dot{\vec{x}}^2)=\frac{m}{2} \; \rightarrow\;L=\frac{m}{2} \dot{\vec{x}}^2.$$
with ##m=\text{const}##. The conserved quantity then turns out to be
$$m \vec{n} \cdot \vec{X}:=m \vec{n} (\vec{x}-\vec{v} t).$$
From Noether's theorem for temporal and spatial translation invariance this leads to the "definition" of the energy and momentum
$$E=H=\dot{\vec{x}} \cdot \vec{p}-L=\frac{m}{2} \dot{\vec{x}}^2, \quad \vec{p}=\frac{\partial L}{\partial \dot{\vec{x}}}=m \dot{\vec{x}}.$$
For and infinitesimal Lorentz boost you have
$$t'=t-\delta \vec{v} \cdot \vec{x}/c^2, \quad \vec{x}'=\vec{x}-\delta \vec{v} t.$$
A similar analysis using the condition for that to be a symmetry of the action indeed leads to
$$L=-m c^2 \sqrt{1-\dot{\vec{x}}^2/c^2},$$
and from the Noether symmetry under temporal and spatial translations you get
$$E=H=\frac{m c^2}{\sqrt{1-\dot{\vec{x}}^2/c^2}}, \quad \vec{p}=\frac{\partial L}{\partial \dot{\vec{p}}}=\frac{m \dot{\vec{x}}}{\sqrt{1-\dot{\vec{x}}^2/c^2}}.$$
From this it's easy to see that ##(E/c,\vec{p})## are four-vector components from realizing that
$$E/c=m \frac{\mathrm{d} c t}{\mathrm{d}\tau}, \quad \vec{p}=\frac{\mathrm{d} \vec{x}}{\mathrm{d} \tau},$$
where
$$\mathrm{d} \tau=\mathrm{d} t \sqrt{1-\dot{\vec{x}}^2/c^2} = \sqrt{\mathrm{d}t^2 - \mathrm{d} \vec{x}^2/c^2}$$
is the proper time of the (massive) particle. So
$$p^{\mu}=(E/c,\vec{p})=m \mathrm{d}_{\tau} x^{\mu}.$$
Since the proper time is a Lorentz scalar and ##x^{\mu}## are Lorentz-vector components, with ##m## being a scalar, also ##p^{\mu}## are Lorentz-vector components.
 
  • Like
Likes Sagittarius A-Star, PAllen and lomidrevo
  • #61
Einstein's 1935 derivation of the equivalence of mass and energy:
https://projecteuclid.org/download/pdf_1/euclid.bams/1183498131

From an elastic eccentric collision scenario with 2 particles with equal mass, he derives (using a unit system with ##c=1##), that the relativistic momentum ##m\gamma \vec{v}## and the relativistic kinetic energy ##m(\gamma-1)## of the system are conserved in any inertial reference frame. He makes use of the symmetry in the center-of-momentum frame and the Lorentz-transformation.

Then he considers an inelastic collision between 2 particles with equal mass and equal rest-energy ##E_0##. He derives from the energy law for the system, that ##E_0## and ##m## of the particles change equally, and that therefore
$$E_0=m$$
in ##E=E_0+m(\gamma -1)##.

He concludes:
Einstein (1935) said:
If, from the beginning, we had provided the expression for the impulse with a mass-constant different from that of the energy, these considerations would show that the impulse-mass changes in an inelastic collision like the energy-mass. This is a partial justification for setting both mass-constants equal to each other.

The result of this consideration is therefore as follows. If for collisions of material points the conservation laws are to hold for an arbitrary (Lorentz) coordinate-system, the well known expressions for impulse and energy follow, as well as the validity of the principle of equivalence of mass and rest-energy.
 
Last edited:
  • Like
Likes jbergman
  • #62
The important point is that ##E/c=(E_0+E_{\text{kin}})/c## together with ##\vec{p}## forms a four-vector. The methodological breakthru has been Minkowski's formulation of SR spacetime as a 4D pseudo-Euclidean affine manifold!
 
  • #63
vanhees71 said:
The important point is that ##E/c=(E_0+E_{\text{kin}})/c## together with ##\vec{p}## forms a four-vector. The methodological breakthru has been Minkowski's formulation of SR spacetime as a 4D pseudo-Euclidean affine manifold!
Yes. In the 1935 paper of Einstein, that I linked, he started with the definition of the four-momentum and did the momentum-calculations for the elastic collision for all 4 dimensions, see his 4 equation(s) number (1) on page 226.

Einstein said:
We remark that the equations (1) can be derived more clearly if one considers directly the transformation for the sum of the four-vectors of the velocities of a particle-pair. I have chosen the above representation, however, because the conservation laws indicate the use of this 3-dimensionally inhomogeneous manner of writing.
Source:
https://projecteuclid.org/download/pdf_1/euclid.bams/1183498131

I think he did not want to give up completely the relation to the understanding of 3-momentum and energy in classical mechanics for didactical reasons.
 
Last edited:
  • #64
imsmooth said:
I have seen E=pc used to derive E=MC2. I have seen E=MC2 used to derive E=pc. This is circuitous. Which came first and how is E=pc derived?

From relativistic formula
E^2=p^2c^2+m^2c^4
we get ##E=pc## by making ##m=0## for photon and other massless particles.

Mass does not apply for a single photon, so let us prepare N photon gas of same wavelength and no total momentum in a theorotetical massless sphere vessel. In experiments done in the rest frame of this photon ball, it would show inertia and a source of gravity m, i.e.
m=\frac{Npc}{c^2}=\frac{E}{c^2}
 
Last edited:
  • Like
Likes lomidrevo and vanhees71
  • #65
mitochan said:
Mass does not apply for a single photon, so let us prepare N photon gas of same wavelength and no total momentum in a theorotetical massless sphere vessel.
Now assume instead a light-clock at rest in frame ##S## containing 2 photons, each with an energy of ##L##, moving always in opposite direction between the mirrors at the top and at the bottom. Then the energy-content of this light-clock is ##E_0 = 2L##.

In this reference frame ##S##, the two mirrors are removed at the same time. One photon escapes now from the light clock in y-direction, the other in (-y)-direction.

Assume a frame ##S'##, moving constantly with ##v## in positive x-direction. Two observers are at the same x'-coordinate, but at y'-coordinates with opposite signs.

Described in ##S##, those oberserves in ##S'## (with time-dilated clocks) must receive the photons each blue-shifted with energy
##L' = L \gamma (1 - \frac{v}{c} * \cos{90°}) = \gamma L##, according to Einstein's Doppler-formula.
(In frame ##S'##, the blue-shift comes from aberration.)

So in frame ##S'##, the energy content of the light clock must have been ##E = 2L' = \gamma E_0## before loosing the photons. That means:

$$E_{kin} = mc^2(\gamma -1)= E -E_0 = E_0(\gamma -1)$$

Plausibility-check to classical mechanics (using for x << 1: ##\frac{1}{\sqrt{1-x}}\approx 1+\frac{1}{2}x##):
##mc^2(\gamma -1) \approx \frac {1}{2}mv^2##

This is a similar, but simpified argumentation as in Einstein 1905:
https://www.fourmilab.ch/etexts/einstein/E_mc2/e_mc2.pdf
 
Last edited:
  • Like
Likes mitochan and vanhees71
  • #66
Example from classical mechanics:
kinetic energy = ##\int \vec F \cdot d\vec s = \int (m*\vec a) \, d\vec s = \int (m*\frac{d\vec v}{dt}) \, d\vec s = m\int \vec v \cdot d\vec v = \frac{1}{2}mv^2 + const_2##

With ##a=\gamma^3 a_0## (follows from relativistic velocity addition) and if the force is in direction of movement:

relativistic kinetic energy = ##\int F \cdot ds = m \int \gamma^3 a_0 \cdot ds = m \int \gamma^3 \frac{dv}{dt} \cdot ds = m\int_0^v \gamma^3 v \cdot d v = mc^2(\gamma -1) ##

Source (see after Einsteins definition of "longitudinal mass"):
https://en.wikisource.org/wiki/Translation:On_the_Electrodynamics_of_Moving_Bodies
 
Last edited:
  • #67
Correction to last posting:

With proper acceleration ##\alpha=\gamma^3 a## (follows from relativistic velocity addition) and if the force is in direction of movement:

relativistic kinetic energy = ##\int F \cdot ds = \int m\alpha \cdot ds = m \int \gamma^3 a \cdot ds = m \int \gamma^3 \frac{dv}{dt} \cdot ds = m\int_0^v \gamma^3 v \cdot d v = mc^2(\gamma -1) ##

Source (see after Einsteins definition of "longitudinal mass"):
https://en.wikisource.org/wiki/Translation:On_the_Electrodynamics_of_Moving_Bodies

The integral can be calculated via:
https://www.integral-calculator.com/
 
Last edited:
  • #68
Addition to above posting #67:

If the force is in direction of movement, the formula ##F = m\gamma^3 a## can also be derived from the relativistic momentum:

##F = \frac{d}{dt}(m\gamma v) = m \frac{d}{dv}(\gamma v) \frac{dv}{dt} = m\gamma^3 a##

The derivative ##\frac{d}{dv}(\gamma v)## can be calculated via:
https://www.derivative-calculator.net/
 
Back
Top