Teaching Relativity in a College Physics course

AI Thread Summary
The discussion centers on teaching special relativity to non-majors within a limited timeframe of one week, focusing on key concepts like length contraction and time dilation. The instructor plans to use a reading assignment and online homework to prepare students before two class meetings, with an emphasis on spacetime geometry to enhance understanding. Feedback suggests that fundamental principles, such as the constancy of the speed of light and inertial frames, should be included to ensure comprehension of relativity concepts. Some participants express concerns about the use of units where c=1, arguing it may confuse beginners rather than clarify the material. Overall, the conversation highlights the challenges of effectively teaching complex physics concepts in a constrained schedule.
  • #51
Andrew Mason said:
The problem with the latter is that the expression E = m is true ONLY if c=1 whereas E = mc^2 is always true.
There are systems of units where you write ##E_o=kmc^2## and then say ##E_o=mc^2## only for systems where k=1.
 
Science news on Phys.org
  • #52
Orodruin said:
This is wrong. The meter is defined as 1/299792458 s (which happens to be a very suitable unit for measuring the spatial size of many things), you never measure the speed of light.

I am not sure what you are saying... After choosing a system of units of time and distance, of course someone can then measure the speed of light.

The meter is *now* defined this way, but it has not always been defined that way, obviously. And before that definition was chosen, the speed of light *had* to be measured!

What you are talking about is exactly what I said in my earlier post: one can pick an arbitrary unit of time, then pick an arbitrary value for the speed of light and these two will then define a unique measure of distance. They defined the speed of light to be 2999792458 m/s just so that it would then give a unit of distance close to the previous one but they might as well have defined the speed of light to be 7.3891 zoobie per second, which would have defined the zoobie. Or they could have defined the speed to be 1 light-second per second, which would have defined the light-second. The number 1 is prettier than 7.3891 or 2999792 558 but that's just pure aesthetic.
We never said it is wrong, it is just obscuring the geometry. By introducing and using ct everywhere you are essentially doing the same, you are just calling your time variable with a longer and more cumbersome name. And of course you can pick any units you like, just as you can chose to measure one spatial direction in feet and the other in light years. It just obstructs the symmetry and make the coordinate transformations awkward.

Well you did say that it was wrong that c=1 really means c = 1 unit of distance over one unit of time. By the way, it seems like you feel that using different units for time and space is misleading because it introduces an artificial distinction between the two. Despite the two being connected in a deep way, they *are* different and this is not only because of a choice of units. We can move in all directions along the spatial coordinates but we cannot move in both direction in time (at least in SR). The time piece has a different sign than the three spatial pieces in the spacetime interval. We can stay at fixed x,y,z coordinates but not at a fixed time coordinate. Causality is related to whether or not ##(\Delta t)^2## is larger or smaller or equal to ##(\Delta x)^2 + (\Delta y)^2 + (\Delta z)^2##. In all these cases, time is clearly distinguished from the space coordinates. So insisting that because time and space are basically the same thing we should use the same units could also be considered as obscuring the different status of the two. And this difference is physical, it is not due to me using light-years for distance units and years for time units, say.But at least we agree on one thing: it is a matter of choice, not of physics, whether one uses c=1 or not. That's what I have tried to point out since the beginning. Weather something is more or less obscure is a matter of opinion, not of physics.

So now we can go back to the very beginning. What makes something more obscure to someone may make things less obscure to someone else, and vice versa. I agree that for advanced students, it can be interesting to be pointe out units in which c=1. But my point was that I think that for beginning students, it just obscures things by adding one layer of "newness" on top of all the subtleties of the Lorentz transformations. I think that they have enough in dealing with relativity of simultaneity, time dilation, etc etc without having to deal with a new set of units which *for them* will be new and confusing (they will not become comfortable with giving distances in seconds and giving speeds in pure fractions of c within a week or two of classes, while at the same time dealing with all the physics of the Lorentz transformations.

That's all I was trying to say.
 
  • Like
Likes Andrew Mason
  • #53
nrqed said:
My point is that in both B and C, the value chosen for c here is completely arbitrary! Nothing forces it to be 1. What I mean by this is that if two persons use different values of c (and the corresponding two different sets founts), they will agree on any physical result obtained by calculations.
But as I said before, this viewpoint is nothing new and the equivalent of selecting an orthonormal set of basis vectors in a Euclidean space. Sure, you can select a basis where the length of one of the vectors is 53, but why would you do that to yourself?
 
  • #54
Andrew Mason said:
A time interval in a given inertial reference frame is always a time interval to all observers in that reference frame.
A timelike interval is always timelike. It is not the same as saying it is a time interval!
Andrew Mason said:
But all observers make the distinction between time and space. If time and space were equal, it would be just a matter of changing the axes to make a time interval a space interval. But that cannot be done.
But this is a property of the metric, not a property of the coordinates! By stating it otherwise you are missing fundamental relativistic insights.

Andrew Mason said:
I agree that matter contains potential energy and that the potential energy a body contains is proportional to its rest mass.
Seriously, it is not a potential energy! It is the rest energy and there is absolutely no other concept of mass in relativity and it exactly corresponds to the inertia in the classical limit. What more do you want from it to call it mass?

Andrew Mason said:
Particle physicists often express the rest mass of particles in units of energy, such as meV, but it is always understood that this is just a short-hand. The units of mass are meV/c2
I am a particle physicist ... And if you look into papers such as the announcements of the Higgs discovery, you will find that it quotes masses in GeV and not GeV/c^2. Of course, this is not wrong, it is simply using the fact that there is no other concept of mass than the rest energy and there is no point in inteoducing an arbitrary conversion constant.

nrqed said:
I am not sure what you are saying... After choosing a system of units of time and distance, of course someone can then measure the speed of light.
So I challenge you to do this in our current definition of the units. You will not succeed. If anything, you will simply make a calibration measurement of your ruler.

You cannot measure something you have defined.

nrqed said:
The meter is *now* defined this way, but it has not always been defined that way, obviously. And before that definition was chosen, the speed of light *had* to be measured!
Of course, but that was not the point. The point was that you cannot measure it in the current definition.

nrqed said:
Despite the two being connected in a deep way, they *are* different and this is not only because of a choice of units. We can move in all directions along the spatial coordinates but we cannot move in both direction in time (at least in SR). The time piece has a different sign than the three spatial pieces in the spacetime interval. We can stay at fixed x,y,z coordinates but not at a fixed time coordinate.
Again, this is a property of the metric, not of the units you have used and this property also becomes clearer in units where c=1.
 
  • #55
Orodruin said:
A timelike interval is always timelike. It is not the same as saying it is a time interval!
I was speaking of a time interval, ie. the difference in the time coordinates of two events. I was just saying that all observers in the same reference frame will agree on the magnitude and sign of the time interval. Observers in other reference frames will all agree that there is a time interval of some magnitude, and with the same sign, if the interval is time-like. The fact that intervals may be time-like or space-like means that time and space are distinct physical quantities.

But this is a property of the metric, not a property of the coordinates! By stating it otherwise you are missing fundamental relativistic insights.
I don't think I stated otherwise.

Seriously, it is not a potential energy! It is the rest energy and there is absolutely no other concept of mass in relativity and it exactly corresponds to the inertia in the classical limit. What more do you want from it to call it mass?
The coulomb potential energy between protons in a uranium nucleus contributes to the rest mass of the uranium nucleus. Do we stop calling it potential energy because it contributes to rest mass? When an atom absorbs a photon of energy ##E = h\nu## and an electron in the atom moves to a higher energy level, the rest mass of the atom increases by ##\Delta m_0 = E/c^2##. Is it wrong to think of the atom in the higher energy state as having more potential energy?

AM
 
Last edited:
  • #56
Andrew Mason said:
I was speaking of a time interval, ie. the difference in the time coordinates of two events. I was just saying that all observers in the same reference frame will agree on the magnitude and sign of the time interval.
Ok, I misread your statement as a statement about different frames. Being a statement about the same frame makes it moot instead. Of course people using the same frame will get the same time difference, they will get the same space-time difference to, which is a vector in Minkowski space. There is nothing strange going on here and again the physics becomes more transparent if you give all vector components the same dimension.

Andrew Mason said:
I don't think I stated otherwise.

But your statement that it is based on the dimensions of the coordinates seem to imply it. The distinction between a time-like and a space-like vector comes only from the metric tensor. It has absolutely nothing to do with the units used.
Andrew Mason said:
The coulomb potential energy between protons in a uranium nucleus contributes to the rest mass of the uranium nucleus. Do we stop calling it potential energy because it contributes to rest mass? When an atom absorbs a photon of energy E=hν and an electron in the atom moves to a higher energy level, the rest mass of the atom increases by Δm0=E/c2. Is it wrong to think of the atom in the higher energy state as having more potential energy?

No, what is wrong is to say that all mass is potential energy, which is what I understood from your statement. However, this just underlines my point. The only mass concept that is relevant (and a scalar) in special relativity is the rest energy. If you want to argue against this you need to provide a counter example where this is not the case.
 
  • #57
Orodruin said:
Ok, I misread your statement as a statement about different frames. Being a statement about the same frame makes it moot instead. Of course people using the same frame will get the same time difference, they will get the same space-time difference to, which is a vector in Minkowski space. There is nothing strange going on here and again the physics becomes more transparent if you give all vector components the same dimension.
You can give the time dimension units of distance by multiplying by c. You can't do it by simply declaring the time dimension to be the same as a spatial dimension.

But your statement that it is based on the dimensions of the coordinates seem to imply it. The distinction between a time-like and a space-like vector comes only from the metric tensor. It has absolutely nothing to do with the units used.
Yes. But the three spatial dimensions are arbitrary (so long as they are mutually orthogonal) whereas the time dimension is not. In an inertial reference frame one can describe the space time coordinates of an event relative to the origin as (ct, x, y, z) where x, y and z can be anything so long as x2+y2+z2 is always the same. But you cannot do that with t.
No, what is wrong is to say that all mass is potential energy, which is what I understood from your statement. However, this just underlines my point. The only mass concept that is relevant (and a scalar) in special relativity is the rest energy. If you want to argue against this you need to provide a counter example where this is not the case.
I will agree that speaking about mass as a form of potential energy is out of vogue. But there is nothing intrinsically wrong with saying that rest energy is a form of potential energy. The only way a particle's rest energy can be completely released is through annihilation with its anti-particle, and those are generally in short supply in our neck of the universe, so it is not really practical to treat rest energy as potential energy. But even Einstein in his 1907 essay on relativity spoke about rest energy as potential energy.

AM
 
Last edited:
  • #58
Andrew Mason said:
But the three spatial dimensions are arbitrary (so long as they are mutually orthogonal) whereas the time dimension is not.

The time dimension is orthogonal to the three spatial dimensions. By scaling them you form an orthonormal set of basis vectors. To do that each of the four basis vectors must be mutually orthogonal unit vectors.
In an inertial reference frame one can describe the space time coordinates of an event relative to the origin as (ct, x, y, z) where x, y and z can be anything so long as x2+y2+z2 is always the same. But you cannot do that with t.
That's what he means about the metric being different. It's not ##x^2+y^2+z^2##, it's ##(ct)^2-x^2-y^2-z^2## (timelike). As a result of the metric being different you can still exchange x, y, and z with each other, but you cannot exchange ct with x, y, or z.

None of that has anything to do with making the basis vectors orthonormal.
I will agree that speaking about mass as a form of potential energy is out of vogue. But there is nothing intrinsically wrong with saying that rest energy is a form of potential energy. The only way a particle's rest energy can be completely released is through annihilation with its anti-particle, and those are generally in short supply in our neck of the universe, so it is not really practical to treat rest energy as potential energy. But even Einstein in his 1907 essay on relativity spoke about rest energy as potential energy.

In the example you gave of an atom absorbing a photon look at the total energy of the system in its center of momentum frame before the absorption. In the low speed approximation it's ##h \nu + Mc^2##; after the absorption it's ##(M+m)c^2##, where ##M## is the mass of the atom before absorption and ##(M+m)## is the mass afterwards. (I think the approximation I'm using may also require that ##Mc^2>>h \nu##, but either way it's satisfied!) The contribution to the rest energy made by the photon is ##h \nu##, and it's equal to ##mc^2##.

If you divide each term by ##c^2## you get the mass. That is, the mass of the system before the absorption is ##h \nu/c^2 + M## and afterwards it's ##(M+m)##. The contribution to the mass made by the photon is ##h \nu/c^2##, and it's equal to ##m##.

So what is being converted here? The thing called the energy contribution, ##h \nu##, is being converted into the thing called the mass contribution ##m##, or is the thing called the mass contribution ##h \nu/c^2## being converted into the energy contribution ##mc^2##? Now, this is all semantics of course. Rest energy and mass are two names for the same thing. The total mass of the system before the absorption equals the total mass afterwards. And the total rest energy of the system before the absorption equals the total rest energy afterwards.

The pedagogical point being made here is that the factor of ##c^2## is considered by some to obscure the physics. But regardless of anyone's opinion on that matter, rest energy and mass are equivalent. Note that to see the equivalence you must look at composite bodies, that is, systems that consist of entities.

Also note that you can do the same thing with energy and momentum vector components that you can with time and space vector components, respectively. When you do that the factor of ##c## gets in the way because, and I hope I'm saying this right, the basis vectors aren't orthonormal.
 
  • #59
Andrew Mason said:
You can't do it by simply declaring the time dimension to be the same as a spatial dimension.
Yes you can. I am sorry, but stating otherwise is simply untrue. You don't have to, but you can.

Andrew Mason said:
Yes. But the three spatial dimensions are arbitrary (so long as they are mutually orthogonal) whereas the time dimension is not.

Again untrue. The time direction is quite arbitrary, just perform a Lorentz transformation. What stops you from completely exchanging time and space is the geometry (pseudo-Riemannian metric), not the units. The time direction is taken to be orthogonal to the spatial directions in standard Minkowski coordinates. Now, just as you don't have to use units where c=1, you of course do not need to use orthogonal coordinates. I can introduce light-cone coordinates where two coordinates are orthogonal to themselves.
Andrew Mason said:
But there is nothing intrinsically wrong with saying that rest energy is a form of potential energy.
The thing that is wrong with it is that you would be using a terminology which the rest of the world does not use. I can decide to go around calling kinetic energy potential energy (after all, it can be used to create new particles too!) but nobody will understand me.
 
  • #60
Orodruin said:
Yes you can. I am sorry, but stating otherwise is simply untrue. You don't have to, but you can.
If you can it is certainly not going to be obvious and likely not understandable to a student who is learning this in a one week introduction. Apart from the units not working, it is conceptually not clear. In either case, it makes it more difficult to convey the physics. The physics is not that complicated: it all derives from the fact that c is the same in all inertial frames which directly results in the concept of simultaneity being relative rather than absolute. If you just wave your hands and say 'look at this neat geometry - it explains everything' I think you will not succeed in conveying anything except confusion.

Again untrue. The time direction is quite arbitrary, just perform a Lorentz transformation. What stops you from completely exchanging time and space is the geometry (pseudo-Riemannian metric), not the units. The time direction is taken to be orthogonal to the spatial directions in standard Minkowski coordinates. Now, just as you don't have to use units where c=1, you of course do not need to use orthogonal coordinates. I can introduce light-cone coordinates where two coordinates are orthogonal to themselves.
Since you cannot exchange time and space because of the geometry (I would say it is because of the physics), then it is confusing to say that time and space are the same. You seem to be saying that they are the same but different. BTW, I would be interested in seeing you use 'light-cone coordinates' for a reference frame that do not refer to any other reference frame.
The thing that is wrong with it is that you would be using a terminology which the rest of the world does not use. I can decide to go around calling kinetic energy potential energy (after all, it can be used to create new particles too!) but nobody will understand me.
I am not so sure about that. We speak about nuclear potential energy and electric potential energy in an atom which arises by virtue of the configuration of the parts of the atom's nucleus and electrons. So far as I can tell, these potential energies contribute significantly if not entirely to the rest energy of an atom. We speak about the energy contained within a system by virtue of its configuration (such as a compressed spring, compressed air, stretched elastics etc) as potential energy.

It seems to me that there is not a material difference between the use of potential energy in those contexts and the rest energy of an atom.

AM
 
Last edited:
  • #61
Andrew Mason said:
I am not so sure about that. We speak about nuclear potential energy, and electric potential energy. So far as I can tell, these potential energies contribute significantly if not entirely to the rest energy of an atom. We speak about the energy contained within a system by virtue of its configuration (such as a compressed spring, compressed air, stretched elastics etc) as potential energy. It seems to me that there is not a material difference between the use of potential energy in that context and the rest energy of an atom.

The kinetic energy of electrons makes a positive contribution to the mass (rest energy) of an atom. Researchers are finding that this is making a significant contribution to the shielding provided by the inner electrons in the heaviest elements. There was an article on it recently in Physics Today.
 
  • #62
Andrew Mason said:
Apart from the units not working, it is conceptually not clear.
Yes it is. It is very clear. I am sorry if you do not see this.

Andrew Mason said:
In either case, it makes it more difficult to convey the physics.
On the contrary, it makes the actual physics easier to convey as you do not have to worry about unit conversions. There is a reason we do not use metric units in one spatial direction and imperial in another.

Andrew Mason said:
The physics is not that complicated: it all derives from the fact that c is the same in all inertial frames which directly results in the concept of simultaneity being relative rather than absolute.
No, this is not the main physical point in relativity. Simultaneity is a convention, nothing else. Applying that convention gives different result in different Minkowski coordinates, but you could have chosen any other simultaneity convention as well. For example, in a FRW space-time, there is a natural simultaneity convention in terms of the comoving time. The main physics results are the geometry of space-time, the proper time being the pseudo-Riemannian length of a time-like world line, and the division of a space-time into the future, past, and elsewhere for a given event. Things such as length contraction and time dilation are all coordinate dependent statements that rely on an arbitrary definition of simultaneity.

I will agree that this is not the way we usually teach students, but the way we teach students may be wrong. I have seen what happens to students (and PF visitors!) struggling with understanding time dilation, length contraction, and other "classical" concepts in SR when they are confronted with the actual physics. They all want to base their understanding on this arbitrary definition of simultaneity and it can go terribly wrong.

Andrew Mason said:
Since you cannot exchange time and space because of the geometry (I would say it is because of the physics), then it is confusing to say that time and space are the same.
But you can! All you need to do is to introduce a curvilinear coordinate system where the basis varies continuously. Then you have coordinates, let us call them ##\xi## and ##\zeta## where ##\xi## may be time-like at one point and ##\zeta## at another (take polar coordinates on a 1+1 dimensional space-time - there is nothing stopping you from doing this). You seem to fixate on using a set of Minkowski coordinates, but again that is a special case and unless you can convince me that spherical coordinates are useless when dealing with three spatial dimensions, you will not be able to convince me that Minkowski coordinates hold any kind of special role (apart from the fact that the metric is always diag(1,-1,-1,-1)). In particular not if you insist on using a set of Minkowski coordinates which is not normalised.

Physics is not coordinates, physics is things which you can compute and then go to your laboratory and measure.

Andrew Mason said:
BTW, I would be interested in seeing you use 'light-cone coordinates' for a reference frame that do not refer to any other reference frame.
Take a two-dimensional affine space and introduce the coordinates ##\xi## and ##\zeta##. Introduce the metric ##ds^2 = 2 d\xi d\zeta##. Done.

Of course you can easily show that this is equivalent to 1+1-dimensional Minkowski space, but that is not the point. The physics will work in exactly the same way!

Andrew Mason said:
I am not so sure about that. We speak about nuclear potential energy, and electric potential energy. So far as I can tell, these potential energies contribute significantly if not entirely to the rest energy of an atom. We speak about the energy contained within a system by virtue of its configuration (such as a compressed spring, compressed air, stretched elastics etc) as potential energy. It seems to me that there is not a material difference between the use of potential energy in that context and the rest energy of an atom.
This might work until you get down to the level of elementary particles. The mass of the elementary particles is an intrinsic property and has nothing to do with an internal field configuration (it has to do with expanding the theory around a vacuum which does not respect gauge symmetry, but that is another matter).
 
  • #63
Mister T said:
The kinetic energy of electrons makes a positive contribution to the mass (rest energy) of an atom. Researchers are finding that this is making a significant contribution to the shielding provided by the inner electrons in the heaviest elements. There was an article on it recently in Physics Today.
Good point. But I expect that the electron kinetic energy is small in comparison to the coulomb potential energy of the protons in the nucleus, let alone the nuclear potential energy of the neutrons and protons.

In any event, it depends on how deep one looks. Here is an example where we think of kinetic energy as potential energy: When we say that a can of compressed air (ideal gas) has potential energy we are really referring to its (potential) ability to do PV work on its surroundings. And that ability to do work is due to the kinetic energy of the molecules inside the container.

AM
 
  • #64
Orodruin said:
Yes it is. It is very clear. I am sorry if you do not see this.
Don't feel sorry for me. Feel sorry for the student in this one-week course who you are expecting to understand how time = space and c is dimensionless ! Saying it is clear doesn't make it any clearer.
Simultaneity is a convention, nothing else.
The definition of 'simultaneous' is hardly arbitrary and not really a convention. It is a perfectly understandable word in normal use that means 'occurring at the same time'. Relativity accepts that definition: two events are simultaneous to an inertial observer if the observer's measurements give the same time co-ordinate for each event.

Things such as length contraction and time dilation are all coordinate dependent statements that rely on an arbitrary definition of simultaneity.

I will agree that this is not the way we usually teach students, but the way we teach students may be wrong. I have seen what happens to students (and PF visitors!) struggling with understanding time dilation, length contraction, and other "classical" concepts in SR when they are confronted with the actual physics. They all want to base their understanding on this arbitrary definition of simultaneity and it can go terribly wrong.
But we implicitly use simultanaeity to measure lengths. It is hardly arbitrary. The length of a stick is the spatial separation between two simultaneous observations ie. the observations of the location of each end of the stick. That is a very simple thing to convey to a new student. So with c being absolute (i.e the speed of a light signal is measured to be the same in all inertial reference frames) it is easy to show that simultaneous events to one observer are not simultaneous to an inertial observer in an other reference frame, and that explains length contraction. Time dilation simply follows from absolute c and length contraction.

AM
 
  • Like
Likes nrqed
  • #65
Andrew Mason said:
Don't feel sorry for me. Feel sorry for the student in this one-week course who you are expecting to understand how time = space and c is dimensionless ! Saying it is clear doesn't make it any clearer.
I would say we have long since transcended talking about the one-week course. This discussion ensued from you claiming it to be unequivocally wrong to use the same units for time and space coordinates, which I find outright misleading.

Andrew Mason said:
The definition of 'simultaneous' is hardly arbitrary and not really a convention.
Yes it is. It comes down to an arbitrary choice of coordinates - or if you prefer calling a set of Minkowski coordinates a frame - an arbitrary choice of frame. There is a multitude of other possibilities of defining simultaneities as space-like foliations of space-time. In particular, the arbitrariness also becomes clearer in GR where it is not necessarily possible to define a global simultaneity.

Let us study the 1+1 dimensional FRW metric ##ds^2 = c^2 dt^2 - a(t)^2 dx^2## (I inserted the ##c## just for you). Would you agree that selecting comoving time ##t## is a good definition of something being simultaneous? A comoving observer is an observer for which ##dx = 0## and this observer will measure a proper time progressing at the same rate as the time-like coordinate ##t##.

Andrew Mason said:
Relativity accepts that definition: two events are simultaneous to an inertial observer if the observer's measurements give the same time co-ordinate for each event.
This is just one of many different possible definitions of simultaneity. I will give you that it is the most common one in SR, but that does not make it the only one. The problem arises when you realize that ##t## is just a coordinate.

Andrew Mason said:
But we implicitly use simultanaeity to measure lengths.
This is how you define lengths. As such, it is intrinsically dependent on the definition of simultaneity - not the other way around.

Andrew Mason said:
and that explains length contraction.

But length contraction is a coordinate effect, as is time-dilation. It appears because you have defined simultaneity in a particular way - it is a result of the commonly used definition of simultaneity, not something which the definition of simultaneity explains. They are really not the fundamental thing in relativity, regardless of what introductory textbooks would have you believe. They are both an artefact of the coordinate systems used, which has led to many a student obsessing over the symmetry of time dilation when it really is nothing but an effect of rotating the coordinate axes. An effect which you have also in Euclidean space (see, e.g, my Insight on this subject).
 
  • #66
Orodruin said:
I would say we have long since transcended talking about the one-week course. This discussion ensued from you claiming it to be unequivocally wrong to use the same units for time and space coordinates, which I find outright misleading.
I don't think I said it was wrong to use the same units for the time and space coordinates. If the time co-ordinate is ct, it is not a problem. ct is a distance. And since c is a constant, the ct coordinate is always proportional to time (as measured in that inertial reference frame). What I objected to was simply stating that time and space are the same physical phenomena.

Yes it is. It comes down to an arbitrary choice of coordinates - or if you prefer calling a set of Minkowski coordinates a frame - an arbitrary choice of frame. There is a multitude of other possibilities of defining simultaneities as space-like foliations of space-time. In particular, the arbitrariness also becomes clearer in GR where it is not necessarily possible to define a global simultaneity.

Let us study the 1+1 dimensional FRW metric ##ds^2 = c^2 dt^2 - a(t)^2 dx^2## (I inserted the ##c## just for you). Would you agree that selecting comoving time ##t## is a good definition of something being simultaneous? A comoving observer is an observer for which ##dx = 0## and this observer will measure a proper time progressing at the same rate as the time-like coordinate ##t##.
You've lost me there. We are talking about Special Relativity - inertial reference frames.

This is how you define lengths. As such, it is intrinsically dependent on the definition of simultaneity - not the other way around.
How would you define length?
But length contraction is a coordinate effect, as is time-dilation. It appears because you have defined simultaneity in a particular way - it is a result of the commonly used definition of simultaneity, not something which the definition of simultaneity explains. They are really not the fundamental thing in relativity, regardless of what introductory textbooks would have you believe. They are both an artefact of the coordinate systems used, which has led to many a student obsessing over the symmetry of time dilation when it really is nothing but an effect of rotating the coordinate axes. An effect which you have also in Euclidean space (see, e.g, my Insight on this subject).
I think time dilation is a bit more than a coordinate effect, whatever you mean by that. The effect is real and readily observed. The direction of photons emitted from relativistic electrons in a synchrotron is a direct result of time dilation.

AM
 
  • #67
Andrew Mason said:
You've lost me there. We are talking about Special Relativity - inertial reference frames.
But SR is not only about inertial frames! Of course we start by teaching it like that just as we do not start vector analysis in parabolic coordinates, but it is a trivial matter to do SR in a general coordinate system. This is not what separates SR from GR. A very common example of a curvilinear coordinate system in SR is Rindler coordinates.

Also, there is a point to my question about the 1+1-dimensional FRW space-time so I would like you to answer it.
Andrew Mason said:
What I objected to was simply stating that time and space are the same physical phenomena.
Of course they are not, but this is a geometrical effect and not a dimensional one. The fact remains that one person's pure time direction has space components in a different person's frame.

Andrew Mason said:
How would you define length?
The same way you do, the distance in the surface of simultaneity between the end points. But this length definition is going to depend on the simultaneity convention.
Andrew Mason said:
The effect is real and readily observed.
No, this is again just plain wrong. What is observed is that, e.g., muons from the upper atmosphere reach the ground. The reason for this is that their proper time between the creation and hit the ground events is short enough for them not to decay. This will be described differently in different coordinate systems and you may refer to it using whatever coordinates you select. But "time dilation" is the effect of the proper time of an observer being shorter than the difference in time coordinates. Obviously this is a coordinate dependent statement. Just the fact that the wording of the description in different frames is different should tell you that the description is coordinate dependent. The physical observable is the proper time and its value depends only on the world line and the geometry of space-time.

Andrew Mason said:
The direction of photons emitted from relativistic electrons in a synchrotron is a direct result of time dilation.

Time dilation may be used to describe it in the lab frame, but the effect itself is based on the geometry of space-time. You could go to any other coordinate system and the result will be the same.
 
  • #68
Orodruin said:
But SR is not only about inertial frames! Of course we start by teaching it like that just as we do not start vector analysis in parabolic coordinates, but it is a trivial matter to do SR in a general coordinate system. This is not what separates SR from GR. A very common example of a curvilinear coordinate system in SR is Rindler coordinates.

Also, there is a point to my question about the 1+1-dimensional FRW space-time so I would like you to answer it.
As I said, you lost me there. Sounds like GR. I am restricting my comments to SR and inertial reference frames.

Of course they are not, but this is a geometrical effect and not a dimensional one. The fact remains that one person's pure time direction has space components in a different person's frame.
I am really not sure what that means. I am quite certain a student in a one-week course on SR would feel the same. The challenge is not to explain things in a way that appears elegant to a mathematician. The challenge is to make it into a readable colouring book.

It seems to me that the physics is what it is whether or not we apply a mathematical construct to the real world. Dimensions, on the other hand, are physical. For example, collisions of bodies occur or do not occur for dimensional reasons (time being one of those dimensions).

As a general comment, I don't think we disagree on essential points but we have, obviously, a different approach to teaching the subject. As I say, our universe is what it is. While there are various approaches one may take in fitting mathematical models to SR, so long as they are consistent and fit the evidence, they are all valid. The question is which one should be taught and when.

No, this is again just plain wrong. What is observed is that, e.g., muons from the upper atmosphere reach the ground. The reason for this is that their proper time between the creation and hit the ground events is short enough for them not to decay. This will be described differently in different coordinate systems and you may refer to it using whatever coordinates you select. But "time dilation" is the effect of the proper time of an observer being shorter than the difference in time coordinates. Obviously this is a coordinate dependent statement. Just the fact that the wording of the description in different frames is different should tell you that the description is coordinate dependent. The physical observable is the proper time and its value depends only on the world line and the geometry of space-time.
This is very much like the twin paradox: the muon ages less during its journey than the stationary observer on the earth, which is the reference frame in which the muon began and ended its journey. So it seems natural to use the coordinate system of the Earth to analyse what is happening. Such an event cannot happen if the muon's clock ran at the same rate as that of an inertial observer on the earth.
Time dilation may be used to describe it in the lab frame, but the effect itself is based on the geometry of space-time. You could go to any other coordinate system and the result will be the same.
The highly directional light from a synchrotron in the lab frame is omnidirectional in the rest frame of the electron that emits it. So I am not sure why the highly directional result would be the same in any other coordinate system.

AM
 
  • Like
Likes nrqed
  • #69
Orodruin said:
No, this is again just plain wrong. What is observed is that, e.g., muons from the upper atmosphere reach the ground. The reason for this is that their proper time between the creation and hit the ground events is short enough for them not to decay. This will be described differently in different coordinate systems and you may refer to it using whatever coordinates you select. But "time dilation" is the effect of the proper time of an observer being shorter than the difference in time coordinates. Obviously this is a coordinate dependent statement. Just the fact that the wording of the description in different frames is different should tell you that the description is coordinate dependent. The physical observable is the proper time and its value depends only on the world line and the geometry of space-time..
You are saying that the lifetime of the muon in a frame where it is not at rest is not a physical observable?? How do you define "physical observable"? I define it as something that can be physically measured. I can certainly measured the time between the creation of a muon and its disintegration in a frame where it is not at rest. Why is it not a physical observable?? You are saying that only invariant quantities are observables but this is plain wrong.
 
  • #70
I guess it comes down to what one means by "observable".

Suppose the muon is created at event A and dies at event B. A clock carried by the muon is the only inertial clock that experiences both of these events. If I want to measure the time, then I have to say "Well, event A is simultaneous with event A' on my worldline, event B is simultaneous with event B' on my worldline, and my inertial clock (assuming that I am am inertial observer) measures the time difference between A' and B' to be ##\Delta t##."

But, since A and B are not on my worldine and thus not experienced by me, I have to use a simultaneity convention to do this.
 
  • #71
George Jones said:
I guess it comes down to what one means by "observable".

Suppose the muon is created at event A and dies at event B. A clock carried by the muon is the only inertial clock that experiences both of these events. If I want to measure the time, then I have to say "Well, event A is simultaneous with event A' on my worldline, event B is simultaneous with event B' on my worldline, and my inertial clock (assuming that I am am inertial observer) measures the time difference between A' and B' to be ##\Delta t##."

But, since A and B are not on my worldine and thus not experienced by me, I have to use a simultaneity convention to do this.
Hi George,

Ok but to me this sounds more like philosophy than science. A physical observable is something that can be measured and whose result can be used to test predictions from a mathematical formalism (e.g. Special Relativity). Orodruin says that only proper time is a physical observable, that the time measure in any other frame is not. I guess that distances are not physical observables according to that point of view since how can you measure a distance in a frame independent way? One cannot. Ok, so that leaves only one observable: proper time. So one can we test the Lorentz transformations?? I guess that according to that point of view, they are meaningless since they involve things that are not physical observables (like time intervals that are not proper time!). So I guess the equations are meaningless, we cannot even test them in a lab!
This is like saying that the x and y components of a vector (in introductory physics) are meaningless since the depend on the coordinate system used to measure them. It is true that there is a quantity with a deeper meaning here: the magnitude of the vector. But saying that the x and y components are not physical observables and they are meaningless is absurd since they can be measured and used to make predictions using equations from theory, predictions that can then be tested. If that was not true, all of experimental physics would be meaningless.
 
  • #72
Coordinates are not meaningless, but they are conventional. Once a convention has been established, it is very meaningful to ask, e.g., how coordinates transform.
 
  • #73
Andrew Mason said:
As I said, you lost me there. Sounds like GR. I am restricting my comments to SR and inertial reference frames.
But this limits your scope of SR. The full formalism and coordinate independence does not become clear until you realize you can use any coordinates. The light cone and Rindler coordinates are just two examples. The entire point with my question was that it is not GR for a particular choice of the scale factor, just hyperbolic coordinates on Minkowski space. The definition of simultaneity is also very reasonable (it is the one we like to use in cosmology!) and different from the typical definition by "equal time coordinate" in Minkowski coordinates.

Andrew Mason said:
I am really not sure what that means. I am quite certain a student in a one-week course on SR would feel the same.
Again, we have gone beyond that a long time ago. This conversation started with the claim that it is impossible to use the same units for space and time. I would also keep c if teaching at a lower level. I use c = 1 in my course at master level.

Andrew Mason said:
Dimensions, on the other hand, are physical.
Yes, but some things can have the same dimension without physically be the same thing.

George Jones said:
But, since A and B are not on my worldine and thus not experienced by me, I have to use a simultaneity convention to do this.
This. There is no way of measuring the muon "lifetime" without adhering to a simultaneity convention. A convention is not an observable and what you are actually measuring may also be encoded in a language which does not refer to a particular coordinate system. What you are really saying is "if I a set of Einstein synchronised clocks then the muon life time will be dilated with respect to these". Now to make this physically observable you will need to define a procedure for setting up such a system.
Andrew Mason said:
This is very much like the twin paradox: the muon ages less during its journey than the stationary observer on the earth, which is the reference frame in which the muon began and ended its journey.
No, it is not like the twin paradox. The fact that the twins meet up again is crucial for them to compare their clock measurements.

Andrew Mason said:
Such an event cannot happen if the muon's clock ran at the same rate as that of an inertial observer on the earth.
But this line of reasoning is only valid in the Earth rest frame and only with your typical definition of simultaneity! SR does not care about your simultaneity definition and neither does the muon. It only cares about the proper time of its world line.

Andrew Mason said:
The highly directional light from a synchrotron in the lab frame is omnidirectional in the rest frame of the electron that emits it. So I am not sure why the highly directional result would be the same in any other coordinate system.
You are obtaining this result by projecting what is happening onto an arbitrary three-dimensional subspace (which happens to be your lab frame). Again it is a result of your will to separate time and space when they are really a single entity. The synchrotron radiation world lines are something much more physical and they really can be described by any set of coordinates but will still be the same object. It is only a matter of your chosen coordinates, just like a vector which was originally pointing in the x-direction in one coordinate system may be pointing in the y-direction in another.

nrqed said:
You are saying that only invariant quantities are observables but this is plain wrong.
I am sorry, but this statement is simply false. You can only measure invariant things because regardless of what you measure you will be using a measuring device which will give you a number. You would measure your height by placing yourself at rest next to a length scale. Now, regardless of which frame I use to describe this, the same number is going to appear at the top of your head - the measurement is invariant and your coordinates therefore defined by your instrument. A non-invariant measurement would violate the principle of relativity. What is the x-component of a vector may be frame dependent, but how you measure the x-component in a particular frame is not - you place a ruler in the x-direction of that frame. Now this may be in a different direction in a different set of coordinates, but the measurement in that given direction is still the same and therefore invariant.

George Jones said:
Coordinates are not meaningless, but they are conventional. Once a convention has been established, it is very meaningful to ask, e.g., how coordinates transform.
Agreed. I am not arguing against the use of coordinates. I am only warning about ascribing a particularly significant meaning to any (arbitrary) set of coordinates. (This is what a lot of students have difficulties grasping in relativity!) The physics will be the same regardless of the coordinates, but coordinates are generally very important for making quantitative predictions.
 
  • #74
Andrew Mason said:
Good point. But I expect that the electron kinetic energy is small in comparison to the coulomb potential energy of the protons in the nucleus, let alone the nuclear potential energy of the neutrons and protons.

That's hardly a reason for adopting a worldview that ignores it. The kinetic energies of the constituents of a system make a contribution to the system's mass in the same way that their potential energy and masses do. Take the spring energy you mentioned earlier. Suppose you have two particles, each of mass ##m##, on opposite ends of a compressed spring. The rest energy of the system is ##2mc^2+\frac{1}{2}kx^2##.

Let the spring release so that the potential energy of the spring is converted into kinetic energy of the particles. Now the rest energy of the system is ##2 \gamma mc^2##. (Note that ##\gamma=(1-v^2/c^2)^{-1/2}##, where ##v## is the speed of each particle in the rest frame, or center-of-momentum frame).

Rest energy is a relativistic invariant, meaning all inertial observers will agree on its value. The same is true of the mass of the system. Before release it was ##2m+\frac{1}{2}kx^2/c^2##. After it's ##2 \gamma m##. Again, all inertial observers will agree on these values.

I think the central issue here is the modelling process that is physics. Mass, energy, and momentum are all part of the modelling process. They are inventions of the human mind, not something discovered like a geologist might do when finding a buried fossil. Humans define mass, energy, and momentum; so if humans can find ways to measure them all in the same units in ways that are consistent with their definitions, then there are no physical grounds on which to object.

The same is true of distance and time.

Of course we do not need to bring up all of these details when we teach. We bring in the ones we need to in the process of trying to satisfy the learning objectives we've set out for the students. I think it's easier, not harder, for students to understand that it's okay to measure distance and time in the same units.
 
Last edited:
  • #75
Orodruin said:
No, it is not like the twin paradox. The fact that the twins meet up again is crucial for them to compare their clock measurements.
But this line of reasoning is only valid in the Earth rest frame and only with your typical definition of simultaneity! SR does not care about your simultaneity definition and neither does the muon. It only cares about the proper time of its world line.
To be fair, I said it was like the twin paradox. A muon twin that remained at rest in the Earth frame would not survive to compare its clock with its traveling brother. That is the whole point. The twin would have expired. It would be an ex-muon!

AM
 
  • #76
Andrew Mason said:
To be fair, I said it was like the twin paradox. A muon twin that remained at rest in the Earth frame would not survive to compare its clock with its traveling brother. That is the whole point. The twin would have expired. It would be an ex-muon!

That's not like the twin paradox because it's a frame-dependent observation. What you describe is true in the Earth frame but observers in other frames will draw different conclusions about that same scenario. For example, in the traveling muon's frame it's the traveling muon that will decay first.

To make it like the twin paradox the two muons must start out in the same place at the same time. And then later be at the same place at the same time again. The difference in the proper times each experiences between departure and return will be invariant. All observers will agree on its value.
 
Last edited:
  • #77
Mister T said:
That's not like the twin paradox because it's a frame-dependent observation. What you describe is true in the Earth frame but observers in other frames will draw different conclusions about that same scenario. For example, in the traveling muon's frame it's the traveling muon that will decay first.

To make it like the twin paradox the two muons must start out in the same place at the same time. And then later be at the same place at the same time again. The difference in the proper times each experiences between departure and return will be invariant. All observers will agree on its value.
?? In the twin paradox the twin's do not have to meet. If the traveling twin arrives back at Earth and discovers that his twin died about a million years earlier has long since decayed it is still the twin paradox. The point is that the traveling twin lives longer than its stationary twin. All inertial observers would agree that the stationary twin died before the traveling twin.

AM
 
  • #78
Andrew Mason said:
In the twin paradox the twin's do not have to meet.
Yes they do, it is fundamental for the appearance of the "paradox".

Andrew Mason said:
All inertial observers would agree that the stationary twin died before the traveling twin.
This is just plain wrong if you adhere to your statement above. If the twins just go away from each other, who lives longer will be a frame dependent (or more generally, simultaneity convention dependent) statement.
 
  • #79
Orodruin said:
Yes they do, it is fundamental for the appearance of the "paradox".This is just plain wrong if you adhere to your statement above. If the twins just go away from each other, who lives longer will be a frame dependent (or more generally, simultaneity convention dependent) statement.
I disagree. One twin has to start its journey in the reference frame of the twin, then transition to a reference frame moving at a relativistic speed relative to the initial frame, and then transition back to the initial frame. The other twin has to remain in the initial frame. If that occurs, the transitioning twin will have experienced a proper time that is shorter than that of the stationary twin as measured by all observers.

Other observers may disagree on how much the age difference will be but they will all agree that the stationary twin was a dearly departed muon when its robust twin crashed into the muon detector on the Earth surface.

AM
 
  • #80
Andrew Mason said:
?? In the twin paradox the twin's do not have to meet. If the traveling twin arrives back at Earth and discovers that his twin died about a million years earlier has long since decayed it is still the twin paradox.

The twins have met in the sense that they are now at the same place at the same time. The muons don't have two meetings. A meeting simply means an event where both are in the same place at the same time. In the twin paradox there are two such events.

If you draw a spacetime diagram you can see that the world lines of the two twins intersect at two places whereas the muon's only intersect at most once. One twin switches inertial reference frames, neither muon does. Remedying this very misunderstanding is what led me to develop the lesson about spacetime geometry. It gives the students a way to visualize the effects of time dilation. The usual didactic explanations, however carefully and skillfully made, often have little impact on the students' comprehension of this most basic feature of relativity.
 
Last edited:
  • #81
Andrew Mason said:
The other twin has to remain in the initial frame. If that occurs, the transitioning twin will have experienced a proper time that is shorter than that of the stationary twin as measured by all observers

No, this is simply false. In particular, in the inertial frame where the "transitioning" twin was originally at rest, the "transitioning" twin will always be older as the "stationary" twin is always time dilated with respect to this frame but the "transitioning" twin is not until he makes the transition.
 
  • #82
Orodruin said:
No, this is simply false. In particular, in the inertial frame where the "transitioning" twin was originally at rest, the "transitioning" twin will always be older as the "stationary" twin is always time dilated with respect to this frame but the "transitioning" twin is not until he makes the transition.
I don't follow what you are saying. Are you saying that the space-time interval between the deaths of the twins is not time-like?

AM
 
  • #83
Andrew Mason said:
Are you saying that the space-time interval between the deaths of the twins is not time-like?

This depends on the exact setup, but in your case (one twin goes away and then stops and goes back to rest in the inertial frame of the staying twin and if the twins live to be the same proper age) yes. In the case when the twins meet up (before either dies), obviously no.

Let us do the actual maths. In the original inertial frame of the traveling twin, the world-line of the staying twin is given by ##x = vt##. The world line of the traveling twin is given by ##x = 0## for ##t < t_0## and ##x = v(t-t_0)## for ##t > t_0##. Let us say that the twins both live to have a proper age ##\tau##.

The death event of the staying twin will be given by ##t_1 = \tau \gamma## due to the time dilation and consequently also by ##x_1 = \tau\gamma v##.

The death event of the traveling twin will be given by ##t_2 = \gamma(\tau - t_0) + t_0## and therefore ##x_2 = v\gamma(\tau -t_0)##.

Now obviously ##t_2 < t_1## and so the traveling twin dies first in this frame. Looking at the space-time interval between the deaths, you will find that ##\Delta x = x_2 - x_1 = -v \gamma t_0## and ##\Delta t = (1-\gamma) t_0##. This implies that
$$
\Delta t^2 - \Delta x^2 = 2(1-\gamma) t_0^2,
$$
which is obviously negative. The space-time interval is therefore space-like.
 
  • Like
Likes Andrew Mason
  • #84
Ok. I agree that the space-time interval between the muon deaths is not time-like. So I take back my comment that all inertial observers would agree. If the twin muons were created in the LHC and one stayed at the original location while the other made a few circuits at .999c before stopping where it left his stationary twin, all observers would agree that the stationary twin was older. Thank-you for reminding me that one has to do the math - or at least a space-time diagram!

AM
 
  • #85
Andrew Mason said:
If the twin muons were created in the LHC and one stayed at the original location while the other made a few circuits at .999c before stopping where it left his stationary twin, all observers would agree that the stationary twin was older.

Note that stopping at the end (or starting at the beginning) is not even required. All that's required is that they be co-located at two separate times, and in between experience different amounts of proper time.
 
  • #86
Mister T said:
Note that stopping at the end (or starting at the beginning) is not even required. All that's required is that they be co-located at two separate times, and in between experience different amounts of proper time.
Yes. But if it kept going, the situation is symmetrical - it would have appeared to both the traveling muon and the stationary one that the other's clock was moving slower. By putting the traveling muon back in the original Earth reference frame that symmetry is broken. In other words, they both agree that, in the Earth frame, the traveling muon's clock ran slower.

AM
 
  • #87
Andrew Mason said:
Yes. But if it kept going, the situation is symmetrical -

Are you talking about the muon scenario that Orodruin analyzed or the LHC muon scenario you mentioned in your reply?

For the muon in the LHC just look at one lap. In the lab frame at time t = 0 the muon passes by you. In your frame of reference its next passage is at t = 90 μs, but the elapsed time in the muon frame is only 4 μs. Both of those events occur at the same place in each frame of reference, so the elapsed time between the events is a proper time in each frame. Two co-locations separated by different amounts of proper time. You stayed in the same reference frame the whole time, the muon was never in your reference frame at any time.
 
  • #88
Andrew Mason said:
In other words, they both agree that, in the Earth frame, the traveling muon's clock ran slower.

For a particular definition of simultaneity, which is no more physical than the xy-plane is in three dimensions. Neither will live to actually see the other's death.
 
Back
Top