How can we use differential geometry to improve our understanding of SR?

Fredrik · May 11, 2010

When I say "SR" in this post, I mean the set of classical and quantum theories of particles and fields in Minkowski spacetime.

I'm trying to come up with a list of topics in SR that can be dealt with in a better way when we have defined Minkowski spacetime as a manifold instead of as a vector space (or an affine space). Maybe there aren't that many? The ones I can think of right away are

Born rigidity (I've seen a definition that uses Lie derivatives, and I haven't seen one that would work with a spacetime that doesn't have a manifold structure)
A classification of types of fields we might be interested in. (Sections of various vector bundles over Minkowski spacetime).
The solid rotating disc, if we need to analyze it much more deeply than anyone wants to (except one person I know )
Definitions of measurable quantities that we'd prefer to be explicitly coordinate independent (i.e. proper time).
A coordinate independent definition of geodesics and inertial motion.

That's pretty much it. Maybe the Lagrangian/Hamiltonian stuff is more natural in this context too? Let me know if you can think of other stuff that you think is easier to explain or can be treated in a more general or more elegant way in the "manifold version" of the theory than in the "vector space version". Also let me know if you think the stuff I've mentioned can be handled just as well in a vector space.

0mega · May 13, 2010

I would add to Your list yet
geometry in non-inertial frame of reference.

Fredrik said:

[*]The solid rotating disc, if we need to analyze it much more deeply than anyone wants to (except one person I know )

Very curious. Whom do you mean? Maybe You give a link?

dx · May 13, 2010

Fredrik said:

The solid rotating disc, if we need to analyze it much more deeply than anyone wants to

I'm not sure exactly what you mean by "manifold version" and "vector space version" of SR. Would the following argument about the rotating disc be part of the manifold version?

If we work in a Minkowski reference frame, then the points of the disc have worldlines parametrized by circular coordinates r and θ:

[tex] \gamma (r, \theta) : \mathbb{R} \rightarrow \mathbb{R}^3 [/tex]

with

[tex] t \rightarrow( r \cos (\theta + \omega t), r \sin (\theta + \omega t),t) [/tex]

The problem, presumably, is whether there is some kind of curvature here. There is no curvature of spacetime of course, but if we define the distance between two nearby worldlines by their distance in an instantaneous rest frame, then the metric on the (r,θ)-space of worldlines is

[tex] \mathcal{D} = dr \otimes dr + \frac{r^2}{1 - r^2 \omega^2} d\theta \otimes d\theta [/tex]

Fredrik · May 13, 2010

dx said:

I'm not sure exactly what you mean by "manifold version" and "vector space version" of SR.

The way I see it, SR is defined by axioms like "A clock measures the proper time of the curve in spacetime that represents its motion". We can choose to define "spacetime" as a vector space, as an affine space, or as a smooth manifold...or as some combination, like an affine space that also has manifold structure. Since different choices of what mathematical structure to use give us different axioms, and since I consider a theory to be defined by its axioms, I would say that each choice defines a different theory. They're obviously equivalent in the sense that they make the same predictions about the results of experiments, but they're still different theories, or at least different versions of "the" theory.

dx said:

Would the following argument about the rotating disc be part of the manifold version?

Yes, because in the vector space version of the theory, there's no metric tensor field, only a bilinear form on Minkowski spacetime, and we don't even define the tangent spaces at different points, so there's no way to define the components of the bilinear form in a coordinate system that can't be constructed from a basis of the "Minkowski vector space" in the obvious way.

It's good that you made me think about this, because now I understand Omega's comment much better. If we define spacetime as a vector space with a bilinear form instead of as a manifold with a metric, it really limits our ability to work with arbitrary coordinate systems (i.e. arbitrary functions from spacetime to [tex]\mathbb R^4[/tex]).

dx said:

The problem, presumably, is whether there is some kind of curvature here. There is no curvature of spacetime of course, but if we define the distance between two nearby worldlines by their distance in an instantaneous rest frame, then the metric on the (r,θ)-space of worldlines is

[tex] \mathcal{D} = dr \otimes dr + \frac{r^2}{1 - r^2 \omega^2} d\theta \otimes d\theta [/tex]

So I've been told.

What I haven't been able to figure out is why this is interesting. We clearly don't need to define a quotient manifold and study its properties if we're only trying to see e.g. that the material will stretch when we give the disc a spin. So are we doing this just because it's cool, or is the quotient manifold important in some other way? If you or someone else could enlighten me about that (without forcing me to learn everything about the quotient manifold first), I'd appreciate it.

atyy · May 13, 2010

But even in the "vector space" conception, wouldn't one need to make use of a 4-velocity at some point, which surely brings in the tangent space immediately - so can one really do SR without the idea of a manifold with metric?

Hurkyl · May 13, 2010

It's sort of an awkward question; AFAIK pretty much all of the tools of differential geometry were developed for real vector spaces first.

e.g. the tangent bundle to V is nothing more than VxV; a section is (the graph of) a continuous function V -> V.

Fredrik · May 13, 2010

atyy said:

But even in the "vector space" conception, wouldn't one need to make use of a 4-velocity at some point, which surely brings in the tangent space immediately - so can one really do SR without the idea of a manifold with metric?

A world line is a function [itex]x:[a,b]\rightarrow M[/itex] from an interval of the real numbers into spacetime. If we have given spacetime a vector space structure, we can make sense of

[tex]x'(t)=\lim_{s\rightarrow 0}\frac{x(t+s)-x(t)}{s}[/tex]

so we don't need to define tangent vectors the hard way (as derivative operators or as equivalence classes of curves). We don't even have to define a tangent space. The four-velocity is just defined as the derivative above, with the appropriate normalization.

dx · May 14, 2010

Fredrik said:

The way I see it, SR is defined by axioms like "A clock measures the proper time of the curve in spacetime that represents its motion". We can choose to define "spacetime" as a vector space, as an affine space, or as a smooth manifold...or as some combination, like an affine space that also has manifold structure.

I don't think we have a choice in definition of SR this way. All the structures are necessary and are assumed at a basic level. No matter how you choose to axiomatize the theory, I think the following must always be a basic assumption: "There exist mappings of events into the real 4-dimensional topological vector/affine space and smooth manifold R⁴" The necessity of such an assumption is shown in the way it is used in defining the basic elements of the theory. For example, the notion of an 'inertial frame' is essentially the following: a mapping from events into Minkowski space which sends 'free particles' to straight lines, and such that 'clock rates' respect the affine structure of Minkowsi space. So the next postulate would have to be that such mappings exist, i.e. that inertial frames exist. Starting from this foundation, we can, in the manner of mathematicians, prove the following theorem:

Theorem. Let φ and φ' be inertial frames. Then φ'⋅φ^-1 is an affine transformation, i.e. the transformations between inertial frames are affine transformations.

Once we add the conformal structure derived from light cones, the class of symmetries reduces from general affine to Poincare.

This statement is, of course, not a mathematical one. The domains of the mappings φ and φ' are not defined mathematically. 'Events' are assumed to be something given in experience, and the idea can only be communicated by demonstration, not definition. Basically, our language is always a little part of our theory, and this statement must be interpreted as a linguistic contruct, which describes the situation.

Fredrik said:

So I've been told. What I haven't been able to figure out is why this is interesting.

By 'quotient manifold', I assume you're talking about the manifold of worldlines parametrized by r and θ. The quotient manifold is important because it is the space which is curved. D = dr² + r²dθ²/(1 - r²ω²) is the induced geometry of a class of spatial hyperslices D_t (t in R), i.e. the 3-dimensional intersections of the world-tube of the disc with the set of space-slices of an inertial frame. Surely, this is the central object of discussion in the problem of the rotating disc in special relativity?

Fredrik · May 14, 2010

dx said:

I don't think we have a choice in definition of SR this way. All the structures are necessary and are assumed at a basic level. No matter how you choose to axiomatize the theory, I think the following must always be a basic assumption: "There exist mappings of events into the real 4-dimensional topological vector/affine space and smooth manifold R⁴"

I agree that I can't just say that it's "a vector space". Without a topology, we can't make sense of limits, continuity and derivatives like the one in my previous post. But how about this? We define "spacetime" as the set [tex]\mathbb R^4[/tex] with the standard topology, the standard vector space structure, and the bilinear form g defined by [itex]g(x,y)=x^T\eta y[/itex]. We then define coordinate systems as smooth bijections from [tex]\mathbb R^4[/tex] into itself. (The topological vector space structure is sufficient to make sense of "smooth"). Some of the coordinate systems are associated with bases of the vector space in the following way: For each basis [itex]B=\{e_\mu\}[/itex], we define a coordinate system [itex]f_B[/itex] by [itex]f_B(x)=(x_0,x_1,x_2,x_3)[/itex], where the [itex]x_\mu[/itex] are defined by [itex]x=x_\mu e_\mu[/itex]. Since I'm not going to be talking about tensors, I'm putting all indices downstairs. All inertial frames belong to this class. Technically it's the inverses of my [itex]f_B[/itex] functions that should be called "frames", but I'll stick with the standard abuse of terminology and refer to the "nice" coordinate systems as "inertial frames".

dx said:

For example, the notion of an 'inertial frame' is essentially the following: a mapping from events into Minkowski space which sends 'free particles' to straight lines, and such that 'clock rates' respect the affine structure of Minkowsi space. So the next postulate would have to be that such mappings exist, i.e. that inertial frames exist. Starting from this foundation, we can, in the manner of mathematicians, prove the following theorem:

Theorem. Let φ and φ' be inertial frames. Then φ'⋅φ^-1 is a Poincare transformation, i.e. the transformations between inertial frames are Poincare transformations.

With the definitions I made above, we can define inertial frames mathematically, as coordinate systems that take straight lines to straight lines. (The concept of straight lines is already well-defined by the vector space structure). We can then prove as a theorem that the set of functions of the form [itex]\phi'\circ\phi^{-1}[/itex], where [itex]\phi,\phi'[/itex] are inertial frames, is a group with "multiplication" defined as composition of functions. The identity element is the identity map. We call this group the Poincaré group, and its members Poincaré transformations.

There are still some issues that must be dealt with when we write down our complete set of axioms for the theory. (The axioms are statements that identify things in the real world with things in the mathematical model. The complete list of axioms is what defines the theory). For example, we seem to need an axiom that identifies zero acceleration in the real world with straight lines in the theory, so now we have to think about how to define "accelerometer" operationally. The language used in the axioms may depend slightly on what mathematical structure we have chosen (topological vector space or manifold), but neither of the choices seems to make any of the issues significantly harder to deal with than the other.

dx said:

This statement is, of course, not a mathematical one. The domains of the mappings φ and φ' are not defined mathematically. 'Events' are assumed to be something given in experience, and the idea can only be communicated by demonstration, not definition. Basically, our language is always a little part of our theory, and this statement must be interpreted as a linguistic contruct, which describes the situation.

This confused me at first, but I think I know what you mean now. When you talk about spacetime, events and the domains of those mappings, you're talking about things in the real world, right? That's not how I think about these things. I prefer to have a clear separation between the real world and the mathematics, e.g. no "functions" that take events in the real world to points in [itex]\mathbb R^4[/itex]. I agree that things in the real world can only be "defined" by descriptions in plain English or whatever language you prefer, but when I talk about spacetime, events, coordinate systems, etc, I'm always referring to mathematical concepts.

dx said:

By 'quotient manifold', I assume you're talking about the manifold of worldlines parametrized by r and θ.

Yes, that's what I meant.

dx said:

The quotient manifold is important because it is the space which is curved. D = dr² + r²dθ²(1 - r²ω²) is the induced Minkowski geometry of a class of spatial hyperslices D_t (t in R), i.e. the 3-dimensional intersections of the world-tube of the disc with the set of space-slices of an inertial frame. Surely, this is the central object of discussion in the problem of the rigid rotating disc in special relativity?

The thing is, I don't see how any of that is relevant to anything. I have to admit that I have never studied these aspects of the rotating disc problem in detail, but the reason is that I have never been able to find a reason why I should. This manifold seems completely irrelevant to me, and I'm wondering if it's only used by people who just think the math is cool and people who incorrectly think that we need it to solve problems that we can solve without it.

0mega · May 14, 2010

dx said:

The quotient manifold is important because it is the space which is curved.

IMHO space is flat. While I do not want to say more. I think Demistifier support.

Fredrik · May 14, 2010

0mega said:

IMHO space is flat. While I do not want to say more. I think Demistifier support.

If we define "space" as a hypersurface of constant time coordinate in the rotating coordinate system, then you're right, but that's not the manifold we're talking about. We're talking about defining a manifold structure on the set of world lines (of points in the disc), and that manifold is curved. Note that we're not talking about a submanifold of spacetime.

atyy · May 15, 2010

So in the manifold version, the metric acts on tangent vectors and the spacetime interval is defined by integrating over a straight worldline. In the vector space version, the spacetime interval is obtained by the scalar product acting on position vectors directly, no need to go to tangent vectors and integration. Is that consistent with how you are defining the two versions?

Fredrik · May 15, 2010

atyy said:

So in the manifold version, the metric acts on tangent vectors and the spacetime interval is defined by integrating over a straight worldline. In the vector space version, the spacetime interval is obtained by the scalar product acting on position vectors directly, no need to go to tangent vectors and integration. Is that consistent with how you are defining the two versions?

Except for the "straight worldline" part, yes. We have to define the proper time of an arbitrary timelike curve and the proper length of an arbitrary spacelike curve in both versions. The definitions of proper time are explained here. See #8 for the topological vector space version and #4 for the manifold version.

The main limitation of the topological vector space version is that we can't define the components of the metric in a coordinate system that isn't associated with a basis for the vector space as described in my reply to dx above.

0mega · May 15, 2010

Fredrik said:

If we define "space" as a hypersurface of constant time coordinate in the rotating coordinate system, then you're right, but that's not the manifold we're talking about. We're talking about defining a manifold structure on the set of world lines (of points in the disc), and that manifold is curved. Note that we're not talking about a submanifold of spacetime.

I am surprised that You agreed.

Usually begin to argue.
Yes. The standard definition of simultaneity leads to a curvature of space.
Geometry IMHO is a conditional concept.

dx · May 15, 2010

Fredrik said:

This confused me at first, but I think I know what you mean now. When you talk about spacetime, events and the domains of those mappings, you're talking about things in the real world, right? That's not how I think about these things. I prefer to have a clear separation between the real world and the mathematics, e.g. no "functions" that take events in the real world to points in [itex]\mathbb R^4[/itex]. I agree that things in the real world can only be "defined" by descriptions in plain English or whatever language you prefer, but when I talk about spacetime, events, coordinate systems, etc, I'm always referring to mathematical concepts.

I don't think we can avoid using a phrase like "one can find a map of events into the linear space R⁴ such that the world-lines of free particles are straight lines", even though 'event' and 'free particle' are not defined in the theory. This is because there is real physical content in this: imagine a picture of two particles coming in; they first come into contact at point A, and then again at point B, and then continue without interacting. Using the above postuate we can conclude that these particles cannot be free particles. This is because there is no continuous map from this set of events into R⁴ that will straighten out both of the trajectories, as is required by the postulate above. We need similar postulates about clocks and light.

Fredrik said:

But how about this? We define "spacetime" as the set [tex]\mathbb R^4[/tex] with the standard topology, the standard vector space structure, and the bilinear form g defined by [itex]g(x,y)=x^T\eta y[/itex]. We then define coordinate systems as smooth bijections from [tex]\mathbb R^4[/tex] into itself. (The topological vector space structure is sufficient to make sense of "smooth"). Some of the coordinate systems are associated with bases of the vector space in the following way: For each basis [itex]B=\{e_\mu\}[/itex], we define a coordinate system [itex]f_B[/itex] by [itex]f_B(x)=(x_0,x_1,x_2,x_3)[/itex], where the [itex]x_\mu[/itex] are defined by [itex]x=x_\mu e_\mu[/itex]. Since I'm not going to be talking about tensors, I'm putting all indices downstairs. All inertial frames belong to this class. Technically it's the inverses of my [itex]f_B[/itex] functions that should be called "frames", but I'll stick with the standard abuse of terminology and refer to the "nice" coordinate systems as "inertial frames".

Here's how I would define "manifold SR": First, we start with the mathematical structure, the real linear space R⁴. Then we represent 'free particles', 'events', 'clocks', 'light', and postulate the existence of maps such that free particles are straight lines etc. We introduce the idea of inertial frame as a map of events into R⁴ that respects all the postulates about clocks etc. above. Now the set of maps has been greatly reduced. In fact, it restricts it such that the transformation between any two such reference frames is a linear transformation. If we further restrict the maps by taking into account the 'constancy of the speed of light' in the definition of an inertial frame, i.e. in precise language, the conformal structure of spacetime encoded in the properties of light (from postulates), then we are left with exactly what is required: Lorentz tranformations (of if we had been a little more general and used affine above, Poincare.) So although people usually say Einstein postulates are not well defined and so on, the thing about the constancy of the velocity of light if properly interpreted does contain special relativity. The clock behavior implied by the formula s² = t² - x² - y² - z² is the one which has the correct symmetry (Lorentz symmetry).

This is all at a basic level. For practical purposes, I don't think we can avoid using tensors in their modern guise. Inertial frameins, in practice are presented as follows: We have four functions t, x, y, z : R⁴ → R. From these we can contruct the objects ∂_t, ∂_x, ∂_y, ∂_z, dt, dx, dy, dz (at each point of R⁴). Using these objects, we can represent the clock behavior in the Minkowski metric tensor field

[tex] N = \eta_{\mu \nu} dx^{\mu} \otimes dx^{\nu}[/tex]

Fredrik said:

I'm wondering if it's only used by people who just think the math is cool and people who incorrectly think that we need it to solve problems that we can solve without it.

I don't think its necessary to solve anything; in fact, I don't really consider this issue about the geometry of the disc relevant or interesting (unless we discuss it in the context of general relativity.) The metric on this quotient space was defined in a specific way, i.e. by constructing it locally in an intantaneous rest frame moving with the worl-lines at that point. This has nothing to do with any curvature of the rigid disc as it appears in space. The assumption that it is rigidly rotating is essentially equivalent to saying that it appears as a rigidly and uniformly rotating disc in your reference frame. This notion of rigidity is not in any way in harmony with SR, I think. Maybe there would be some interesting features if we try to relate the definition of the metric to born rigidity.

Troponin · May 16, 2010

0mega said:

I am surprised that You agreed. Usually begin to argue.
Yes. The standard definition of simultaneity leads to a curvature of space.
Geometry IMHO is a conditional concept.

Doesn't Weinberg have a book on gravitation where he tries to de-emphasize the geometric view?
I think he says something like "I believe the geometric view has driven a wedge between GR and the theory of elementary particles."

I don't know how relevant that is with what you're saying, but I remember reading that in the preface.
(However...when looking through the book, I couldn't find much difference in pedagogy in Weinberg's book over any other GR book. The only thing I remember finding somewhat different is that he had a few chapters on different topics of GR, including discussion of g^uv, affine connections, and gravitational forces in GR before his chapter on tensor analysis)

Passionflower · May 16, 2010

Fredrik said:

We're talking about defining a manifold structure on the set of world lines (of points in the disc), and that manifold is curved.

Why do you think it is curved?

It is true that a rotating disk is not Born rigid and its clocks are not synchronized and neither is there a single rest frame that shows all circumnavigated points on the disk to come back to the same spatial location.

But all that does not imply the spacetime is curved.

Fredrik · May 16, 2010

dx said:

I don't think we can avoid using a phrase like "one can find a map of events into the linear space R⁴ such that the world-lines of free particles are straight lines", even though 'event' and 'free particle' are not defined in the theory.

We're probably thinking along pretty similar lines, but I don't like your terminology. A "map" is a mathematical concept, with a very precise definition, and it hurts my eyes to see it used this way. I would define "spacetime" as a specific mathematical structure (either a topological vector space or a smooth manifold). I would define proper time as a mathematical property of a timelike curve in spacetime. I would also define "inertial frame" and "Poincaré group" mathematically. I would define "events", "clocks", and so on, operationally, i.e. by descriptions in plain English. Then I can write down the axioms of the theory. Maybe the term "axiom" hurts someone else's eyes, but I haven't thought of a better one. These "axioms" tell us what pieces of the mathematics that correspond to the operationally defined things in the real world. They would look roughly like this: (I'm quoting myself from one of the rotating disc threads).

Fredrik said:

1. Physical events are represented by points in Minkowski spacetime. (A consequence of this is that motion is represented by curves, and this suggests the definition of a "particle" as a system the motion of which can be represented by exactly one curve).
2. A clock measures the proper time of the curve in Minkowski spacetime that represents its motion.
3. A radar device measures infinitesimal lengths in the following way: If the roundtrip time is T, then cT/2 is the approximate proper length of the spacelike geodesic from the midpoint of the timelike geodesic through the emission event and the detection event to the reflection event. The approximation becomes exact in the limit T→0. (I haven't found a way to say this that isn't really awkward).

Actually these just define a framework in which we can define classical special relativistic theories of matter and interaction (in several different ways). I won't go into details about those things here. Note that I could have chosen to define 3 in a different way:

3'. A radar device moving as represented by a timelike geodesic measures lengths in the following way: If the roundtrip time is T, then cT/2 is the proper length of the spacelike geodesic from the midpoint of the worldline between the emission event and the detection event to the reflection event.

With 3', we have a theory that's at least as worthy of the name "special relativity" as anything Einstein could have written down in 1905, but it doesn't make any prediction at all about the circumference of the disc in the rotating frame. This theory simply doesn't tell us how to make measurements with non-inertial measuring devices. This is of course exactly why we should prefer 3 over 3'.

I don't see any reason to think that it matters if I choose to define spacetime as a topological vector space or as a manifold. Different choices require slightly different definitions of proper time, but the axioms would be the same except that I should probably replace "geodesic" with "straight line" in the topological vector space version. And these two theories, or two versions of the same theory if that sounds better, still make the same predictions about results of experiments.

I probably have to add one more axiom though, one that identifies straight lines with non-accelerating motion. This would require an operational definition of "accelerometer", but then we can just say that the motion of an accelerometer that reads zero is represented by a timelike straight line in spacetime (if we chose to define spacetime as a topological vector space) or by a timelike geodesic in spacetime (if we chose to define spacetime as a manifold).

We don't need anything more about inertial frames, invariance of the speed of light, and so on, because all of that stuff is already included in the definition of inertial frame, and we just needed this last axiom to relate it to reality.

dx said:

I don't think we can avoid using tensors in their modern guise.

We can define tensors in the topological vector space version too. A tensor would be a multilinear map from M*×...×M*×M×...×M into the real numbers, and a tensor field would be a tensor-valued function. So even if we need tensors, it doesn't imply that we need manifolds.

dx said:

I don't think its necessary to solve anything;
...
Maybe there would be some interesting features if we try to relate the definition of the metric to born rigidity.

I think this is an occasion where differential geometry is nice. Without it, we can prove "locally" (i.e. by considering an infinitesimally small region of the disc) that we can't give the disc a spin without stretching the material, but I think "global" proofs are hard. (Is there even a global definition of Born rigidity that doesn't use differential geometry?). If we use techniques from differential geometry, and a differential geometry definition of Born rigidity, a "global" proof is definitely possible. But we don't need the quotient manifold for that (as far as I know), and I'm thinking that if we don't even need it for that, then what do we need it for? I suspect that we don't need it for anything.

Fredrik · May 16, 2010

Passionflower said:

Why do you think it is curved?

Because people have told me so (and because the circumference of the disc in this manifold isn't 2*pi*r). I haven't even bothered to find out how the metric of spacetime induces a metric on this manifold, because I haven't found a single reason to think this manifold is significant.

Passionflower said:

But all that does not imply the spacetime is curved.

Please read what I said again. I wasn't talking about spacetime or about a submanifold of spacetime.

Passionflower · May 16, 2010

Fredrik said:

Because people have told me so (and because the circumference of the disc in this manifold isn't 2*pi*r).

That is related to the difficulty for a single rest frame to interpret what a complete circumnavigation for all outer points of the disk actually means.

Fredrik · May 16, 2010

Passionflower said:

That is related to the difficulty for a single rest frame to interpret what a complete circumnavigation for all outer points of the disk actually means.

I know.

dx · May 17, 2010

Fredrik said:

We're probably thinking along pretty similar lines, but I don't like your terminology. A "map" is a mathematical concept, with a very precise definition, and it hurts my eyes to see it used this way. I would define "spacetime" as a specific mathematical structure (either a topological vector space or a smooth manifold). I would define proper time as a mathematical property of a timelike curve in spacetime. I would also define "inertial frame" and "Poincaré group" mathematically. I would define "events", "clocks", and so on, operationally, i.e. by descriptions in plain English. Then I can write down the axioms of the theory. Maybe the term "axiom" hurts someone else's eyes, but I haven't thought of a better one. These "axioms" tell us what pieces of the mathematics that correspond to the operationally defined things in the real world. They would look roughly like this: (I'm quoting myself from one of the rotating disc threads).

So, in your language, 'event' would be operationally defined. This will have a 'representation' in the mathematical structure called Minkowski space.

Just focusing on free particles for now, and assuming they are 'operationally defined' or 'whose meaning has been demostrated', how do we incorporate this in Minkowski space? What I've been saying is, we postulate "There exists a map of events into Minkowski space such that the worldlines of free particles are straight lines". How would you say this in your language, without referring to 'maps', but still containing the same content (including the example that I included in my previous post)?

Fredrik · May 17, 2010

dx said:

So, in your language, 'event' would be operationally defined. This will have a 'representation' in the mathematical structure called Minkowski space.

I still don't know the best way to say this, but what I have in mind is that we have an intuitive concept of what space, time and events are, and we define the structure we call Minkowski space or Minkowski spacetime, to be the mathematical representation of these things. I'm not sure how I'd like the word "event" to be defined, i.e. if I want it to be something in the real world, or a point in Minkowski spacetime. I think I want to use the same word to mean two different things, even though I usually try very hard to avoid doing that.

Maybe I should even drop the word "event" completely from what I just said, and instead just say that Minkowski spacetime is the mathematical representation of space and time in the real world. Then we can reserve the term "event" for the times and places in the real world where something is actually happening. This always corresponds to an intersection of at least two world lines in the mathematical model.

The problem with that terminology is that then there are "more" events in the model than in the real world. (Hm, to make sense of "more", we have to think about bijections and stuff, so I guess it's not so easy to completely avoid that "map" talk after all). This "problem" is probably not very significant, because of what I'm saying below.

dx said:

Just focusing on free particles for now, and assuming they are 'operationally defined' or 'whose meaning has been demostrated', how do we incorporate this in Minkowski space? What I've been saying is, we postulate "There exists a map of events into Minkowski space such that the worldlines of free particles are straight lines". How would you say this in your language, without referring to 'maps', but still containing the same content (including the example that I included in my previous post)?

I think I don't want to define things like "particle" operationally. Hm, I think we only need operational definitions for the measuring devices. Think of it this way: We define all the mathematics without making any connection whatsoever to the real world. Then we define a theory by writing down a list of instructions that tells us how to interpret the mathematics as predictions about results of experiments. These instructions will contain words like "clock", which can only be defined operationally, but they won't mention "particles".

An "operational" definition of a measuring device is essentially a set of instructions that tells us how to build one. To understand such instructions, you must already have an intuitive understanding of some of the properties of space and time. It would be nice to have an operational definition of those terms, but such a definition clearly can't mention measuring devices, since that would make these definitions circular. So there's only one way to "define" the terms "space" and "time" that must be understood before we can understand the operational definitions of measuring devices: By a reference to human senses and conscious experience.

We probably shouldn't be calling it a "definition" though. I've seen the term "elucidation" used in a similar context, so let's use that. The reference to human senses is an elucidation (i.e. a remark that clarifies a point on an intuitive level) of the operational definition of measuring devices.

I'm glad that you're making me think about these things.

I want to see them completely worked out, either by me or by someone else. The same thing goes for this thread. (I actually spent a lot of time working on the post I said I would write in #15, but I didn't have the energy to complete it. I will definitely write a good explanation of the points I was trying to make there some time, but it's probably not going to be really soon. And now I can't find the text I wrote back then.

Maybe I accidentally deleted it).

atyy · May 18, 2010

Fredrik said:

It would be nice to have an operational definition of those terms, but such a definition clearly can't mention measuring devices, since that would make these definitions circular.

But is there anything wrong with being circular, as a matter of principle? Is being circular different from being self-consistent?

Fredrik · May 18, 2010

atyy said:

But is there anything wrong with being circular, as a matter of principle? Is being circular different from being self-consistent?

It's not nearly as bad as an inconsistency, but it's certainly undesirable. I think we can define "inconsistent" as "it's possible to derive the negation of one of the axioms from the others". This would obviously be disaster, but I think it's also pretty bad to have a situation where "it's possible to derive one of the axioms from the others". Then we have an axiom that adds nothing to the theory, and I would certainly prefer to remove it. To include it would give people the impression that we either believe that it adds something significant, or don't understand that it doesn't.

atyy · May 18, 2010

Fredrik said:

It's not nearly as bad as an inconsistency, but it's certainly undesirable. I think we can define "inconsistent" as "it's possible to derive the negation of one of the axioms from the others". This would obviously be disaster, but I think it's also pretty bad to have a situation where "it's possible to derive one of the axioms from the others". Then we have an axiom that adds nothing to the theory, and I would certainly prefer to remove it. To include it would give people the impression that we either believe that it adds something significant, or don't understand that it doesn't.

Well, I'm thinking of something like: What's an a charged particle? Something that responds to an electric field. What's an electric field? Something that makes a charged particle move.

I like this MTW claim: Here and elsewhere in science, as stressed not least by Henri Poincare, that view is out of date which used to say, "Define your terms before you proceed." All the laws of physics, including the Lorentz force law, have this deep and subtle character, that they both define the concepyts they use and make statements about these concepts. Contrariwise, the absence of some body of theory, law and principle deprives one of the means properly to define or even to use concepts. Any forward step in human knowledge is truly creative in this sense: that theory, concept, law, and method of measurement - forever inseparable - are born into the world in union.

Fredrik · May 19, 2010

atyy said:

Well, I'm thinking of something like: What's an a charged particle? Something that responds to an electric field. What's an electric field? Something that makes a charged particle move.

I would define all of those things in the mathematical model and leave the real-world concepts undefined. I don't think I like that MTW claim.

atyy · May 19, 2010

Fredrik said:

I would define all of those things in the mathematical model and leave the real-world concepts undefined. I don't think I like that MTW claim.

I vacillate back and forth about this point. Let's say you want to find the Higgs boson, and you build the LHC. Clearly, to do the experiment, one needs the concept of "Switzerland" - otherwise experimentalists are not going to find their way to the experiment. According to MTW, "Switzerland" would be a concept definable from the Standard Model. At first that seems preposterous. On the other hand, we do have a vague way of going from the Standard Model to relativistic quantum mechanics to quantum mechanics to classical mechanics to "Switzerland" ...

Fredrik · May 20, 2010

Yes, clocks, particle detectors and Switzerland all consist of particles of the types that the SM makes predictions about. But that doesn't mean that we have to use that fact when we define our theories (or that we can use it as an excuse to not define anything, or to use circular definitions). The "clocks" that we have to mention in the axioms of SR are defined by instructions on how to build them, and the terms in those instructions are defined by human language, human senses, and conscious experience. The last two of those have to be considered primitives, i.e. things we don't define. (That probably goes for some things in the language as well). The fact that we have to resort to talking about human language, senses and experiences is annoying as hell, but there's no way around it. All we can do is to try to make sure that these things don't introduce any more ambiguities into the theory than we can tolerate.

How can we use differential geometry to improve our understanding of SR?

What is special relativity (SR)?

What is differential geometry?

How does SR use differential geometry?

What is the difference between special relativity and general relativity?

What are some real-world applications of SR and differential geometry?

Similar threads

Hot Threads

Recent Insights