Ben Niehoff said:
Defining a limit on a space of paths requires you to define a topology on the space of paths. This is what topology is all about.
I think there are perfectly sensible topologies on the space of paths in which a null path is the limit of a sequence of timelike paths. Or spacelike paths, for that matter. These topologies, like the underlying manifold topology, are not induced from the pseudo-Riemannian metric.
However, as you point out, the proper time along the path will approach zero as the path approaches the limit. Therefore proper time is not a good parameter along null paths (which we already knew).
Something that I realized that I don't know how to do is calculus of variations in the case where the derivative of the Lagrangian is undefined.
If we are looking for a parametrized path P(s) that extremizes the proper time, then we can cast this as a Lagrangian dynamics problem:
A = \int{L ds}
where the action A is interpreted as the proper time \tau, and the lagrangian L is interpreted as the expression \sqrt{g_{\mu \nu} \dfrac{dx^{\mu}}{ds} \dfrac{dx^{\nu}}{ds}}
Using the Euler-Lagrange equations of motion gives for the extremizing path:
\dfrac{d}{ds} \dfrac{\partial L}{\partial U^{\mu}} - \dfrac{\partial L}{\partial x^{\mu}} = 0
where U^{\mu} = \dfrac{d x^{\mu}}{ds}
For the particular choice of L = \sqrt{g_{\mu \nu} \dfrac{dx^{\mu}}{ds} \dfrac{dx^{\nu}}{ds}}, the equations of motion become:
\dfrac{\partial_{\lambda} g_{\mu \nu} U^{\lambda} U^{\nu}}{L} + \dfrac{g_{\mu \nu} \dfrac{dU^{\nu}}{ds}}{L}<br />
- \dfrac{g_{\mu \nu} U^{\nu} \dfrac{dL}{ds}}{L^{2}} - \dfrac{1}{2 L} \partial_{\mu} g_{\nu \lambda} U^{\nu} U^{\lambda} = 0
If we multiply through by L, this becomes:
\partial_{\lambda} g_{\mu \nu} U^{\lambda} U^{\nu} + g_{\mu \nu} \dfrac{dU^{\nu}}{ds}<br />
- g_{\mu \nu} U^{\nu} \dfrac{d \ log (L)}{ds} - \dfrac{1}{2} \partial_{\mu} g_{\nu \lambda} U^{\nu} U^{\lambda} = 0
Finally, if we assume an affine parametrization, so that L is constant along the path, then this simplifies to:
\partial_{\lambda} g_{\mu \nu} U^{\lambda} U^{\nu} + g_{\mu \nu} \dfrac{dU^{\nu}}{ds}<br />
- \dfrac{1}{2} \partial_{\mu} g_{\nu \lambda} U^{\nu} U^{\lambda} = 0
which is equivalent to the usual geodesic equation. The problem, mathematically, is that the manipulations only make sense if L is nonzero. If L is zero, then all the equations (with L in the denominator) are undefined.
For this reason, it seems to me that one needs to assume during the derivation that L is small, but nonzero. So I don't see how you can actually get a null geodesic this way, except possibly as a limiting case.