A Dirac's "GTR" Eq (27.4): how momentum ##p^\mu## varies

Kostik
Messages
250
Reaction score
28
TL;DR Summary
Dirac derives Einstein's equation for a dust ##G^{\mu\nu}=-8\pi \rho v^\mu v^\nu## from the principle of stationary action using the appropriate Lagrangian for the gravitational field and a dust. First, he "infers" an important relation between an arbitrary variation of an element of the matter field and the resultant change in the momentum vector density. How to derive this rigorously?
In Dirac's "General Theory of Relativity", chapters 26-30, he builds up various action principles from which Einstein's equation ##G^{\mu\nu}=-8\pi T^{\mu\nu}## can be obtained. In chapter 27, he extends the result of chapter 26 (Einstein's vacuum equation) to the case of a dust, where $$T^{\mu\nu}=\rho v^\mu v^\nu \,\, .$$ Here, ##\rho## is the mass density in the MCRF ("rest frame") of a fluid element of the dust.

The momentum density is $$p^\mu = \rho v^\mu \sqrt{-g} \, .$$ Dirac makes arbitrary changes ##b^\mu## in the position of the dust, and uses his Eq. (27.4) $$\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu )_{,\nu} \qquad\qquad(*)$$ to find the corresponding change in ##p^\mu##.

Using ##(*)##, Dirac does indeed derive Einstein's field equation and the geodesic equation for the motion of the matter. I'm not asking about that, and won't mention the details; Dirac's derivation is straightforward and easy.

My question is about ##(*)##, his Eq. (27.4). He infers this, but does not derive it. I am attaching below my notes, in which I had hoped to derive ##(*)##. (A clearer MSWord doc is also attached.) I start by considering the dust as a collection of individual matter particles. The first result is to show that the velocity field ##v^\mu## is divergenceless, i.e., ##v^\mu_{\,\, ,\mu}=0## (my Eq. (27.5)). Using this fact and ##p^\mu_{\,\, ,\mu}=0##, I obtain the desired result.

Unfortunately, if both ##v^\mu_{\,\, ,\mu}=0## and ##p^\mu_{\,\, ,\mu}=0##, then the proportionality factor ##\rho \sqrt{-g}## must be constant. But ##\rho \sqrt{-g}## definitely is not a constant.

In fact, ##v^\mu_{\,\, ,\mu}=0## cannot be right -- this would only be true if ##\rho## were constant, which again is not the case. The only conservation law is ##p^\mu_{\,\, ,\mu}=0##.

Is my entire derivation pure rubbish? I have a feeling that it's not. I must be missing something; something connected with the fact that this is a dust, and I am working with a sum over individual particles (which can be supposed to have mass ##m##). Somewhere, I am not using the full power of the delta functions in the derivation.

I think the fact that spacetime may be curved, hence the correct conservation law is ##p^\mu_{\,\, ,\mu}= (\rho v^\mu \sqrt{-g})_{\,\, ,\mu} =0##, has nothing whatever to do with proving ##(*)##. Dirac does not even bring in the metric until after his Eq. (27.4). (In fact, he merely states that the momentum density ##p^\mu## "lies in the same direction" as the velocity vector.)

Any help, please?

1751809672114.webp
1751809703910.webp
1751809726142.webp
 

Attachments

Last edited:
Physics news on Phys.org
In his derivation there's no mention of being in a curved spacetime. There's no √|g|, nor any semicolon. So what's wrong with his argument leading to (27.4)?
 
dextercioby said:
In his derivation there's no mention of being in a curved spacetime. There's no √|g|, nor any semicolon. So what's wrong with his argument leading to (27.4)?
Dirac infers (27.4) but doesn't derive it. I was looking to derive (27.4).
 
Kostik said:
Dirac infers (27.4) but doesn't derive it. I was looking to derive (27.4).
Are you satisfied with Dirac's arguments that led him to the equation below (27.3) ?
If yes, note that it is equal to ##\delta p^0## in (27.4). For the other components ##\delta p^i## , I think you can re-use Dirac's arguments: Starting with the equation above (27.3), this time with$$p^1 dx^0 dx^2 dx^3$$(for example). The main difference from before is that ##p^1## may vanish now, leaving us short of a constraint. To remedy that, we may try adding an infinitesimal ##\epsilon## component to ##p^1## , demand vanishing up to ##O(\epsilon)## when ##b^1=0## or when ##b^\mu## is parallel to ##p^\mu## (as before), and expect continuity when ##\epsilon \rightarrow 0## (I haven't worked it out all the way).

Late edit: Note that ##\delta p^\mu## is a vector function of ##p^\mu## and ##b^\mu## which is expected to be linear in both. We can also argue that it should not include any second order (or higher) derivatives. These restrictions considerably narrow down the possibilities.
 
Last edited:
JimWhoKnew said:
Are you satisfied with Dirac's arguments that led him to the equation below (27.3) ?
Heck, no. Dirac says that if ##b^\mu \propto p^\mu \propto u^\mu##, then at ##x=z(\lambda)##: $$z^\mu (\lambda) + b^\mu (x) = z^\mu (\lambda) + \epsilon \frac{dz^\mu}{d\lambda} = z^\mu (\lambda + \epsilon)$$ where ##\epsilon## is small since we assume ##b^\mu## is small. Thus, ##b^\mu## displaces the element of matter along its worldline, and therefore ##\delta p^\mu = 0## . (This is where he uses the assumption that the matter distribution is a dust: all matter particles in the fluid element have the same velocity vector ##u^\mu##, hence the same ##p^\mu##.)

But he does not derive the equation $$\delta p^0 = (p^r b^0 - p^0 b^r)_{,r} \,\, ,$$ he merely observes that it satisfies (27.3) when ##b^0 = 0## and satisfies ##\delta p^0 = 0## when ##b^\mu \propto p^\mu##.
 
Last edited:
JimWhoKnew said:
If yes, note that it is equal to ##\delta p^0## in (27.4). For the other components ##\delta p^i## , I think you can re-use Dirac's arguments: Starting with the equation above (27.3), this time with$$p^1 dx^0 dx^2 dx^3$$(for example). The main difference from before is that ##p^1## may vanish now, leaving us short of a constraint. To remedy that, we may try adding an infinitesimal ##\epsilon## component to ##p^1## , demand vanishing up to ##O(\epsilon)## when ##b^1=0## or when ##b^\mu## is parallel to ##p^\mu## (as before), and expect continuity when ##\epsilon \rightarrow 0## (I haven't worked it out all the way).
It's not clear that the argument above (27.3) for ##p^0## applies for ##p^1##. The integral $$\int_V p^0 dx^1 dx^2 dx^3$$ is the mass within ##V## at a fixed time. Why is $$\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right) = -\int p^1 b^r \, dS_r \,\, ?$$ Not obvious at all.

##\qquad## I'm inclined to think that Dirac should have made use of ##{p^\mu}_{,\mu}=0## to conclude directly that ##\delta p^\mu## must be $$(\text{something antisymmetric in } \mu \text{ and } \nu)_{,\nu} \,\, .$$ But even that would still be inferring (27.4), not deriving it.
 
Last edited:
Kostik said:
(This is where he uses the assumption that the matter distribution is a dust: all matter particles in the fluid element have the same velocity vector ##v^\mu##, hence the same ##p^\mu##.)
For dust, all matter particles in the infinitesimal fluid element have the same velocity vector ##v^\mu##, hence the same ##p^\mu##. The vector field defined by ##v^\mu## is not constant nor uniform in general.

Kostik said:
It's not clear that the argument above (27.3) for ##p^0## applies for ##p^1##. The integral $$\int_V p^0 dx^1 dx^2 dx^3$$ is the mass within ##V## at a fixed time. Why is $$\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right) = -\int p^1 b^r \, dS_r \,\, ?$$ Not obvious at all.
##p^0## is positive and bound below. In all other aspects, the 4D treatment by Dirac is symmetric with respect to the roles of the components of ##p^\mu## . The conservation assumption (27.2)$$p^\mu{}_{,\mu}=0$$says that the geodesics of the congruence in 4D are analogous to the force lines (in vacuum) in electrostatics: nothing gets accumulated. Dirac mentions that $$p^1 dx^0 dx^2 dx^3$$is the amount (of matter) flowing through a volume element ##dx^2 dx^3## in the time interval ##dx^0## .

To visualize $$\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right) = -\int p^1 b^r \, dS_r $$we may try to suppress the ##x^3## component. In the ##x^0-x^2## plane, draw a closed curve, representing the volume and its boundary. The ##x^1## direction points into the page. We may choose ##b^r## , with ##r=(0,2)## , to be small enough, so that changes in ##p^\mu## over the "length" ##b^r## vanish to lowest order. Now, Dirac is considering an active displacement of the dust by ##b^r## , but we can equally well consider the passive displacement by ##(-b^r)##, where the dust "stays put" and the volume is displaced. Try to see what you get for $$\delta \int p^1 dx^0 dx^2$$when ##b^r=\epsilon \delta^r_2## (no boundary deformation) and when ##b^r=\epsilon(x) \delta^r_2## (boundary is deformed).

Kostik said:
But he does not derive the equation $$\delta p^0 = (p^r b^0 - p^0 b^r)_{,r} \,\, ,$$ he merely observes that it satisfies (27.3) when ##b^0 = 0## and satisfies ##\delta p^0 = 0## when ##b^\mu \propto p^\mu##.
So we see that$$ (p^r b^0 - p^0 b^r)_{,r}$$satisfies all the requirements. I would not argue with your standards for derivation, but if it can be argued that it is the unique acceptable possibility that does so, I'll consider it a proof.
 
Last edited:
  • Like
Likes dextercioby
JimWhoKnew said:
##p^0## is positive and bound below. In all other aspects, the 4D treatment by Dirac is symmetric with respect to the roles of the components of ##p^\mu## . The conservation assumption (27.2)$$p^\mu{}_{,\mu}=0$$says that the geodesics of the congruence in 4D are analogous to the force lines (in vacuum) in electrostatics: nothing gets accumulated. Dirac mentions that $$p^1 dx^0 dx^2 dx^3$$is the amount (of matter) flowing through a volume element ##dx^2 dx^3## in the time interval ##dx^0## .

To visualize $$\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right) = -\int p^1 b^r \, dS_r $$we may try to suppress the ##x^3## component. In the ##x^0-x^2## plane, draw a closed curve, representing the volume and its boundary. The ##x^1## direction points into the page. We may choose ##b^r## , with ##r=(0,2)## , to be small enough, so that changes in ##p^\mu## over the "length" ##b^r## vanish to lowest order. Now, Dirac is considering an active displacement of the dust by ##b^r## , but we can equally well consider the passive displacement by ##(-b^r)##, where the dust "stays put" and the volume is displaced. Try to see what you get for $$\delta \int p^1 dx^0 dx^2$$when ##b^r=\epsilon \delta^r_2## (no boundary deformation) and when ##b^r=\epsilon(x) \delta^r_2## (boundary is deformed).
I thought some more about it and found that my argument is insufficient.
An arbitrary ##b^\mu (x)## displacement introduces density changes. In the ##\delta p^0## case, where the volume is spacelike and ##b^0=0## , each particle is counted once, along with its displacement-invariant contribution to ##p^0##. So the difference between the integrals comes entirely from the boundary, as in Dirac's book. In the case of$$
\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right)$$where ##b^1=0## , each worldline passes through infinitely many infinitesimal volume elements, so (currently) I can't argue that all the difference must come solely from the boundary.
 
Last edited:
JimWhoKnew said:
I thought some more about it and found that my argument is insufficient.
An arbitrary ##b^\mu (x)## displacement introduces density changes. In the ##\delta p^0## case, where the volume is spacelike and ##b^0=0## , each particle is counted once, along with its displacement-invariant contribution to ##p^0##. So the difference between the integrals comes entirely from the boundary, as in Dirac's book. In the case of$$
\delta \left( \int_V p^1 dx^0 dx^2 dx^3 \right)$$where ##b^1=0## , each worldline passes through infinitely many infinitesimal volume elements, so (currently) I can't argue that all the difference must come solely from the boundary.
I found a reasonable proof that Dirac's (27.3) holds for ##p^k## (still in the ##b^0=0## case). Dirac should actually have done this before moving on to the ##b^0 \neq 0## case:
1752071625604.webp
 
Last edited:
  • #10
Dirac simply guesses that, for ##b^0 \neq 0##, the formula is $$\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu)_{,\nu} $$ and verifies that it gives Eq. (27.3) when ##b^0 = 0##, and also that ##\delta p^0 = 0## when ##b^\mu## is proportional to ##p^\mu##.

He could also have rationalized this guess by showing that it satisfies ##{p^\mu}_{,\mu}=0##. But either way, he is simply rationalizing a guess.

Which brings me back to my original post ... I think that the derivation is correct, I may just need to convince myself of this.
 
Last edited:
  • #11
Kostik said:
since ##u^0=dt/d\lambda## has no spatial dependence
Read again the first sentences of chapters 25 and 27 in Dirac's book.
 
Last edited:
  • #12
JimWhoKnew said:
Read again the first sentences of chapters 25 and 27 in Dirac's book.
I think you have misunderstood. In Ch. 27, Dirac defers introduction of the metric, so the trajectory of a matter element (fluid element) is ##z(\lambda)##, where ##\lambda## is any parameter. In the "laboratory frame", ##t=x^0## is time. He could easily have taken ##\lambda=t##, in which case ##dz/d\lambda = dz/dt##. It makes no difference; but at this stage, since there is no metric, there is no spacetime interval ##ds##.

##dt/d\lambda## is just the ratio of the parameters: it does not have any spatial dependence. You can choose ##\lambda=t## if you like, and then ##dt/d\lambda=1##.
 
  • #13
The displacement field ##b^\mu## can not be fully arbitrary, since we want a diffeomorphism (we don't want 2 initially distinct points to land on the same target one, nor to cross each other's "path"). A way to guarantee safety is to demand ##b^\mu{}_{,\mu}=0## ("force lines don't cross").
So with ##b^\mu{}_{,\mu}=0## and ##p^\mu{}_{,\mu}=0## , equation (27.4) is just the negative of the Lie derivative$$\mathrm{L}_\mathbf{b}p^\mu=\left[\mathbf{b},\mathbf{p}\right]^\mu$$
 
Last edited:
  • Like
Likes dextercioby
  • #14
Kostik said:
I think you have misunderstood. In Ch. 27, Dirac defers introduction of the metric, so the trajectory of a matter element (fluid element) is ##z(\lambda)##, where ##\lambda## is any parameter. In the "laboratory frame", ##t=x^0## is time. He could easily have taken ##\lambda=t##, in which case ##dz/d\lambda = dz/dt##. It makes no difference; but at this stage, since there is no metric, there is no spacetime interval ##ds##.

##dt/d\lambda## is just the ratio of the parameters: it does not have any spatial dependence. You can choose ##\lambda=t## if you like, and then ##dt/d\lambda=1##.
Dirac's discussion doesn't forbid the choice that ##u^0## is spatially independent. But then, if you add the assumption ##u^k{}_{,n}=0##, you run into troubles with the possibility of ##v^\mu## to vary from one point to a neighboring one, when the metric is re-introduced.
(For example: if at a single point ##u^\mu \propto \delta^\mu_0## , then ##v^\mu=\delta^\mu_0## everywhere on the timeslice)

Note that ##u^\mu## drops out from the final results, as it should.
 
Last edited:
  • #15
JimWhoKnew said:
Dirac's discussion doesn't forbid the choice that ##u^0## is spatially independent. But then, if you add the assumption ##u^k{}_{,n}=0##, you run into troubles with the possibility of ##v^\mu## to vary from one point to a neighboring one, when the metric is re-introduced.
(For example: if at a single point ##u^\mu \propto \delta^\mu_0## , then ##v^\mu=\delta^\mu_0## everywhere on the timeslice)

Note that ##u^\mu## drops out from the final results, as it should.
Just take ##\lambda=t## and the issue goes away. Of course, ##u^\mu## varies throughout the matter field, but the whole point of the matter field being a dust is that, within a sufficiently small "matter (volume) element", the tangent vector ##u^\mu## (or 4-velocity ##v^\mu##) is constant. (Equivalently, all the matter particles are at rest in the Momentarily Co-moving Reference Frame of the matter particle at a particular point ##x##.) Therefore, choosing ##b^n## smaller than the size of the fluid element, we have ##\delta u^k = 0##, and also ##{u^k}_{,n} = 0##.
 
  • #16
Kostik said:
Just take ##\lambda=t## and the issue goes away.
That is choosing, as I said above. It can be done, but Dirac does not impose it.

Kostik said:
Of course, ##u^\mu## varies throughout the matter field, but the whole point of the matter field being a dust is that, within a sufficiently small "matter (volume) element", the tangent vector ##u^\mu## (or 4-velocity ##v^\mu##) is constant. (Equivalently, all the matter particles are at rest in the Momentarily Co-moving Reference Frame of the matter particle at a particular point ##x##.) Therefore, choosing ##b^n## smaller than the size of the fluid element, we have ##\delta u^k = 0##, and also ##{u^k}_{,n} = 0##.
Well... It seems that Dirac implicitly does something in that spirit below equation (25.8), so maybe his ##u^\mu## is indeed piecewise spatially constant. In chapter 27 he claims that his approach "makes the discussion clearer". It is certainly not clear enough for me.
 
Last edited:
  • #17
If Dirac is not going to actually derive (27.4) (which I think I have correctly done in the original post), but rather to simple infer its reasonableness, he might as well simply have shown -- as he does -- that using (27.4), the principle of stationary action yields Einstein's equations for a dust (25.7) starting from the appropriate action ##I## given in (27.9). Since we expect the action principle with the correct action for any particular matter field to yield Einstein’s equation with the energy-momentum tensor corresponding to the field, we must infer the correctness of (27.4).
 
  • #18
JimWhoKnew said:
Well... It seems that Dirac implicitly does something in that spirit below equation (25.8), so maybe his ##u^\mu## is indeed piecewise spatially constant. In chapter 27 he claims that his approach "makes the discussion clearer". It is certainly not clear enough for me.
##u^\mu## must be constant within a sufficiently small volume element of the fluid. That's what it means for all matter particles to be at rest within the momentarily co-moving reference frame of the fluid at a point. If not, then the particles would have to be interacting with one another, creating pressure and/or shear forces (or the particles would be under the influence of other external forces).
 
  • #19
Kostik said:
Just take ##\lambda=t## and the issue goes away. Of course, ##u^\mu## varies throughout the matter field, but the whole point of the matter field being a dust is that, within a sufficiently small "matter (volume) element", the tangent vector ##u^\mu## (or 4-velocity ##v^\mu##) is constant. (Equivalently, all the matter particles are at rest in the Momentarily Co-moving Reference Frame of the matter particle at a particular point ##x##.) Therefore, choosing ##b^n## smaller than the size of the fluid element, we have ##\delta u^k = 0##, and also ##{u^k}_{,n} = 0##.
I'm thinking about this. I am comfortable with ##{u^k}_{,n} = 0##, but I'm having doubts about ##\delta u^k = 0##. The displacement ##b^\mu## applies to the matter element; and if you move an entire matter element from one place to another nearby place, I don't see why ##u^\mu## stays the same. Alternatively, $$\delta u^k = \delta \left( \frac{dz^k}{d\lambda} \right) = \frac{d(\delta z^k)}{d\lambda} = \frac{db^k}{d\lambda} \,\, .$$ I don't see why this vanishes. The displacement field ##b^k## is arbitrary (but small).

Hang on: maybe ##db^k/d\lambda## vanishes simply because ##b^k## doesn't depend on ##lambda#??
 
Last edited:
  • #20
JimWhoKnew said:
The displacement field ##b^\mu## can not be fully arbitrary, since we want a diffeomorphism (we don't want 2 initially distinct points to land on the same target one, nor to cross each other's "path"). A way to guarantee safety is to demand ##b^\mu{}_{,\mu}=0## ("force lines don't cross").
So with ##b^\mu{}_{,\mu}=0## and ##p^\mu{}_{,\mu}=0## , equation (27.4) is just the negative of the Lie derivative$$\mathrm{L}_\mathbf{b}p^\mu=\left[\mathbf{b},\mathbf{p}\right]^\mu$$
Since I joined this thread, I was puzzled by the question why doesn't ##\delta p^\mu## in equation (27.4) just equal the Lie derivative (or its negative, depending on chosen convention). Today I noticed something that I've missed due to lack of sufficient attention. In chapter 27 Dirac defines ##p^\mu## to be a vector density, rather than a vector field. Now, if we set$$p^\mu\equiv \sqrt{g} P^\mu$$where ##P^\mu## is a vector field, we get from equation (21.1) that the assumption ##p^\mu{}_{,\mu}=0## implies ##P^\mu{}_{;\mu}=0## .
The active displacement of matter can be viewed as a passive displacement of the coordinates in the opposite directions:$$x'^\mu=x^\mu-b^\mu \quad .$$Using the conventions and results in sections 4.7 and 10.9 in Weinberg's book:$$\delta p^\mu = \delta \sqrt{g} P^\mu+\sqrt{g} \delta P^\mu=\frac{\sqrt{g}}2 g^{\rho\sigma}\delta g_{\rho\sigma} P^\mu+\sqrt{g} \delta P^\mu \quad .$$Setting$$\delta P^\mu=-\mathrm{L}_\mathbf{b}P^\mu=-b^\nu P^\mu{}_{,\nu}+P^\nu b^\mu{}_{,\nu}$$and $$\delta g_{\mu\nu}=-\mathrm{L}_\mathbf{b}g_{\mu\nu}=-b_{\mu ; \nu}-b_{\nu ; \mu} \quad ,$$along with Dirac's constraint ##p^\mu{}_{,\mu}=0## and $$(p^\mu b^\nu)_{,\nu}=[P^\mu(\sqrt{g}b^\nu)]_{,\nu}=P^\mu{}_{,\nu}\sqrt{g}b^\nu+p^\mu b^\nu{}_{;\nu}$$ to eliminate the connections with equation (21.1), I get equation (27.4) $$\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu)_{,\nu} \quad .$$

(no further assumptions added this time)
 
Last edited:
  • Like
Likes dextercioby
  • #21
JimWhoKnew said:
post under construction. please dont reply

Since I joined this thread, I was puzzled by the question why doesn't ##\delta p^\mu## in equation (27.4) just equal the Lie derivative (or its negative, depending on chosen convention). Today I noticed something that I've missed due to lack of sufficient attention. In chapter 27 Dirac defines ##p^\mu## to be a vector density, rather than a vector field. Now, if we set$$p^\mu\equiv \sqrt{g} P^\mu$$where ##P^\mu## is a vector field, we get from equation (21.1) that the assumption ##p^\mu{}_{,\mu}=0## implies ##P^\mu{}_{;\mu}=0## .
The active displacement of matter can be viewed as a passive displacement of the coordinates in the opposite directions:$$x'^\mu=x^\mu-b^\mu \quad .$$Using the conventions and results in sections 4.7 and 10.9 in Weinberg's book:$$\delta p^\mu = \delta \sqrt{g} P^\mu+\sqrt{g} \delta P^\mu=\frac{\sqrt{g}}2 g^{\rho\sigma}\delta g_{\rho\sigma} P^\mu+\sqrt{g} \delta P^\mu \quad .$$Setting$$\delta P^\mu=-\mathrm{L}_\mathbf{b}P^\mu=-b^\nu P^\mu{}_{,\nu}+P^\nu b^\mu{}_{,\nu}$$and $$\delta g_{\mu\nu}=-\mathrm{L}_\mathbf{b}g_{\mu\nu}=-b_{\mu ; \nu}-b_{\nu ; \mu} \quad ,$$along with Dirac's constraint ##p^\mu{}_{,\mu}=0## and ##(p^\mu b^\nu)_{,\nu}=[P^\mu(\sqrt{g}b^\nu)]_{,\nu}## to eliminate the connections with equation (21.1), I get equation (27.4) $$\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu)_{,\nu} \quad .$$

(no further assumptions added this time)
We know ##p^\mu = \rho v^\mu \sqrt{-g}##. Your ##P^\mu## is ##\rho v^\mu##.

Your equation below "Using the conventions and results in sections 4.7 and 10.9 in Weinberg's book" follows immediately from Dirac's equation (26.10).

I don't see how Weinberg's (10.9.9) is the same as the equation immediately above (10.9.7). Not even remotely. Can you help with this?

Then I think I see what you are doing, using Weinberg's (10.9.9) and the equation on p. 292 for ##\Delta_\epsilon U^\mu##.
 
Last edited:
  • #22
Kostik said:
We know ##p^\mu = \rho v^\mu \sqrt{-g}##. Your ##P^\mu## is ##\rho v^\mu##.

Your equation below "Using the conventions and results in sections 4.7 and 10.9 in Weinberg's book" follows immediately from Dirac's equation (26.10).

I don't see how Weinberg's (10.9.9) is the same as the equation immediately above (10.9.7). Not even remotely. Can you help with this?

Then I think I see what you are doing, using Weinberg's (10.9.9) and the equation on p. 292 for ##\Delta_\epsilon U^\mu##.
I explicitly asked "please dont reply yet" (check the quotation in your reply #21). The LaTeX preview issue causes some of us to post unfinished versions, especially when many equations are involved. Common courtesy will be appreciated.

Kostik said:
Your equation below "Using the conventions and results in sections 4.7 and 10.9 in Weinberg's book" follows immediately from Dirac's equation (26.10).
So??

Edit:
Equation (10.9.9) in Weinberg's follows immediately from the general case (10.9.10). A nice treatment can be found in d'Inverno's book (new edition, and likely in the old one too), although he declares an active displacement but appears to be using a passive one without changing the sign.

Additional edit:
Ok, now I think I see what's bothering you. In the line above (10.9.7) Weinberg writes "In covariant terms" and applies the "comma goes to semicolon" rule to the equation above (10.9.7). Replacing ##g_{\mu\nu}## by ##T_{\mu\nu}## in the equation above (10.9.7), and then "covarianting", gives you (10.9.10).
 
Last edited:
  • #23
@JimWhoKnew No, I am perfectly happy with (10.9.9) being a special case of (10.9.10). I get to Weinberg's equation above (10.9.7) alright, and then (10.9.7) says that ##\Delta_\epsilon g_{\mu\nu}(x)## is equal to the last three terms in the equation above (10.9.7). But how the heck are these three terms equal to (10.9.9)?

(PS. Apologies for the premature post above.)
 
  • #24
Kostik said:
@JimWhoKnew No, I am perfectly happy with (10.9.9) being a special case of (10.9.10). I get to Weinberg's equation above (10.9.7) alright, and then (10.9.7) says that ##\Delta_\epsilon g_{\mu\nu}(x)## is equal to the last three terms in the equation above (10.9.7). But how the heck are these three terms equal to (10.9.9)?

(PS. Apologies for the premature post above.)
Given the "additional edit" in post #22, I don't understand what is still bothering you. So I'll repeat the same argument, but with some more details.
The 3 difference terms above (10.9.7) are$$g_{\rho\nu}\epsilon^\rho{}_{,\mu}+g_{\rho\mu}\epsilon^\rho{}_{,\nu}+g_{\mu\nu,\rho}\epsilon^\rho \quad .$$Now, on the line between the equations, Weinberg says "in covariant terms, we conclude that". I interpret it as an implicit application of the "comma goes to semicolon" practice that guarantees covariance. The 3 terms will become$$\Delta_\epsilon g_{\mu\nu}\equiv g_{\rho\nu}\epsilon^\rho{}_{;\mu}+g_{\rho\mu}\epsilon^\rho{}_{;\nu}+g_{\mu\nu;\rho}\epsilon^\rho \quad ,$$which is the same as (10.9.9).

Does that answer your question? If not, what is missing?
 
Last edited:
  • #25
JimWhoKnew said:
Does that answer your question? If not, what is missing?
Sorry, brain freeze. Yes, it's obvious. Thanks, this has been really useful, and a much better way to show Dirac's relation without mucking about with all those delta functions. (Interestingly, for a guy who invented the delta function, he never uses it once in his "GTR" book, opting to work only with matter fields treated as continua.)
 
  • #26
@JimWhoKnew How do you get $$\delta P^\mu = -\mathcal{L}_b \, P^\mu$$ and $$\delta g_{\mu\nu} = -\mathcal{L}_b \, g_{\mu\nu} \quad ?$$ I get ##+## signs. Yes, here ##\delta P^\mu = P'^\mu - P^\mu## (and likewise for ##\delta g_{\mu\nu}##), which is opposite to the usual Lie derivative definition, but also the coordinate transformation is ##x'^\mu = x^\mu - b^\mu##. Accordingly, I get the opposite of Dirac's eq. (27.4).

To be clear, if Dirac applies a displacement field $$x \rightarrow x+\delta x = x+b$$ then The components of a tensor ##T## at ##x## change to ##T(x-b)##. Thus, the coordinate change here is ##x' = x - b##.

Here is my logic; could you please identify the mistake?

1752519002364.webp
 

Attachments

  • 1752516104191.webp
    1752516104191.webp
    19.7 KB · Views: 4
Last edited:
  • #27
Kostik said:
@JimWhoKnew How do you get $$\delta P^\mu = -\mathcal{L}_b \, P^\mu$$ and $$\delta g_{\mu\nu} = -\mathcal{L}_b \, g_{\mu\nu} \quad ?$$ I get ##+## signs. Yes, here ##\delta P^\mu = P'^\mu - P^\mu## (and likewise for ##\delta g_{\mu\nu}##), which is opposite to the usual Lie derivative definition, but also the coordinate transformation is ##x'^\mu = x^\mu - b^\mu##. Accordingly, I get the opposite of Dirac's eq. (27.4).

To be clear, if Dirac applies a displacement field $$x \rightarrow x+\delta x = x+b$$ then The components of a tensor ##T## at ##x## change to ##T(x-b)##. Thus, the coordinate change here is ##x' = x - b##.

Here is my logic; could you please identify the mistake?

View attachment 363216
With my physics background, I operate on the belief that signs will always take care of themselves and that one should never confront his calculations with reality unless absolutely necessary.

The present case doesn't qualify...

I mentioned "conventions" because these cases are tricky. Compare the equation above (10.9.7) in Weinberg's with equation (6.7) in d'Inverno's. In case you don't have access, I'll mention that d'Inverno uses ##T(x')-T'(x')## . They both get the same expressions for the Lie derivatives because d'Inverno uses ##x'=x+b## , unlike Weinberg's (10.9.6). This teaches us that we should consider all the relevant assumptions, rather than just the final expression. The second equation in your attachment contains an implicit assumption about the sign of ##\delta## .

So now we have to reconstruct Dirac's "meaning". Maybe you've read it in one of the earlier chapters, I didn't. But anyway, we can use the paragraph leading to (27.3) for this purpose. Since we are only interested in the sign, we can choose ##b^\mu=\epsilon \delta^\mu_1## and assume that ##p^0{}_{,1}## and ##P^0{}_{;1}## are both positive. With that,$$\Delta_b g_{\mu\nu}=0 \quad ,$$$$\delta \sqrt{g}=0 \quad , $$and$$\Delta_b P^\mu = \epsilon P^\mu{}_{;1} \quad .$$Since all ##~\partial x'^\mu / \partial x^\nu-\delta^\mu_\nu~## vanish with our choice, we don't have to worry about their contributions.
Following the example in the equation above (10.9.7) in Weinberg's, we'll get for this specific case $$P'^0(x)-P^0(x)=P^0(x+b)-P^0(x)+O(\epsilon^2) \quad (>0) \quad . $$But Dirac seems to use $$p^0(x-b)-p^0(x) \quad (<0) $$instead (actively moving the vector density "to the right" and subtracting the values of the vector density that was at that point before), hence the extra minus sign in his definition of ##\delta p^0## . To align my results with Dirac's, I chose the negative of Weinberg's Lie derivatives in #20.

BTW:
Note that the variation that uses Lie derivatives, as in #20, is general and can be applied to any tensor. Applied to ##p^\mu##, it will be equal to equation (27.4) only because of the assumption ##p^\mu{}_{,\mu}=0 ## .
 
Last edited:
  • #28
@JimWhoKnew I think Dirac is using ##\delta## is the most basic way it is used when varying a path or dynamical coordinate, to implement the principle of stationary action. Using the action ##I_g + I_m##, Dirac must vary ##g_{\mu\nu}## and -- in view of Eq. (27.9) -- ##p^\mu##. Thus, the varied path or variation is $$\bar{g}_{\mu\nu} = g_{\mu\nu} + \delta g_{\mu\nu}$$ $$\bar{p}^{\mu} = p^{\mu} + \delta p^{\mu} \, .$$ My error isn't in the definition of the Lie derivative. I am actually using the computations from Ohanian & Ruffini's "Gravitation and Spacetime", and I am carrying them through carefully, line by line. And as I said, I get $$\delta p^\mu = \mathcal{L}_b \, p^\mu$$ $$\delta g_{\mu\nu} = \mathcal{L}_b \, g_{\mu\nu}$$ which gives me the opposite of Dirac's Eq. (27.4).

Dirac's simple derivation of ##\delta p^0## in the base ##b^0=0## (his Eq. (27.3)) is very clear and gives us the correct signs.

My mistake must be something conceptual in the way I have set up my picture of the active vs. passive transformation, the definition of ##\delta p^\mu## in terms of ##p^\mu## and ##{p'}^\mu##, etc.

I have D'Inverno & Vickers (2nd Ed) and will read their treatment carefully. I just need to get myself off on the right foot in terms of clearly describing how the displacement of the matter field $$z^\mu \longrightarrow z^\mu + \delta z^\mu = z^\mu + b^\mu$$ implies $$p^\mu \longrightarrow p^\mu + \delta p^\mu$$ is equivalent to a change of coordinates ##x \rightarrow x'##, with the appropriate formula for ##\delta p^\mu## in terms of ##p^\mu## and ##{p'}^\mu##. (Note that D'Inverno essentially does the same thing, when he writes ##x^a \rightarrow x^a + \delta u X^a(x)##.)

My work is shown below:

1752573325964.webp

1752573425127.webp

1752573475038.webp

1752573504197.webp

1752573526269.webp
 
Last edited:
  • #29
Kostik said:
I think Dirac is using ##\delta## is the most basic way it is used when varying a path or dynamical coordinate, to implement the principle of stationary action.
In general we have$$\delta= ~^"\mathrm{new}^"~ - ~^"\mathrm{old}^" \quad .$$Apparently this should be straightforward: we know who the old is, we can figure out the new, go ahead. So why "apparently"?

Let's consider a simplified example:
Consider a scalar field ##\phi(x)## , since we currently have no need in the headache of transforming components. Let's assume that ##g_{\mu\nu}=\eta_{\mu\nu}+O(\epsilon^2)## , so we can sloppily ignore the difference between ##\phi## and its associated scalar density. Assume further ##b^\mu=\epsilon \delta^\mu_1## .
Case 1:
We actively displace ##\phi## by ##b## . The new value at the point ##x## is the old ##\phi(x-b)## so$$\delta \phi=\phi(x-b)-\phi(x)=-\epsilon \phi_{,1} \quad . $$The same result is obtained if we passively move the spatial volume element in the ##(-b)## direction.
Case 2:
We apply the corresponding passive displacement ##x'=x-b## . Naturally the new value is ##\phi'(x')## . But to get ##\delta\phi## we have to compare the values at the same nominal point ##x^\mu## , so$$\delta\phi= \phi'(x)-\phi(x)=\phi'(x'+b)-\phi(x)=\phi'(x')+\epsilon \phi'_{,1}-\phi(x)=\epsilon \phi_{,1} \quad .$$
So which one is the correct ##\delta \phi## , and why?

Kostik said:
I think Dirac is using ##\delta## is the most basic way it is used when varying a path or dynamical coordinate, to implement the principle of stationary action. Using the action ##I_g + I_m##, Dirac must vary ##g_{\mu\nu}## and -- in view of Eq. (27.9) -- ##p^\mu##. Thus, the varied path or variation is $$\bar{g}_{\mu\nu} = g_{\mu\nu} + \delta g_{\mu\nu}$$ $$\bar{p}^{\mu} = p^{\mu} + \delta p^{\mu} \, .$$ My error isn't in the definition of the Lie derivative. I am actually using the computations from Ohanian & Ruffini's "Gravitation and Spacetime", and I am carrying them through carefully, line by line. And as I said, I get $$\delta p^\mu = \mathcal{L}_b \, p^\mu$$ $$\delta g_{\mu\nu} = \mathcal{L}_b \, g_{\mu\nu}$$ which gives me the opposite of Dirac's Eq. (27.4).
The derivation of the geodesic equation from equation (27.11) will not be affected by a change of sign ##\delta p^\mu \rightarrow -\delta p^\mu## in (27.4).

Kostik said:
I am actually using the computations from Ohanian & Ruffini's "Gravitation and Spacetime", and I am carrying them through carefully, line by line.
What sections?

Kostik said:
My work is shown below:
Your attached files are just what I did in post #20, except for my insertion "by hand" of the minus signs to fit Dirac's (27.4).
 
Last edited:
  • Like
Likes dextercioby
  • #30
JimWhoKnew said:
What sections?
"Gravitation and Spacetime", 3rd Ed., Ohanian & Ruffini, pp. 262-264. I like how he computes the Lie derivative, keeping careful track of the error terms in exchanging ##x## and ##x'##, and ##T_{\mu\nu}## and ##T'_{\mu\nu}##.
 
  • Like
Likes dextercioby
  • #31
JimWhoKnew said:
In general we have$$\delta= ~^"\mathrm{new}^"~ - ~^"\mathrm{old}^" \quad .$$Apparently this should be straightforward: we know who the old is, we can figure out the new, go ahead. So why "apparently"?
Yep, this is where the issue really lies. If we vary $$x^\mu \rightarrow \bar{x}^\mu = x^\mu + \delta x^\mu = x^\mu + b^\mu \,\, ,$$ and if ##T(x)## is a tensor field, then what EXACTLY is the resultant variation ##\delta T(x)##?

It cannot be ##T(x) - T(x-b)##, as I originally thought, because we cannot subtract vectors/tensors evaluated at different points. I need to decide exactly what ##\delta T(x)## means before computing it.

D'Inverno talks about dragging the tensor ##T^{ab}## from ##P## to ##Q## -- see his Figure 6.3. What does he mean exactly -- is this parallel transport?

Obviously, to get the right answer, I want $$\delta T(x) = T'(x') - T(x')$$ (or ##\delta T(x) = T'(x) - T(x)## ... does it matter?) where ##T'## is the transformed tensor under the coordinate change ##x' = x + b##. (Thereby making ##\delta T(x)## equal to the opposite of the Lie derivative.) I just need to justify this to myself.
 
Last edited:
  • #32
I might have it figured out. Will post a coherent explanation if I do.
 
  • #33
JimWhoKnew said:
Now, on the line between the equations, Weinberg says "in covariant terms, we conclude that". I interpret it as an implicit application of the "comma goes to semicolon" practice that guarantees covariance.
The "comma goes to semicolon" is optional, since Lie derivatives are covariant with it and without it (page 137 in Carroll's "Lecture Notes on GR"). There are applications where the manifestly covariant form (semicolons) is more convenient.

BTW:
I think there is a typo in Weinberg's equation (10.9.10). ##T^\lambda{}_\mu## should be ##T_\mu{}^\lambda## .
 
  • #34
@JimWhoKnew My derivation is shown below. It's a little verbose, but it helps me understand what's going on. I find it a little tricky keeping the ##x##s and ##x'##s straight, and also taking care with the ##T##s and ##T'##s and watching the error terms. Still, I think the approach is sound.

1752758295501.webp

1752758316558.webp

1752758341082.webp

1752758365376.webp

1752758396390.webp

1752758421238.webp

1752758439319.webp
 
  • #35
@JimWhoKnew Something still bothers me here. Dirac went through all this trouble to establish (27.4), because he writes the matter action as $$I_m = -\int (p^\mu p_\mu)^{1/2} \, .$$ But then in his eq. (27.10), he writes the variation ##\delta(I_g + I_m)## as a sum of integrals, one with ##\delta g_{\mu\nu}## and one with ##\delta p^\mu##. Then he uses (27.4) to write ##\delta p^\mu## in terms of ##b^\mu##.

HOWEVER, you showed that ##\delta g_{\mu\nu}=-\mathcal{L}_b \, g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## is also a function of ##b^\mu##.

So, ##\delta g_{\mu\nu}## and ##b^\mu## are not independent. Hence, how can he justify setting both integrands equal to zero?

Can ##b^\mu## and ##b_{\mu:\nu}+b_{\nu:\mu}## be varied independently?
 
Last edited:
  • #36
Kostik said:
@JimWhoKnew Something still bothers me here. Dirac went through all this trouble to establish (27.4), because he writes the matter action as $$I_m = -\int (p^\mu p_\mu)^{1/2} \, .$$ But then in his eq. (27.10), he writes the variation ##\delta(I_g + I_m)## as a sum of integrals, one with ##\delta g_{\mu\nu}## and one with ##\delta p^\mu##. Then he uses (27.4) to write ##\delta p^\mu## in terms of ##b^\mu##.

HOWEVER, you showed that ##\delta g_{\mu\nu}=-\mathcal{L}_b \, g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## is also a function of ##b^\mu##.

So, ##\delta g_{\mu\nu}## and ##b^\mu## are not independent. Hence, how can he justify setting both integrands equal to zero?

Can ##b^\mu## and ##b_{\mu:\nu}+b_{\nu:\mu}## be varied independently?
I noticed this problem a couple of days ago. The bottom line: I don't understand how Dirac gets (27.4).

I didn't show that ##\delta g_{\mu\nu}=-\mathcal{L}_b \, g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## . All I showed is that by substituting it, we can recover (27.4) up to a term proportional to ##p^\mu{}_{,\mu}## . Big difference.

An idea that I have, but not sure of its validity:$$\delta_b g_{\mu\nu}\equiv -\mathcal{L}_b \, g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}$$is the displacement-induced change that should be used in the derivation of ##\delta p^\mu## . To that, we are free to add an arbitrary ##\delta_1 g_{\mu\nu}## , so that$$\delta g_{\mu\nu}=\delta_b g_{\mu\nu}+\delta_1 g_{\mu\nu}$$ is completely arbitrary and not fixed by the displacement.
 
Last edited:
  • #37
Well, spurred on by your hint, I am comfortable that I have derived Dirac's (27.4) as shown in the notes pasted in #34 above. Now, as to the point that ##\delta g_{\mu\nu} = -(b_{\mu:\nu}+b_{\nu:\mu})##, and the question whether ##\delta g_{\mu\nu}## and ##b^\mu## can be varied independently: I think I have an answer to that.

Write ##b^\mu = ε B^\mu##. We can equally well choose ##b^\mu = ε B^\mu + ε^2 A^\mu## with a negligible effect on ##b^\mu##. If we choose ##A^\mu## such that ##{A^\mu}_{:\nu} \gg ε^{-2}##, then $$b_{\alpha:\beta}+b_{\beta:\alpha} = ε(B_{\alpha:\beta}+B_{\beta:\alpha}) + ε^2(A_{\alpha:\beta}+A_{\beta:\alpha}) \approx ε^2(A_{\alpha:\beta}+A_{\beta:\alpha})$$ which allows ##b^\mu## and ##g_{\mu\nu}## to be varied independently.
 
  • #38
Why all this mumbo-jumbo? A vector function and its covariant derivative are functionally independent. Like x and xdot in elementary Lagrangian mechanics.
 
  • #39
dextercioby said:
Why all this mumbo-jumbo? A vector function and its covariant derivative are functionally independent. Like x and xdot in elementary Lagrangian mechanics.
What do you mean by "functionally independent"? It seems to me to require some justification, in order to carry out the final step after Dirac's (27.10) and set the coefficients of ##\delta g_{\mu\nu}## and ##b^\mu## equal to zero.

In the traditional principle of stationary action, we only vary ##x^\mu##, not both ##\dot{x}^\mu##. So, the situation here seems a little tricker.
 
Last edited:
  • #40
dextercioby said:
Why all this mumbo-jumbo? A vector function and its covariant derivative are functionally independent. Like x and xdot in elementary Lagrangian mechanics.
If we take Dirac's equation (27.4)$$
\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu)_{,\nu}$$as provided, then it is a function of the variation variables ##b^\mu## and ##b^\mu{}_{,\nu}## . Taking ##\delta g_{\mu\nu}## as independent of these, we can happily apply it to the equation below (27.9) and get that the two terms in (27.10) have independent variations.
But if we attempt to justify (27.4) along the lines of post #20, we use$$\delta g_{\mu\nu}=-g_{\mu\nu,\rho}b^\rho-b_{\mu,\nu}-b_{\nu,\mu}$$which depends on the same variation variables as ##\delta p^\mu## . With this, the two RHS terms in (27.10) are no longer manifestly independent.
However, Dirac shows in equation (27.11) that the right term of (27.10) can be brought to a form that depends only on ##b^\mu## , so ##\delta g_{\mu\nu}## in the left term (of the RHS) can be regarded as independent (since it also contains ##b^\mu{}_{,\nu}## ). This is in accord with your (and @Kostik's) observation. Although that seems to settle the problem, I'm still not confident about the process I described in #20.

Kostik said:
If we choose ##A^\mu## such that ##{A^\mu}_{:\nu} \gg ε^{-2}##
With such a choice, ##b^\mu{}_{;\nu}## are no longer small.
 
Last edited:
  • #41
dextercioby said:
Why all this mumbo-jumbo? A vector function and its covariant derivative are functionally independent. Like x and xdot in elementary Lagrangian mechanics.
If in elementary Lagrangian mechanics we have a single particle Lagrangian$$\mathrm{L}\left(x,y,z,\dot x,\dot y ,\dot z \right)$$then for$$\delta\mathrm{I}=\int \delta \mathrm{L}dt$$we can vary ##x## arbitrarily while keeping ##\delta y=\delta z=0## (and similarly for the other cases). So ##\delta x,\delta y,\delta z## are mutually independent.
But that is not the case for ##\delta x## and ##\delta \dot x## . To derive the equations of motion we use$$\delta \dot x=\frac d{dt}(\delta x) \quad ,$$and if ##\delta x## vanishes during the entire action integration, so does ##\delta \dot x## .

Late edit:
Moreover, we end up with$$0=\int \left[\mathrm{~equation~~of~~motion~ }\right]_i \delta x^i dt \quad .$$##\delta \dot x ^i## are eliminated. This is different from equation (27.10) for the case described in #40.

JimWhoKnew said:
so ##\delta g_{\mu\nu}## in the left term (of the RHS) can be regarded as independent (since it also contains ##b^\mu{}_{,\nu}## ). This is in accord with your (and @Kostik's) observation. Although that seems to settle the problem,
Probably wrong.
 
Last edited:
  • #42
JimWhoKnew said:
If we take Dirac's equation (27.4)$$
\delta p^\mu = (p^\nu b^\mu - p^\mu b^\nu)_{,\nu}$$as provided, then it is a function of the variation variables ##b^\mu## and ##b^\mu{}_{,\nu}## . Taking ##\delta g_{\mu\nu}## as independent of these, we can happily apply it to the equation below (27.9) and get that the two terms in (27.10) have independent variations.
But if we attempt to justify (27.4) along the lines of post #20, we use$$\delta g_{\mu\nu}=-g_{\mu\nu,\rho}b^\rho-b_{\mu,\nu}-b_{\nu,\mu}$$which depends on the same variation variables as ##\delta p^\mu## . With this, the two RHS terms in (27.10) are no longer manifestly independent.
However, Dirac shows in equation (27.11) that the right term of (27.10) can be brought to a form that depends only on ##b^\mu## , so ##\delta g_{\mu\nu}## in the left term (of the RHS) can be regarded as independent (since it also contains ##b^\mu{}_{,\nu}## ). This is in accord with your (and @Kostik's) observation. Although that seems to settle the problem, I'm still not confident about the process I described in #20.


With such a choice, ##b^\mu{}_{;\nu}## are no longer small.
But ##b^\mu## is small, which is the assumption.
 
  • #43
Kostik said:
But ##b^\mu## is small, which is the assumption.
Remember that in the derivation of ##\mathrm{L}_\mathbf{b}g_{\mu\nu}## , terms of second order or higher in ##b^\mu## and ##b^\mu{}_{,\nu}## are discarded. It makes no sense unless they are all small.

Edit:
Even worse, ##\partial x'^\mu/\partial x^\nu## may become singular.
 
Last edited:
  • #44
JimWhoKnew said:
Remember that in the derivation of ##\mathrm{L}_\mathbf{b}g_{\mu\nu}## , terms of second order or higher in ##b^\mu## and ##b^\mu{}_{,\nu}## are discarded. It makes no sense unless they are all small.
Oh, that is quite right. So, my method fails -- and I am at a loss how to show that ##\delta g_{\mu\nu}## can be varied independently of ##b^\mu## in Dirac's (27.10).

From a physical point of view, the goal is to obtain ##g_{\mu\nu}## (Dirac's Eq (25.7)) from the stationary variation. (The geodesic equation is a bonus.) So it's natural that we should arbitrarily vary ##g_{\mu\nu}## in (27.10), just as we do in the vacuum case (26.8). It seems to be physically reasonable to vary ##g_{\mu\nu}## and ##b^\mu## independently. I just need to see, mathematically, how that is justifiable, given the relation ##g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}##.

EDIT: Hang on, maybe I'm getting hung up on ##g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## and the apparent relation it imposes on ##g_{\mu\nu}## and ##b^{\mu}## in Dirac's (27.10). It seems that ##g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## is simply the effect on ##g_{\mu\nu}## caused by ##b^{\mu}##. But in (27.10), we are at liberty to vary ##g_{\mu\nu}## arbitrarily, since this is allowed to make an arbitrary variation ##\delta(I_g + I_m)##. So I think equating the coefficients of ##g_{\mu\nu}## and ##b^{\mu}## in (27.10) and (27.11) is OK -- with no further explanation required!
 
Last edited:
  • #45
Kostik said:
EDIT: Hang on, maybe I'm getting hung up on ##g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## and the apparent relation it imposes on ##g_{\mu\nu}## and ##b^{\mu}## in Dirac's (27.10). It seems that ##g_{\mu\nu}=-b_{\mu:\nu}-b_{\nu:\mu}## is simply the effect on ##g_{\mu\nu}## caused by ##b^{\mu}##. But in (27.10), we are at liberty to vary ##g_{\mu\nu}## arbitrarily, since this is allowed to make an arbitrary variation ##\delta(I_g + I_m)##. So I think equating the coefficients of ##g_{\mu\nu}## and ##b^{\mu}## in (27.10) and (27.11) is OK -- with no further explanation required!
Now you are just repeating my speculations as if they were your own.
 
  • #46
JimWhoKnew said:
Now you are just repeating my speculations as if they were your own.
I'm afraid not, what I wrote is 100% my own, original thought. You wrote "The bottom line: I don't understand how Dirac gets (27.4)"; however, I provided a detailed derivation of (27.4) in post #34.

I see now that you wrote "An idea that I have, but not sure of its validity", but I did not see that before; it was probably added by edit after I initially read the post. You might want to be more careful about making accusations of plagiarism. I have gone out of my way in previous posts to cite your contributions, in particular relating the ##\delta P^\mu## and ##\delta g_{\mu\nu}## variations to Lie derivatives, which was very helpful.

As an aside, why are you not sure of its validity? I think I explained that in #44.
 
Last edited:
  • #47
I read the passage around 27.4 and didn't understand Dirac's derivation either. I came up with the following.

First we have an active transformation where we move each particle in the dust from $$z^{\mu} \rightarrow z^{\mu} + b^{\mu}$$

Then given this active transformation $$p^{\prime\mu}(z^{\mu}) = p^{\mu}(z^{\mu} - b^{\mu})$$. And then $$ \delta p^0(z^{\mu}) = p^{\prime 0}(z^{\mu}) - p^0(z^{\mu}) =
p^0(z^{\mu}) - p^0(z^{\mu}),_{\nu} b^{\nu} - p^0(z^{\mu}) = -p^0,_{\nu} b^{\nu}$$

Using a Taylor expansion above. Now we use the conservation law that ##p^{\mu},_{\mu} = 0## or ##p^0,_0=-p^r,_r## we replace ##p^0,_0## in the previous equation and get $$\delta p^0 = p^r,_rb^0 - p^0,_r b^r$$.

I haven't convinced myself, but I believe a similar argument works for the other components of ##\delta p##.
 
Last edited:
  • #48
jbergman said:
I haven't convinced myself, but I believe a similar argument works for the other components of ##\delta p##.
I believe that if the generalization of the equation below (27.3) to equation (27.4) could have been demonstrated within a short paragraph, Dirac would have shown it. It was in his nature. I think it was skipped because it is too long a detour for Dirac's packed book.

jbergman said:
Using a Taylor expansion above. Now we use the conservation law that ##p^{\mu},_{\mu} = 0## or ##p^0,_0=-p^r,_r## we replace ##p^0,_0## in the previous equation and get $$\delta p^0 = p^r,_rb^0 - p^0,_r b^r$$.
Your result lacks the ##-p^0 b^r{}_{,r}## term of equation (27.3).
Note that ##p^\mu## is defined as a vector density. Since ##b^r## is not constant in general, the coordinate distance between two neighboring "particles" after the displacement is not necessarily the same as it was before. This should affect the density. To lowest order, the needed corrections should be linear in the ##b^\mu{}_{,\nu}## .

jbergman said:
I read the passage around 27.4 and didn't understand Dirac's derivation either.
I can add some details that will make the derivation of (27.3) more clear (in my opinion). As for the generalization to (27.4), the best I have so far is post #20.
 
Last edited:
  • #49
JimWhoKnew said:
I believe that if the generalization of the equation below (27.3) to equation (27.4) could have been demonstrated within a short paragraph, Dirac would have shown it. It was in his nature. I think it was skipped because it is too long a detour for Dirac's packed book.
Maybe, maybe not.
JimWhoKnew said:
Your result lacks the ##-p^0 b^r{}_{,r}## term of equation (27.3).
Note that ##p^\mu## is defined as a vector density.
I was aware of this part.
JimWhoKnew said:
Since ##b^r## is not constant in general, the coordinate distance between two neighboring "particles" after the displacement is not necessarily the same as it was before.
Ok, this part, I was wondering whether ##b^r## was constant or not. I haven't read the book other than this section. Thanks for the clarification. That definitely renders my derivation invalid. I will see if I can salvage it.
JimWhoKnew said:
This should affect the density. To lowest order, the needed corrections should be linear in the ##b^\mu{}_{,\nu}## .


I can add some details that will make the derivation of (27.3) more clear (in my opinion). As for the generalization to (27.4), the best I have so far is post #20.
Look forward to it.
 
  • #50
JimWhoKnew said:
I believe that if the generalization of the equation below (27.3) to equation (27.4) could have been demonstrated within a short paragraph, Dirac would have shown it. It was in his nature. I think it was skipped because it is too long a detour for Dirac's packed book.


Your result lacks the ##-p^0 b^r{}_{,r}## term of equation (27.3).
Note that ##p^\mu## is defined as a vector density. Since ##b^r## is not constant in general, the coordinate distance between two neighboring "particles" after the displacement is not necessarily the same as it was before. This should affect the density. To lowest order, the needed corrections should be linear in the ##b^\mu{}_{,\nu}## .


I can add some details that will make the derivation of (27.3) more clear (in my opinion). As for the generalization to (27.4), the best I have so far is post #20.
BTW, I found an old thread that mentions your Lie Derivative approach.

https://www.physicsforums.com/threads/unraveling-diracs-general-relativity-equation.734239/

That's an interesting idea and seems correct. Still trying to wrap my head around it since we are dealing with a density induced from dust particles and it wasn't obvious to me that moving particles was the same as flowing the resulting density along a vector field.
 
Back
Top