Can someone explain why p terms are not canceling in this Diffeomorphisms proof?

They represent the components of vectors and commute. Look at the derivation of (130), the 2nd line follows from the first by commuting the components of X and Y. In (220) we're doing the same thing.
  • #1
latentcorpse
1,444
0
So I'm trying to prove eqn (223) in the notes attached in this thread:
https://www.physicsforums.com/showthread.php?t=457123

I took the equation [itex]( \phi^* ( \eta) ) ( X) = \eta ( \phi_* (X))[/itex]
and expanded in a coordinate basis as follows

[itex]( \phi^* ( \eta) )_\mu dx^\mu X = \eta_\alpha dy^\alpha \phi_*(X)[/itex]

So to get the result it seems like it should be a simple case of cross multiplying but unfortunately the [itex]X[/itex] doesn't cancel the [itex]\phi_*(X)[/itex]

Thanks for any help.
 
Physics news on Phys.org
  • #2


latentcorpse said:
So I'm trying to prove eqn (223) in the notes attached in this thread:
https://www.physicsforums.com/showthread.php?t=457123

I took the equation [itex]( \phi^* ( \eta) ) ( X) = \eta ( \phi_* (X))[/itex]
and expanded in a coordinate basis as follows

[itex]( \phi^* ( \eta) )_\mu dx^\mu X = \eta_\alpha dy^\alpha \phi_*(X)[/itex]

So to get the result it seems like it should be a simple case of cross multiplying but unfortunately the [itex]X[/itex] doesn't cancel the [itex]\phi_*(X)[/itex]

Thanks for any help.

The notation [itex]( \phi^* ( \eta) ) ( X) = \eta ( \phi_* (X))[/itex] always means that indices are contracted in the coordinate basis, so

[tex]( \phi^* ( \eta) ) ( X) = ( \phi^* ( \eta) )_\mu X^\mu, [/tex]

[tex] \eta ( \phi_* (X)) = \eta_\alpha ( \phi_* (X))^\alpha. [/tex]

Just think of the usual expression for a 1-form acting on a vector: [tex]\omega(X) = \omega_\mu X^\mu[/tex].
 
  • #3


fzero said:
The notation [itex]( \phi^* ( \eta) ) ( X) = \eta ( \phi_* (X))[/itex] always means that indices are contracted in the coordinate basis, so

[tex]( \phi^* ( \eta) ) ( X) = ( \phi^* ( \eta) )_\mu X^\mu, [/tex]

[tex] \eta ( \phi_* (X)) = \eta_\alpha ( \phi_* (X))^\alpha. [/tex]

Just think of the usual expression for a 1-form acting on a vector: [tex]\omega(X) = \omega_\mu X^\mu[/tex].

Ok. So then we get the equality:
[itex]( \phi^*( \eta ) )_\mu X^\mu = \eta_\alpha ( \phi_* ( X ))^\alpha[/itex]
I don't see how we can get this into the form of (223)?Additionally, I am trying to prove (187). Given the hint, I am able to multiply by those vectors and get

[itex]\nabla_X \nabla_Y Z^a- \nabla_Y \nabla_X Z^a = R^a{}_{bcd}Z^bX^cY^d = (R(X,Y)Z)^a[/itex]. But according to teh defn of the Riemann tensor (see (178)), I am missing a [itex]\nabla_{[X,Y]}[/itex] piece. Any ideas?
 
  • #4


latentcorpse said:
Ok. So then we get the equality:
[itex]( \phi^*( \eta ) )_\mu X^\mu = \eta_\alpha ( \phi_* ( X ))^\alpha[/itex]
I don't see how we can get this into the form of (223)?

You can use (221).

Additionally, I am trying to prove (187). Given the hint, I am able to multiply by those vectors and get

[itex]\nabla_X \nabla_Y Z^a- \nabla_Y \nabla_X Z^a = R^a{}_{bcd}Z^bX^cY^d = (R(X,Y)Z)^a[/itex]. But according to teh defn of the Riemann tensor (see (178)), I am missing a [itex]\nabla_{[X,Y]}[/itex] piece. Any ideas?

You have to use the product rule correctly on the terms [tex]\nabla_b\nabla_c(Z^a X^bY^c)[/tex] and (135).
 
  • #5


fzero said:
You can use (221).
Ok. That works. But how do I prove (221)? There is no identity to help me there like there was for (223).

fzero said:
You have to use the product rule correctly on the terms [tex]\nabla_b\nabla_c(Z^a X^bY^c)[/tex] and (135).
So would it go something like this:

[itex]\nabla_c ( Z^a{}_{;d} X^c Y^d + Z^a X^c{}_{;d} Y^d + \nabla_Y Z^a X^c) - \nabla_d ( Z^a{}_{;c} X^c Y^d + \nabla_X Z^a Y^d + Z^a X^c Y^d{}_{;c} )[/itex]?

If I expand the next derivative, I'll get terms like [itex](Z^a{}_{;dc}-Z^a{}_{;cd})X^cY^d[/itex] but I can't cancel them as the second covariant derivatives don't commute for tensor fields, do they? Maybe I'm still doing something wrong...
 
  • #6


latentcorpse said:
fzero said:
You can use (221).
Ok. That works. But how do I prove (221)? There is no identity to help me there like there was for (223).

(221) follows in one or two lines from (220).

So would it go something like this:

[itex]\nabla_c ( Z^a{}_{;d} X^c Y^d + Z^a X^c{}_{;d} Y^d + \nabla_Y Z^a X^c) - \nabla_d ( Z^a{}_{;c} X^c Y^d + \nabla_X Z^a Y^d + Z^a X^c Y^d{}_{;c} )[/itex]?

If I expand the next derivative, I'll get terms like [itex](Z^a{}_{;dc}-Z^a{}_{;cd})X^cY^d[/itex] but I can't cancel them as the second covariant derivatives don't commute for tensor fields, do they? Maybe I'm still doing something wrong...

OK, it actually works out much faster if you multiply from the left. Acting with the derivatives the way I originally suggested probably does give extra terms. From the left, you end up with

[tex] Y^d \nabla_X \nabla_d Z^a - X^c \nabla_Y \nabla_c Z^a[/tex]

that you have to manipulate with the product rule.
 
  • #7


fzero said:
(221) follows in one or two lines from (220).
I get
[itex]( \phi_*(X))^\alpha (\frac{\partial}{\partial y^\alpha})_{\phi(p)} (f) = X^\nu (\frac{\partial}{\partial x^\nu})_p \phi^*(f)[/itex]
Again, I'm having trouble getting rid of the [itex]\phi^*[/itex] on the right

fzero said:
OK, it actually works out much faster if you multiply from the left. Acting with the derivatives the way I originally suggested probably does give extra terms. From the left, you end up with

[tex] Y^d \nabla_X \nabla_d Z^a - X^c \nabla_Y \nabla_c Z^a[/tex]

that you have to manipulate with the product rule.

Ok. You appear to have multiplied from the left by [itex]Y^dX^c[/itex] on the first term but [itex]X^cY^d[/itex] on the second term. Now we are using abstract index notation so these both represent vectors rather than the components of vectors. How do you know those vectors commute?

Thanks.
 
  • #8


latentcorpse said:
I get
[itex]( \phi_*(X))^\alpha (\frac{\partial}{\partial y^\alpha})_{\phi(p)} (f) = X^\nu (\frac{\partial}{\partial x^\nu})_p \phi^*(f)[/itex]
Again, I'm having trouble getting rid of the [itex]\phi^*[/itex] on the right

You need to use that [tex]\phi[/tex] is the map [tex]x^\mu\rightarrow y^\alpha[/tex] and the chain rule.

Ok. You appear to have multiplied from the left by [itex]Y^dX^c[/itex] on the first term but [itex]X^cY^d[/itex] on the second term. Now we are using abstract index notation so these both represent vectors rather than the components of vectors. How do you know those vectors commute?

Thanks.

They represent the components of vectors and commute. Look at the derivation of (130), the 2nd line follows from the first by commuting the components of [tex]X[/tex] and [tex]Y[/tex].
 
  • #9


fzero said:
You need to use that [tex]\phi[/tex] is the map [tex]x^\mu\rightarrow y^\alpha[/tex] and the chain rule.
well [itex]\phi[/itex] has components [itex]y^\alpha(x^\mu)[/itex]
I can see we want to introduce something like dy/dx d/dy on the RHS but I just don't see how?

fzero said:
They represent the components of vectors and commute. Look at the derivation of (130), the 2nd line follows from the first by commuting the components of [tex]X[/tex] and [tex]Y[/tex].
Right. But there he is using basis indices (see chapter 7).
Although he does say in Chapter 7 that we are free to change abstract indices to basis indices to get an equation that is true in an arbitrary basis so I guess everything is ok providing we work this out in greek indices?

Anyway I tried working out the following using the product rule:
[itex]\nabla_Y (X^\mu \nabla_\mu Z^\nu) = Y^\rho \nabla_\rho (X^\mu \nabla_\mu Z^\nu) = Y^\rho X^\mu{}_{; \rho} \nabla_\mu Z^\nu + Y^\rho X^\mu \nabla_\rho \nabla_\mu Z^\nu[/itex]
in the hope that one of those terms we get would be useful but I don't see it?
 
  • #10


latentcorpse said:
well [itex]\phi[/itex] has components [itex]y^\alpha(x^\mu)[/itex]
I can see we want to introduce something like dy/dx d/dy on the RHS but I just don't see how?

Use [tex]\phi^*(f) = f\circ \phi[/tex] applied to the coordinates [tex]x^\mu[/tex].

Right. But there he is using basis indices (see chapter 7).
Although he does say in Chapter 7 that we are free to change abstract indices to basis indices to get an equation that is true in an arbitrary basis so I guess everything is ok providing we work this out in greek indices?

Anyway I tried working out the following using the product rule:
[itex]\nabla_Y (X^\mu \nabla_\mu Z^\nu) = Y^\rho \nabla_\rho (X^\mu \nabla_\mu Z^\nu) = Y^\rho X^\mu{}_{; \rho} \nabla_\mu Z^\nu + Y^\rho X^\mu \nabla_\rho \nabla_\mu Z^\nu[/itex]
in the hope that one of those terms we get would be useful but I don't see it?

I'm not sure why you chose that expression. We had

[tex]
Y^d \nabla_X \nabla_d Z^a - X^c \nabla_Y \nabla_c Z^a,
[/tex]

so you want to re-express

[tex] Y^d \nabla_X \nabla_d Z^a = \nabla_X(Y^d \nabla_d Z^a) - ( \nabla_X Y^d )(\nabla_d Z^a)[/tex]

and similarly the other term.
 
  • #11


fzero said:
Use [tex]\phi^*(f) = f\circ \phi[/tex] applied to the coordinates [tex]x^\mu[/tex].
Sorry I still don't get. The [itex]\phi[/itex]'s are charts so they have components [itex]x^\mu[/itex] so how would this work?
 
  • #12


fzero said:
Use [tex]\phi^*(f) = f\circ \phi[/tex] applied to the coordinates [tex]x^\mu[/tex].



I'm not sure why you chose that expression. We had

[tex]
Y^d \nabla_X \nabla_d Z^a - X^c \nabla_Y \nabla_c Z^a,
[/tex]

so you want to re-express

[tex] Y^d \nabla_X \nabla_d Z^a = \nabla_X(Y^d \nabla_d Z^a) - ( \nabla_X Y^d )(\nabla_d Z^a)[/tex]

and similarly the other term.

Sorry, I have been working on some other stuff for the last few days and didn't have time to look at it again until now. I still do not get how either of these work? The second one I keep getting bogged down with what to do with algebra. The first one I don't really follow what you want me to do.

Thanks again.
 
  • #13


There really isn't any algebra to get bogged down with,

[tex]
\nabla_X(Y^d \nabla_d Z^a) = \nabla_X \nabla_Y Z^a,
[/tex]

while [tex] ( \nabla_X Y^d )[/tex] is one term in the expression (130) for [tex][X,Y]^d[/tex].

As for the other part, if [tex]\phi(x^\mu) = y^\alpha[/tex], then what is [tex](f\circ \phi)(x^\mu)[/tex]?
 
  • #14


fzero said:
There really isn't any algebra to get bogged down with,

[tex]
\nabla_X(Y^d \nabla_d Z^a) = \nabla_X \nabla_Y Z^a,
[/tex]

while [tex] ( \nabla_X Y^d )[/tex] is one term in the expression (130) for [tex][X,Y]^d[/tex].

As for the other part, if [tex]\phi(x^\mu) = y^\alpha[/tex], then what is [tex](f\circ \phi)(x^\mu)[/tex]?

Ok. I get us down to

[itex]\nabla_X \nabla_Y Z^a - \nabla_Y \nabla_X Z^a - ( \nabla_X Y^d)(\nabla_d Z^a) + ( \nabla_Y X^c)( \nabla_c Z^a)[/itex]

Now from our Riemann tensor we want those last terms to give us a [itex]-\nabla_{[X,Y]}Z^a[/itex]

So I tried working backwards:

[itex]\nabla_{[X,Y]}Z^a=\nabla_{XY-YX}Z^a = \nabla_{XY}Z^a-\nabla_{YX}Z^a[/itex]

Now I don't know how to get to the next line there? Could you explain please?

As for the other bit I would find that [itex](f \circ \phi)(x^\mu) = f(y^\alpha)[/itex]

Thanks.
 
  • #15


latentcorpse said:
Ok. I get us down to

[itex]\nabla_X \nabla_Y Z^a - \nabla_Y \nabla_X Z^a - ( \nabla_X Y^d)(\nabla_d Z^a) + ( \nabla_Y X^c)( \nabla_c Z^a)[/itex]

Now from our Riemann tensor we want those last terms to give us a [itex]-\nabla_{[X,Y]}Z^a[/itex]

Just write

[tex] - ( \nabla_X Y^d)(\nabla_d Z^a) + ( \nabla_Y X^c)( \nabla_c Z^a) = - ( \nabla_X Y^d-\nabla_Y X^d)(\nabla_d Z^a)[/tex]

and use (130).

So I tried working backwards:

[itex]\nabla_{[X,Y]}Z^a=\nabla_{XY-YX}Z^a = \nabla_{XY}Z^a-\nabla_{YX}Z^a[/itex]

The last part makes no sense at all, since [tex]XY[/tex] and [tex]YX[/tex] are not tangent vectors.

As for the other bit I would find that [itex](f \circ \phi)(x^\mu) = f(y^\alpha)[/itex]

So you need to go back a few posts and figure out what that's good for.
 
  • #16


fzero said:
Just write

[tex] - ( \nabla_X Y^d)(\nabla_d Z^a) + ( \nabla_Y X^c)( \nabla_c Z^a) = - ( \nabla_X Y^d-\nabla_Y X^d)(\nabla_d Z^a)[/tex]

and use (130).
The last part makes no sense at all, since [tex]XY[/tex] and [tex]YX[/tex] are not tangent vectors.
So you need to go back a few posts and figure out what that's good for.

Ok. So I have got the RIcci identity to work out.

As for the transformation, I assume I am meant to go back to[itex]( \phi_*(X))^\alpha (\frac{\partial}{\partial y^\alpha})_{\phi(p)} (f) = X^\nu (\frac{\partial}{\partial x^\nu})_p \phi^*(f)[/itex]

and get rid of the [itex]\phi^*(f)[/itex] on the RHS.
Should I act both sides on the coordinates [itex]x^\mu[/itex]? That's the only way I can see of being able to use the [itex]f(x^\mu)=y^\alpha[/itex] property.Also, on p90, he says that we should think of the singularity as a time rather than a place inside the black hole because r plays the role of the time coordinate. How does this work?

Thanks.
 
Last edited:
  • #17


latentcorpse said:
Ok. So I have got the RIcci identity to work out.

As for the transformation, I assume I am meant to go back to


[itex]( \phi_*(X))^\alpha (\frac{\partial}{\partial y^\alpha})_{\phi(p)} (f) = X^\nu (\frac{\partial}{\partial x^\nu})_p \phi^*(f)[/itex]

and get rid of the [itex]\phi^*(f)[/itex] on the RHS.
Should I act both sides on the coordinates [itex]x^\mu[/itex]?

Then try doing that. It's much better to try to work it out before asking the question.

That's the only way I can see of being able to use the [itex]f(x^\mu)=y^\alpha[/itex] property.

Hopefully you mean [itex]\phi(x^\mu)=y^\alpha[/itex]

Also, on p90, he says that we should think of the singularity as a time rather than a place inside the black hole because r plays the role of the time coordinate. How does this work?

Thanks.

You should be able to use the metric to show that the radial vector is timelike for [tex]r<2M[/tex].
 
  • #18


fzero said:
Then try doing that. It's much better to try to work it out before asking the question.



Hopefully you mean [itex]\phi(x^\mu)=y^\alpha[/itex]



You should be able to use the metric to show that the radial vector is timelike for [tex]r<2M[/tex].

Ok. I tried taking [itex]f=y^\alpha[/itex] but then on the LHS, this canceled the d/dy i had and similarly on the RHS it gets pulled back to an [itex]x^\mu[/itex] which cancels the d/dx we have?

This may be a stupid question but what is the form of the radial vector? Would it just be [itex]X=\frac{\partial}{\partial r} \Rightarrow X^\mu(0,1,0,0)[/itex] in Schwarzschild coordinates?
If so, when I calculate [itex]g_{rr}X^rX^r[/itex] I find it to be spacelike so I'm guessing that's wrong.
Moreover, how does showing a vector is timelike tell us it behaves like a time coordinate/that t doesn't behave like a time coordinate?
 
  • #19


latentcorpse said:
Ok. I tried taking [itex]f=y^\alpha[/itex] but then on the LHS, this canceled the d/dy i had and similarly on the RHS it gets pulled back to an [itex]x^\mu[/itex] which cancels the d/dx we have?

Leave [tex]f[/tex] arbitrary, but note that you have [tex](\partial/\partial y^\alpha )f(y^\alpha)[/tex] on the LHS and [tex](\partial/\partial x^\mu) f(y^\alpha)[/tex] on the RHS. You want to use the chain rule on the RHS and equate coefficients.

This may be a stupid question but what is the form of the radial vector? Would it just be [itex]X=\frac{\partial}{\partial r} \Rightarrow X^\mu(0,1,0,0)[/itex] in Schwarzschild coordinates?

Yes.

If so, when I calculate [itex]g_{rr}X^rX^r[/itex] I find it to be spacelike so I'm guessing that's wrong.

You might want to show your work. You should find that it's spacelike for [tex]r>2M[/tex] and timelike for [tex]r<2M[/tex].

Moreover, how does showing a vector is timelike tell us it behaves like a time coordinate/that t doesn't behave like a time coordinate?

How else would you decide that something is or isn't a time-coordinate?
 
  • #20


Leave [tex]f[/tex] arbitrary, but note that you have [tex](\partial/\partial y^\alpha )f(y^\alpha)[/tex] on the LHS and [tex](\partial/\partial x^\mu) f(y^\alpha)[/tex] on the RHS. You want to use the chain rule on the RHS and equate coefficients.
[/QUOTE]
What happened to your [itex]\phi^*(f)[/itex] on the RHS - you have just made it [itex]f[/itex], no?


fzero said:
You might want to show your work. You should find that it's spacelike for [tex]r>2M[/tex] and timelike for [tex]r<2M[/tex].
Well in the notes it says to work with the Schwarzschild coordinates so I used the Schwarzschild metric. Now I know that the metric that covers the r<2M is the one defined using ingoing Eddington Finkelstein coordinates but, we know that if we're in the region r<2M and we change back from EF to Schwarz coordinates we should get the Schwarz metric back again (but now applied to the r<2M region)> Isn't this guaranteed by Birkhoff's theorem which tells us that Schwarzschild is unique? So I guess my question is, why does it not work if you use the Schwarzschild metric?

Anyway, let's say I try with the metric in EF coords...

we have [itex]g_{\mu \nu} X^\mu X^\nu[/itex] with [itex]X^\mu=(0,1,0,0)[/itex]
so we would have [itex]g_{\mu \nu} X^\mu X^\nu=g_{rr}X^rX^r=g_{rr}=0[/itex]?

I'm sure I've done this before. I can't understand what I am doing wrong!


fzero said:
How else would you decide that something is or isn't a time-coordinate?
[/QUOTE]
I assume that checking it's timelike is the only way? Presumably then t would be spacelike in this region? How would we prove this since there is no t coordinate in the EF coord metric?

Thanks again.
 
  • #21


latentcorpse said:
What happened to your [itex]\phi^*(f)[/itex] on the RHS - you have just made it [itex]f[/itex], no?

It's been evaluated. [itex]\phi^*(f)[/itex] takes [tex]x^\mu\in M[/tex] to [tex]f(y^\alpha)[/tex].

Well in the notes it says to work with the Schwarzschild coordinates so I used the Schwarzschild metric. Now I know that the metric that covers the r<2M is the one defined using ingoing Eddington Finkelstein coordinates but, we know that if we're in the region r<2M and we change back from EF to Schwarz coordinates we should get the Schwarz metric back again (but now applied to the r<2M region)> Isn't this guaranteed by Birkhoff's theorem which tells us that Schwarzschild is unique?

Everything you say above is correct. The Schwarzschild coordinates are not complete because the [tex]r>2M[/tex] region does not connect to the [tex]r<2M[/tex] interior. However, as you say we can use ingoing EF coordinates to connect the two regions and transform back.

So I guess my question is, why does it not work if you use the Schwarzschild metric?

Anyway, let's say I try with the metric in EF coords...

we have [itex]g_{\mu \nu} X^\mu X^\nu[/itex] with [itex]X^\mu=(0,1,0,0)[/itex]
so we would have [itex]g_{\mu \nu} X^\mu X^\nu=g_{rr}X^rX^r=g_{rr}=0[/itex]?

I'm sure I've done this before. I can't understand what I am doing wrong!

The radial vector is [itex]X^\mu=(0,1,0,0)[/itex] in Schwarzschild coordinates, but not in EF.

The simplest way I'd work this out is by using the Schwarzschild coordinates for the region [tex]r<2M[/tex]. Then it's almost trivial to show that the radial and time coordinates get switched. One can complain that the coordinates are not valid past [tex]r=2M[/tex], but I don't see anything wrong with using EF to extend through [tex]r=2M[/tex] and then transforming back.

One can obviously show the same thing in EF coordinates, it just seems like it's going to be a lot harder. One can see how it works qualitatively by looking at the light cones in the Finklestein diagram on pg 18 of Townsend's BH notes. It's actually a little bit complicated to work those lightcones out, but is sort of a useful exercise if your morals prevent you from believing my suggestion above.

I assume that checking it's timelike is the only way? Presumably then t would be spacelike in this region? How would we prove this since there is no t coordinate in the EF coord metric?

Thanks again.
 
  • #22


fzero said:
It's been evaluated. [itex]\phi^*(f)[/itex] takes [tex]x^\mu\in M[/tex] to [tex]f(y^\alpha)[/tex].
So did you apply the LHS to [itex]y^\alpha[/itex] seeing as this is something in [itex]N[/itex] and the RHS to [itex]x^\mu[/itex] since these are the coordinates on [itex]M[/itex]? How is that allowed, surely we have to act with the same thing on both sides?

fzero said:
Everything you say above is correct. The Schwarzschild coordinates are not complete because the [tex]r>2M[/tex] region does not connect to the [tex]r<2M[/tex] interior. However, as you say we can use ingoing EF coordinates to connect the two regions and transform back.



The radial vector is [itex]X^\mu=(0,1,0,0)[/itex] in Schwarzschild coordinates, but not in EF.

The simplest way I'd work this out is by using the Schwarzschild coordinates for the region [tex]r<2M[/tex]. Then it's almost trivial to show that the radial and time coordinates get switched. One can complain that the coordinates are not valid past [tex]r=2M[/tex], but I don't see anything wrong with using EF to extend through [tex]r=2M[/tex] and then transforming back.

One can obviously show the same thing in EF coordinates, it just seems like it's going to be a lot harder. One can see how it works qualitatively by looking at the light cones in the Finklestein diagram on pg 18 of Townsend's BH notes. It's actually a little bit complicated to work those lightcones out, but is sort of a useful exercise if your morals prevent you from believing my suggestion above.

So you're telling me it's easier to work it out in Schwarzschild coords where [itex]X^\mu=(0,1,0,0)[/itex]. But it doesn't make sense to use the EF metric with this (since the EF metric involves [itex]v[/itex] which isn't a Schwarzschild cooridnate) so I would have to use the Schwarzschild metric but then I have the same problem as in my last post? The Schwarzschild metric is going to be the same in the region r<2M and the region r>2M, isn't it? We only introduce the EF coordinates for the purposes of showing we can smoothly extend through the r=2M coordinate singularity, don't we?
 
  • #23


latentcorpse said:
So did you apply the LHS to [itex]y^\alpha[/itex] seeing as this is something in [itex]N[/itex] and the RHS to [itex]x^\mu[/itex] since these are the coordinates on [itex]M[/itex]? How is that allowed, surely we have to act with the same thing on both sides?

I'd suggest you reread pg 67 in detail again, because I can't explain it any better without reproducing all of that again or drawing pictures. If we have a function [tex]A: M \rightarrow T[/tex] and a function [tex]B:N\rightarrow T[/tex], it's perfectly acceptable to say that [tex]A(x) = B(y)[/tex] as an equality on [tex]T[/tex], even though these act on different spaces. A map [tex]\phi:M\rightarrow N[/tex] can be used to explore such an equality and that is the whole point of the discussion.

So you're telling me it's easier to work it out in Schwarzschild coords where [itex]X^\mu=(0,1,0,0)[/itex]. But it doesn't make sense to use the EF metric with this (since the EF metric involves [itex]v[/itex] which isn't a Schwarzschild cooridnate) so I would have to use the Schwarzschild metric but then I have the same problem as in my last post? The Schwarzschild metric is going to be the same in the region r<2M and the region r>2M, isn't it? We only introduce the EF coordinates for the purposes of showing we can smoothly extend through the r=2M coordinate singularity, don't we?

I'm saying first that you would have to work out the transformation of [itex]X^\mu=(0,1,0,0)[/itex] under the EF coordinate change. If you did this, you could presumably use the EF coordinates to compute the inner product.

Secondly, that the metric is the "same" does not mean that [itex]X^\mu X_\mu[/itex] has the same value in both regions. Just try to compute it and see for yourself.
 
  • #24


fzero said:
I'd suggest you reread pg 67 in detail again, because I can't explain it any better without reproducing all of that again or drawing pictures. If we have a function [tex]A: M \rightarrow T[/tex] and a function [tex]B:N\rightarrow T[/tex], it's perfectly acceptable to say that [tex]A(x) = B(y)[/tex] as an equality on [tex]T[/tex], even though these act on different spaces. A map [tex]\phi:M\rightarrow N[/tex] can be used to explore such an equality and that is the whole point of the discussion.
Think I have it:
[itex]( \phi_*(X))^\alpha \left( \frac{\partial}{\partial y^\alpha} \right)_{\phi(p)} f(y^\beta) = X^\mu \left( \frac{\partial}{\partial x^\mu} \right)_p (\phi^* f)(x^\nu)[/itex]
[itex]( \phi_*(X))^\alpha \left( \frac{\partial}{\partial y^\alpha} \right)_{\phi(p)} f(y^\beta) = X^\mu \left( \frac{\partial}{\partial x^\mu} \right)_p f(y^\beta)[/itex]
using [itex] (\phi^*f)(x^\nu)=f(\phi(x^\nu))=f(y^\beta)[/itex]
so [itex]( \phi_*(X))^\alpha = X^\mu \left( \frac{\partial y^\alpha}{\partial x^\mu} \right)_p [/itex]

One little problem though is that in the 2nd last line I had the d/dy evaluated at phi(p) but then when I cross multiplied I now miraculously have it evaluated at p. How does that work?

fzero said:
I'm saying first that you would have to work out the transformation of [itex]X^\mu=(0,1,0,0)[/itex] under the EF coordinate change. If you did this, you could presumably use the EF coordinates to compute the inner product.

Ok. So EF coords require us to find [itex]v=t+r_*=t+r+2M \log{ | \frac{r}{2M}-1 |}[/itex]
Taking t=0 and r=1 I find
[itex]v=1+2M \log{ | \frac{1-2M}{2M} | }[/itex]
So [itex]X^\mu=(1+2M \log{ | \frac{1-2M}{2M} | },1,0,0)[/itex]
Surely those components are wrong?

fzero said:
Secondly, that the metric is the "same" does not mean that [itex]X^\mu X_\mu[/itex] has the same value in both regions. Just try to compute it and see for yourself.

But what about in the same region? If we are in r<2M then surely the [itex]X^\mu X_\mu[/itex] has the same value whether we compute wrt the EF metric (with [itex]X^\mu[/itex] expressed in EF coords) or if we compute wrt the Schwarzschild metric (with [itex]X^\mu[/itex] expressed in Schwarzschild coords). But clearly evaluating in the Schwarzschild gives it as being spacelike?
 
  • #25


latentcorpse said:
Think I have it:
[itex]( \phi_*(X))^\alpha \left( \frac{\partial}{\partial y^\alpha} \right)_{\phi(p)} f(y^\beta) = X^\mu \left( \frac{\partial}{\partial x^\mu} \right)_p (\phi^* f)(x^\nu)[/itex]
[itex]( \phi_*(X))^\alpha \left( \frac{\partial}{\partial y^\alpha} \right)_{\phi(p)} f(y^\beta) = X^\mu \left( \frac{\partial}{\partial x^\mu} \right)_p f(y^\beta)[/itex]
using [itex] (\phi^*f)(x^\nu)=f(\phi(x^\nu))=f(y^\beta)[/itex]
so [itex]( \phi_*(X))^\alpha = X^\mu \left( \frac{\partial y^\alpha}{\partial x^\mu} \right)_p [/itex]

One little problem though is that in the 2nd last line I had the d/dy evaluated at phi(p) but then when I cross multiplied I now miraculously have it evaluated at p. How does that work?

I'd say that

[tex] \left( \frac{\partial}{\partial x^\mu} \right)_p = \left( \frac{\partial y^\alpha}{\partial x^\mu} \right)_p \left( \frac{\partial}{\partial y^\alpha} \right)_{\phi(p)} [/tex]

Ok. So EF coords require us to find [itex]v=t+r_*=t+r+2M \log{ | \frac{r}{2M}-1 |}[/itex]
Taking t=0 and r=1 I find
[itex]v=1+2M \log{ | \frac{1-2M}{2M} | }[/itex]
So [itex]X^\mu=(1+2M \log{ | \frac{1-2M}{2M} | },1,0,0)[/itex]
Surely those components are wrong?

That's not how the components of a vector transform under a change of coordinates.

But what about in the same region? If we are in r<2M then surely the [itex]X^\mu X_\mu[/itex] has the same value whether we compute wrt the EF metric (with [itex]X^\mu[/itex] expressed in EF coords) or if we compute wrt the Schwarzschild metric (with [itex]X^\mu[/itex] expressed in Schwarzschild coords). But clearly evaluating in the Schwarzschild gives it as being spacelike?

It is not spacelike. This would be immediately apparent if you would actually do the computation.
 
  • #26


fzero said:
That's not how the components of a vector transform under a change of coordinates.
[itex]X'^\mu = A^\mu{}_\nu X^\nu[/itex]
where [itex]A^\mu{}_\nu = \frac{\partial x'^\mu}{\partial x^\nu}[/itex]

So what is this [itex]A[/itex] matrix?

I find it has non zero cpts [itex]A=\text{diag} ( \frac{\partial v}{\partial t} , 1 , 1 , 1)[/itex]
but then [itex]v=t+r+2M \log{ | \frac{r}{2M}-1 |} \Rightarrow \frac{\partial v}{ \partial t}=1[/itex]

but then we just have a identity matrix which doesn't seem very useful?

fzero said:
It is not spacelike. This would be immediately apparent if you would actually do the computation.
So when I get the calculation to work, I should find they should give identical results (hopefully)?
 
  • #27


latentcorpse said:
[itex]X'^\mu = A^\mu{}_\nu X^\nu[/itex]
where [itex]A^\mu{}_\nu = \frac{\partial x'^\mu}{\partial x^\nu}[/itex]

So what is this [itex]A[/itex] matrix?

I find it has non zero cpts [itex]A=\text{diag} ( \frac{\partial v}{\partial t} , 1 , 1 , 1)[/itex]
but then [itex]v=t+r+2M \log{ | \frac{r}{2M}-1 |} \Rightarrow \frac{\partial v}{ \partial t}=1[/itex]

but then we just have a identity matrix which doesn't seem very useful?

[tex]\partial v/\partial r[/tex] is non-vanishing too.

So when I get the calculation to work, I should find they should give identical results (hopefully)?

I expect so.
 
  • #28


fzero said:
[tex]\partial v/\partial r[/tex] is non-vanishing too.
I expect so.

I'm finding [itex]X'^\mu= \begin{pmatrix} 2 + \frac{2M}{r-2M} \\ 1 \\ 1 \\ 1 \end{pmatrix}[/itex]

But things don't really simplify much when I work out [itex]g_{\mu \nu} X'^\mu X'^\nu[/itex]?Also, in Chapter 34, he says that outgoing radial null geodesics inside the region r<2M will have decreasing r and therefore reach the r=0 singularity in finite affine parameter. This makes sense to me physically but I don't see how the maths backs it up. We have shown that [itex]\frac{dr}{d \tau}=1[/itex] for these radial null geodesics and so wouldn't they have increasing r i.e. be able to escape from the black hole?
 
Last edited:
  • #29


latentcorpse said:
Also, in Chapter 34, he says that outgoing radial null geodesics inside the region r<2M will have decreasing r and therefore reach the r=0 singularity in finite affine parameter. This makes sense to me physically but I don't see how the maths backs it up. We have shown that [itex]\frac{dr}{d \tau}=1[/itex] for these radial null geodesics and so wouldn't they have increasing r i.e. be able to escape from the black hole?

If you just blindly apply (259), then you'll find that for [tex]r<2M[/tex] coordinate time is decreasing along the affine parameter. When you fix this, you'll find that the radius is now decreasing. Alternatively, you can simply plot the geodesics (268) which is apparently done for you in the text (the version I downloaded doesn't display any figures).
 
  • #30


fzero said:
If you just blindly apply (259), then you'll find that for [tex]r<2M[/tex] coordinate time is decreasing along the affine parameter. When you fix this, you'll find that the radius is now decreasing. Alternatively, you can simply plot the geodesics (268) which is apparently done for you in the text (the version I downloaded doesn't display any figures).

Yeah we have the plot which does make it clear. I'm just trying to convince myself with the maths. So, the expression for [itex]\frac{dt}{d \tau}[/itex] makes it clear that for r<2M, we have the coordinate time decreasing along the affine parameter.

What do you mean by fix it?

The only way I can think of to see what the radial coordinate is doing along the geodesic is to find its rate of change wrt the affine parameter and that is given as [itex]\frac{dr}{d \tau}=1[/itex] so why does that not mean the geodesic will have increasing r?

And do you have any advice about my attempt to do that calculation in the last post? Am I going about it on the right lines?

Thanks again!
 
  • #31


latentcorpse said:
Yeah we have the plot which does make it clear. I'm just trying to convince myself with the maths. So, the expression for [itex]\frac{dt}{d \tau}[/itex] makes it clear that for r<2M, we have the coordinate time decreasing along the affine parameter.

What do you mean by fix it?

Change the sign of [tex]\tau[/tex].

And do you have any advice about my attempt to do that calculation in the last post? Am I going about it on the right lines?

I expect that your expression for [tex]U^\mu[/tex] is wrong, since I don't think the angular components change. I think you chose the wrong sign in the log too.
 
  • #32


fzero said:
Change the sign of [tex]\tau[/tex].
Why do we do that?
Surely [itex]\tau[/itex], being the affine parameter, can only increase?



fzero said:
I expect that your expression for [tex]U^\mu[/tex] is wrong, since I don't think the angular components change. I think you chose the wrong sign in the log too.

Ok. so we had [itex]X'^\mu = A^\mu_\nu X^\nu[/itex]
where [itex]A^\mu _\nu = \frac{\partial x^\mu}{\partial x^\nu}[/itex]

Now our sets of coords are (v,r,theta,phi) and (t,r,theta,phi)

So [itex]X'^1=X'^2=X'^3=1[/itex] (I'm hoping my [itex]X'^1[/itex] component is ok there. I don't think r in EF varies with t, theta or phi so I think it's ok.

So we need to work out [itex]\frac{\partial v}{\partial x^\nu}[/itex]
but [itex]v=t+r_*=t+r+2M \log{| \frac{r}{2M}-1|}[/itex]
so [itex]\frac{\partial v}{\partial t}=1[/itex] and [itex]\frac{\partial v}{\partial r} = 1 +\frac{2M}{\frac{r}{2M}-1} \frac{1}{2M}=1+\frac{2M}{r-2M}[/itex]

so presumably these add to give [itex]\frac{\partial v}{\partial x^\nu}=2 + \frac{2M}{r-2M}[/itex]?


One small problem I encountered whilst reading is why in the 1st line of (35) do we only vary the derivative terms and not the metric outside the brackets?

Cheers.
 
  • #33


latentcorpse said:
Why do we do that?
Surely [itex]\tau[/itex], being the affine parameter, can only increase?

You're free to choose the affine parameter however you want. If you're choosing it as a proper time, you'd generally want the direction to go the same way as the coordinate time, if there's no reason why it shouldn't.



Ok. so we had [itex]X'^\mu = A^\mu_\nu X^\nu[/itex]
where [itex]A^\mu _\nu = \frac{\partial x^\mu}{\partial x^\nu}[/itex]

Now our sets of coords are (v,r,theta,phi) and (t,r,theta,phi)

So [itex]X'^1=X'^2=X'^3=1[/itex] (I'm hoping my [itex]X'^1[/itex] component is ok there. I don't think r in EF varies with t, theta or phi so I think it's ok.

You were originally transforming a vector [tex]X^\mu = (0,1,0,0)[/tex].

So we need to work out [itex]\frac{\partial v}{\partial x^\nu}[/itex]
but [itex]v=t+r_*=t+r+2M \log{| \frac{r}{2M}-1|}[/itex]
so [itex]\frac{\partial v}{\partial t}=1[/itex] and [itex]\frac{\partial v}{\partial r} = 1 +\frac{2M}{\frac{r}{2M}-1} \frac{1}{2M}=1+\frac{2M}{r-2M}[/itex]

so presumably these add to give [itex]\frac{\partial v}{\partial x^\nu}=2 + \frac{2M}{r-2M}[/itex]?

[tex]\log{\left| \frac{r}{2M}-1\right|} = \log\left(1-\frac{r}{2M}\right)[/tex]

when [tex]r<2M[/tex].

One small problem I encountered whilst reading is why in the 1st line of (35) do we only vary the derivative terms and not the metric outside the brackets?

Cheers.

I've no idea what equation (35) you're referring to. In my copy of Reall's notes, (35) is a product between a covector and a vector.
 
  • #34


fzero said:
You're free to choose the affine parameter however you want. If you're choosing it as a proper time, you'd generally want the direction to go the same way as the coordinate time, if there's no reason why it shouldn't.
This just seems like we're cheating almost. Surely we're also free to take [itex]\tau[/itex] increasing in which case we still have the result [itex]\frac{dr}{d \tau}=+1[/itex]. No?



fzero said:
You were originally transforming a vector [tex]X^\mu = (0,1,0,0)[/tex].

Ok. So I think I finally have it.

I find [itex]X'^\mu=(1-\frac{2M}{2M-r},1,0,0)[/itex]

and then [itex]g(X,X)= \frac{4M^2}{(2M-r)r} - \frac{6M^3}{(2M-r)r^2} + 1 - \frac{2M}{2M-r}[/itex]
I find this can be rewritten as
[itex]g(X,X)=1-\frac{2M}{2M-r} \left( \left( \sqrt{3} \frac{M}{r} - \frac{1}{\sqrt{3}} \right)^2 + \frac{8}{9} \right)[/itex]

Now everything in the brackets is positive and [itex]\frac{2M}{2M-r}>1[/itex] for [itex]r<2M[/itex] so we get [itex]g(X,X)<0[/itex]
Does this look ok?

How about doing the same thing in Schwarzschild coordinates? What did I do wrong the 1st time I tried this and ended up finding [itex]r[/itex] to be spacelike. I'm curious as to how they agree!


fzero said:
I've no idea what equation (35) you're referring to. In my copy of Reall's notes, (35) is a product between a covector and a vector.

Sorry! I meant (355). The first line of that. Why does the [itex]\delta[/itex] not hit the metric in front of the brackets?

Cheers.
 
  • #35


latentcorpse said:
This just seems like we're cheating almost. Surely we're also free to take [itex]\tau[/itex] increasing in which case we still have the result [itex]\frac{dr}{d \tau}=+1[/itex]. No?

If you get the sign of [tex]\tau[/tex] wrong, you'll find that r increases as you travel backwards in coordinate time on the outgoing geodesics.

Ok. So I think I finally have it.

I find [itex]X'^\mu=(1-\frac{2M}{2M-r},1,0,0)[/itex]

and then [itex]g(X,X)= \frac{4M^2}{(2M-r)r} - \frac{6M^3}{(2M-r)r^2} + 1 - \frac{2M}{2M-r}[/itex]
I find this can be rewritten as
[itex]g(X,X)=1-\frac{2M}{2M-r} \left( \left( \sqrt{3} \frac{M}{r} - \frac{1}{\sqrt{3}} \right)^2 + \frac{8}{9} \right)[/itex]

Now everything in the brackets is positive and [itex]\frac{2M}{2M-r}>1[/itex] for [itex]r<2M[/itex] so we get [itex]g(X,X)<0[/itex]
Does this look ok?

How about doing the same thing in Schwarzschild coordinates? What did I do wrong the 1st time I tried this and ended up finding [itex]r[/itex] to be spacelike. I'm curious as to how they agree!

In Schwarzschild coordinates you find

[tex] - \frac{2M}{2M-r}[/tex]

so there's probably something wrong with your derivation above.

Sorry! I meant (355). The first line of that. Why does the [itex]\delta[/itex] not hit the metric in front of the brackets?

Cheers.

You should reread the remarks above (355) about the normal coordinates he's using. When you act on the inverse metric, you find terms like

[tex] \delta g^{\mu\sigma} \left( g_{\sigma\nu,\rho} +\cdots \right), ~~(*)[/tex]

but these all vanish because he takes [tex] g_{\sigma\nu,\rho} =0[/tex]. If you used general coordinates, the effect of the terms (*) would be to generate the covariant derivatives in the 2nd line of (355).
 
<h2>1. Why do p terms not cancel in this Diffeomorphisms proof?</h2><p>The reason why p terms do not cancel in this Diffeomorphisms proof is because they represent different variables or parameters. In mathematical proofs, it is important to keep track of all variables and not assume that they will cancel out.</p><h2>2. Can you provide an example of p terms not canceling in a Diffeomorphisms proof?</h2><p>One example is in the proof of the inverse function theorem, where the p terms represent the partial derivatives of the functions involved. These terms do not cancel because they are different derivatives and cannot be simplified.</p><h2>3. How does the failure of p terms to cancel affect the overall proof?</h2><p>The failure of p terms to cancel does not affect the validity of the proof. It simply means that the variables involved cannot be simplified or canceled out, and they must be considered separately in the proof.</p><h2>4. Is it common for p terms to not cancel in mathematical proofs?</h2><p>Yes, it is common for p terms to not cancel in mathematical proofs. In fact, it is often a key part of the proof to show that these terms cannot be simplified or canceled out in order to arrive at the desired conclusion.</p><h2>5. How can one ensure that p terms do not cancel in a proof?</h2><p>To ensure that p terms do not cancel in a proof, it is important to carefully track all variables and not assume that they will cancel out. Additionally, it may be helpful to double check the calculations and equations to make sure that all terms are accounted for and cannot be simplified further.</p>

1. Why do p terms not cancel in this Diffeomorphisms proof?

The reason why p terms do not cancel in this Diffeomorphisms proof is because they represent different variables or parameters. In mathematical proofs, it is important to keep track of all variables and not assume that they will cancel out.

2. Can you provide an example of p terms not canceling in a Diffeomorphisms proof?

One example is in the proof of the inverse function theorem, where the p terms represent the partial derivatives of the functions involved. These terms do not cancel because they are different derivatives and cannot be simplified.

3. How does the failure of p terms to cancel affect the overall proof?

The failure of p terms to cancel does not affect the validity of the proof. It simply means that the variables involved cannot be simplified or canceled out, and they must be considered separately in the proof.

4. Is it common for p terms to not cancel in mathematical proofs?

Yes, it is common for p terms to not cancel in mathematical proofs. In fact, it is often a key part of the proof to show that these terms cannot be simplified or canceled out in order to arrive at the desired conclusion.

5. How can one ensure that p terms do not cancel in a proof?

To ensure that p terms do not cancel in a proof, it is important to carefully track all variables and not assume that they will cancel out. Additionally, it may be helpful to double check the calculations and equations to make sure that all terms are accounted for and cannot be simplified further.

Similar threads

  • Advanced Physics Homework Help
Replies
2
Views
2K
  • Advanced Physics Homework Help
Replies
4
Views
2K
Replies
3
Views
5K
  • Math Proof Training and Practice
6
Replies
175
Views
20K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
3
Views
222
  • General Math
Replies
8
Views
2K
  • Quantum Physics
Replies
1
Views
3K
  • Linear and Abstract Algebra
Replies
9
Views
4K
  • Beyond the Standard Models
Replies
1
Views
2K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
7
Views
2K
Back
Top