Understanding Variable Changes in the Chain Rule for Mathematicians

  • Thread starter Thread starter Buri
  • Start date Start date
  • Tags Tags
    Chain Chain rule
Buri
Messages
271
Reaction score
0
I have a question about the variable changes in the proofs of Proposition 1.3.4 and Proposition 1.3.6. In the first one, it seems like the author does the variable change but once he applies the chain rule he doesn't do it completely. While in the second it seems like it does the variable changes like usual in the chain rule. Are these 'partial' variable changes in the first one okay? I don't see what's the point of the variable change if you're not going to apply it completely - though I do see the purpose of not doing it completely in the first proposition. And another question, when applying the chain rule in Liebniz notation do you HAVE to make the variable change?

I'm not very comfortable with Liebniz notation since I've never used it before really. I'm more used to the f'(x) notation or Df notation.

Any clarification would be appreciated!
 

Attachments

  • Page 14.jpg
    Page 14.jpg
    15.3 KB · Views: 494
  • Page 15.jpg
    Page 15.jpg
    19.8 KB · Views: 468
Physics news on Phys.org
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))(\phi^{-1})'(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

Edit: I edited the math above, because the \phi^{-1} was missing a ' symbol.
 
Last edited:
Fredrik said:
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?
 
I'm not sure if it really matters though. I just find that the noation is sometimes used like literal fractions too - I haven't studied differential forms yet, so maybe they actually can be treated like separate entitities, but I've always heard you can't. So..
 
Buri said:
Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?
That's what I don't like about the notation. :smile: See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).
 
Fredrik said:
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))\phi^{-1}(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

I do understand what is going on. But my question is more as to if the partial variable changes like when its supposed to be d t (tilda) instead of d (psi) are those correct? Because just to make notation easier, what he's doing is this:

df/du dg/dx when normally it should be df/du du/dx. See what I imean?
 
Fredrik said:
That's what I don't like about the notation. :smile: See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).

I do. And I guess what you don't like about the noation is exactly what I find confusing about it. This course is in Differential Geometry, but its geared towards physics students actually and so they use different notation and such as to what math major classes use and so I just find this rather confusing, eventhough I do understand its just the chain rule.
 
I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).
 
Fredrik said:
I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).

Thanks for the idea. I'll do the same from now on. I've spent so much time trying to unravel calculations with Leibniz notation, but every time I figure something out something else crops up which doesn't follow the rules, so there isn't much of a point I guess in trying to figuring out all the nuances of the notation.

Thanks for your help! :smile:
 
Back
Top