Understanding Variable Changes in the Chain Rule for Mathematicians

Buri · Feb 28, 2011

I have a question about the variable changes in the proofs of Proposition 1.3.4 and Proposition 1.3.6. In the first one, it seems like the author does the variable change but once he applies the chain rule he doesn't do it completely. While in the second it seems like it does the variable changes like usual in the chain rule. Are these 'partial' variable changes in the first one okay? I don't see what's the point of the variable change if you're not going to apply it completely - though I do see the purpose of not doing it completely in the first proposition. And another question, when applying the chain rule in Liebniz notation do you HAVE to make the variable change?

I'm not very comfortable with Liebniz notation since I've never used it before really. I'm more used to the f'(x) notation or Df notation.

Any clarification would be appreciated!

Fredrik · Feb 28, 2011

I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))(\phi^{-1})'(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

Edit: I edited the math above, because the \phi^{-1} was missing a ' symbol.

Buri · Feb 28, 2011

Fredrik said:

I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?

Buri · Feb 28, 2011

I'm not sure if it really matters though. I just find that the noation is sometimes used like literal fractions too - I haven't studied differential forms yet, so maybe they actually can be treated like separate entitities, but I've always heard you can't. So..

Fredrik · Feb 28, 2011

Buri said:

Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?

That's what I don't like about the notation.

See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).

Buri · Feb 28, 2011

Fredrik said:

I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))\phi^{-1}(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

I do understand what is going on. But my question is more as to if the partial variable changes like when its supposed to be d t (tilda) instead of d (psi) are those correct? Because just to make notation easier, what he's doing is this:

df/du dg/dx when normally it should be df/du du/dx. See what I imean?

Buri · Feb 28, 2011

Fredrik said:

That's what I don't like about the notation. See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).

I do. And I guess what you don't like about the noation is exactly what I find confusing about it. This course is in Differential Geometry, but its geared towards physics students actually and so they use different notation and such as to what math major classes use and so I just find this rather confusing, eventhough I do understand its just the chain rule.

Fredrik · Feb 28, 2011

I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).

Buri · Feb 28, 2011

Fredrik said:

I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).

Thanks for the idea. I'll do the same from now on. I've spent so much time trying to unravel calculations with Leibniz notation, but every time I figure something out something else crops up which doesn't follow the rules, so there isn't much of a point I guess in trying to figuring out all the nuances of the notation.

Thanks for your help!

Understanding Variable Changes in the Chain Rule for Mathematicians

Attachments

Similar threads

Hot Threads

I Algebraic property of real numbers

I Problem in understanding instantaneous velocity

I How to find the path if we only know the velocity (without common formulas)?

I Harmonic series Ʃ1/n diverges but p-series Ʃ(1/n)^p diverges?

I Explicit logical justification for last step in epsilon/delta proof?

Recent Insights

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers

Insights Fermat's Last Theorem

Insights Why Vector Spaces Explain The World: A Historical Perspective