Understanding Variable Changes in the Chain Rule for Mathematicians

  • Context: Graduate 
  • Thread starter Thread starter Buri
  • Start date Start date
  • Tags Tags
    Chain Chain rule
Click For Summary

Discussion Overview

The discussion revolves around the application of the chain rule in calculus, specifically regarding variable changes in proofs from a mathematical text. Participants express confusion about the completeness of variable changes in certain propositions and the use of Leibniz notation versus other notations.

Discussion Character

  • Debate/contested
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • One participant questions the completeness of variable changes in the application of the chain rule in Proposition 1.3.4 compared to Proposition 1.3.6.
  • Another participant provides a detailed breakdown of the chain rule application in the first proposition, but questions the presence of certain notation, specifically whether it should involve d(psi) or dt (tilda).
  • Some participants express discomfort with Leibniz notation, finding it confusing and preferring other notational forms.
  • There is a discussion about the treatment of differentials in notation, with some participants suggesting that they can be treated as separate entities while others disagree.
  • A participant mentions using summation convention for clarity in their calculations, indicating a preference for a notation that makes the functions involved more explicit.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the appropriateness of the variable changes in the first proposition or the use of Leibniz notation. Multiple competing views on the notation and its application remain present throughout the discussion.

Contextual Notes

Some participants note that the notation used in their course is geared towards physics students, which may differ from standard mathematical notation, contributing to the confusion.

Buri
Messages
271
Reaction score
0
I have a question about the variable changes in the proofs of Proposition 1.3.4 and Proposition 1.3.6. In the first one, it seems like the author does the variable change but once he applies the chain rule he doesn't do it completely. While in the second it seems like it does the variable changes like usual in the chain rule. Are these 'partial' variable changes in the first one okay? I don't see what's the point of the variable change if you're not going to apply it completely - though I do see the purpose of not doing it completely in the first proposition. And another question, when applying the chain rule in Liebniz notation do you HAVE to make the variable change?

I'm not very comfortable with Liebniz notation since I've never used it before really. I'm more used to the f'(x) notation or Df notation.

Any clarification would be appreciated!
 

Attachments

  • Page 14.jpg
    Page 14.jpg
    15.3 KB · Views: 523
  • Page 15.jpg
    Page 15.jpg
    19.8 KB · Views: 491
Physics news on Phys.org
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))(\phi^{-1})'(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

Edit: I edited the math above, because the \phi^{-1} was missing a ' symbol.
 
Last edited:
Fredrik said:
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?
 
I'm not sure if it really matters though. I just find that the noation is sometimes used like literal fractions too - I haven't studied differential forms yet, so maybe they actually can be treated like separate entitities, but I've always heard you can't. So..
 
Buri said:
Well there shouldn't be a d(psi) in the first one. Shouldn't it be instead a dt (tilda)?
That's what I don't like about the notation. :smile: See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).
 
Fredrik said:
I'm not a big fan of the notation, but they both look like straightforward applications of the chain rule. What is it about the first one that doesn't look "complete" to you?

What the author is doing in the first one is to start with

t=\phi\circ\phi^{-1}(t)

and apply d/dt to both sides, which combined with the definitions \psi=\phi^{-1} and \bar t=\psi(t) yields

1=\phi'(\phi^{-1}(t))\phi^{-1}(t)=\phi'(\psi(t))\psi'(t)=\phi'(\bar t)\psi'(t)=\frac{d\phi}{d\bar t}\frac{d\psi}{dt}

I do understand what is going on. But my question is more as to if the partial variable changes like when its supposed to be d t (tilda) instead of d (psi) are those correct? Because just to make notation easier, what he's doing is this:

df/du dg/dx when normally it should be df/du du/dx. See what I imean?
 
Fredrik said:
That's what I don't like about the notation. :smile: See if you understand the calculation I added to my previous post. (I think I wrote it before you posted your reply, but I lost contact with the site for a few minutes, and saved the changes when I was able to reconnect).

I do. And I guess what you don't like about the noation is exactly what I find confusing about it. This course is in Differential Geometry, but its geared towards physics students actually and so they use different notation and such as to what math major classes use and so I just find this rather confusing, eventhough I do understand its just the chain rule.
 
I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).
 
Fredrik said:
I often find the d(something)/d(something) notation very confusing myself, but when I do, I just ignore the book and do the calculation myself in a notation that makes it obvious what functions I'm dealing with. When I use the chain rule, it looks something like this:

(f\circ g)_{,i}(x)=f_{,j}(g(x))g^j_{,i}(x)

(I'm using the summation convention, and ",i" for the ith partial derivative).

Thanks for the idea. I'll do the same from now on. I've spent so much time trying to unravel calculations with Leibniz notation, but every time I figure something out something else crops up which doesn't follow the rules, so there isn't much of a point I guess in trying to figuring out all the nuances of the notation.

Thanks for your help! :smile:
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 20 ·
Replies
20
Views
5K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 36 ·
2
Replies
36
Views
6K
  • · Replies 3 ·
Replies
3
Views
3K