Chain Rule Confusion

Click For Summary
SUMMARY

The discussion centers on the application of the Chain Rule in partial differentiation, specifically addressing the confusion surrounding the calculation of $$\frac{\partial y}{\partial p}$$. Participants clarify that partial differentiation should treat each variable as independent and atomic, without considering dependencies on other variables. The conversation emphasizes the importance of understanding that partial differentiation is applied to functions, not variables, and highlights the potential pitfalls of misapplying the Chain Rule when intermediate variables like $$p$$ are involved.

PREREQUISITES
  • Understanding of partial differentiation and its principles
  • Familiarity with the Chain Rule in calculus
  • Basic knowledge of functions and their dependencies
  • Experience with mathematical notation, particularly curly-d notation for partial derivatives
NEXT STEPS
  • Study the principles of partial differentiation in depth
  • Explore the Chain Rule and its applications in multivariable calculus
  • Review examples of common pitfalls in applying the Chain Rule
  • Learn about total differentiation and how it differs from partial differentiation
USEFUL FOR

Students and educators in mathematics, particularly those studying calculus and multivariable functions, as well as professionals applying calculus in fields such as physics and engineering.

laser1
Messages
170
Reaction score
23
Homework Statement
desc
Relevant Equations
desc
1732222784637.png

Source: Paul's Notes (https://tutorial.math.lamar.edu/Solutions/CalcIII/ChainRule/Prob6.aspx)

In his solutions:
1732222836233.png

where he computes the stuff in red. My question is, why is $$\frac{\partial y}{\partial p}$$not $$1+3\frac{\partial t}{\partial p}=1+3\left(\frac{-1}{2}\right)=\frac{-1}{2}$$ Thanks
 
Physics news on Phys.org
Because the rule for partial differentiation is to calculate the derivative using the formula given and not looking through any variables that may themselves be functions of other variables. In your alternative approach you are treating t as a function of p, looking through that function definition and including as part of the partial derivative any dependencies on p that you find . You are not supposed to look through like that. Treat each variable in the formula given for y as if it is an atomic variable that has no dependence on any other variable.
Unfortunately, the curly-d notation for partial differentiation tends to obscure this important principle because it makes it look like partial differentiation is something you do to a variable. It isn't. It's something you do to a function. Total differentiation can be something we do to a variable, but not partial differentiation.
See my Insights not on this topic, which may help clarify: https://www.physicsforums.com/insights/partial-differentiation-without-tears/
 
  • Like
Likes   Reactions: laser1 and FactChecker
andrewkirk said:
Because the rule for partial differentiation is to calculate the derivative using the formula given and not looking through any variables that may themselves be functions of other variables. In your alternative approach you are treating t as a function of p, looking through that function definition and including as part of the partial derivative any dependencies on p that you find . You are not supposed to look through like that. Treat each variable in the formula given for y as if it is an atomic variable that has no dependence on any other variable.
Unfortunately, the curly-d notation for partial differentiation tends to obscure this important principle because it makes it look like partial differentiation is something you do to a variable. It isn't. It's something you do to a function. Total differentiation can be something we do to a variable, but not partial differentiation.
See my Insights not on this topic, which may help clarify: https://www.physicsforums.com/insights/partial-differentiation-without-tears/
Thanks, I read through your article. My flaw there was computing dy/dp as opposed to partial y / partial p.

One point is still annoying me though - what if we consider ##p## to be a placeholder for ##1-2t##? So ##y=(1-2t)+3t-4s=1+t-4s##. This is the exact same thing, right? But now, $$\frac{\partial w}{\partial t}=\text{TERMS}+\frac{\partial w}{\partial y} \frac{\partial y}{\partial t}$$ where TERMS is just the other partial derivative terms. Now the part that goes through ##y## is equal to ##1##.

I can't really see where the flaw is with this logic.
 
laser1 said:
Thanks, I read through your article. My flaw there was computing dy/dp as opposed to partial y / partial p.

One point is still annoying me though - what if we consider ##p## to be a placeholder for ##1-2t##? So ##y=(1-2t)+3t-4s=1+t-4s##. This is the exact same thing, right? But now, $$\frac{\partial w}{\partial t}=\text{TERMS}+\frac{\partial w}{\partial y} \frac{\partial y}{\partial t}$$ where TERMS is just the other partial derivative terms. Now the part that goes through ##y## is equal to ##1##.

I can't really see where the flaw is with this logic.
If you simplify first, and eliminate ##p##, then the second and third terms in the full expansion reduce to a single term involving ## \frac{\partial y}{\partial t}##.
 
PeroK said:
If you simplify first, and eliminate ##p##, then the second and third terms in the full expansion reduce to a single term involving ## \frac{\partial y}{\partial t}##.
I mean, if ##y## was defined as ##y=(1-2t)+3t-4s=1+t-4s##, that's exactly what we'd do, right? We would only have those two terms in the expression involving ##\partial w/ \partial y##, those terms being ##\frac{\partial w}{\partial y} \frac{\partial y}{\partial t}##.
 
laser1 said:
I mean, if ##y## was defined as ##y=(1-2t)+3t-4s=1+t-4s##, that's exactly what we'd do, right? We would only have those two terms in the expression involving ##\partial w/ \partial y##.
Yes. If you do it both ways, you must get the same final answer. If you eliminate ##p## first, then the whole expansion is simplified.
 
  • Like
Likes   Reactions: FactChecker
PeroK said:
Yes. If you do it both ways, you must get the same final answer. If you eliminate ##p## first, then the whole expansion is simplified.
1732265455620.png
 
laser1 said:
I mean, if ##y## was defined as ##y=(1-2t)+3t-4s=1+t-4s##, that's exactly what we'd do, right?
Note that, IMO, that is how ##y## is defined. It is that function of ##s## and ##t##. That is not changed by the fact that we have an intermediate variable ##p##.
 
laser1 said:
Nonsense. Do the whole calculation for ##\frac{\partial w}{\partial t}##.
 
  • Like
Likes   Reactions: laser1
  • #10
Ah my bad! I completely missed that the solutions had the ##\frac{\partial w}{\partial y} \frac{\partial y}{\partial t}## term next to it, which evaluates to ##3\frac{\partial w}{\partial y}##, and ##3-2=1## so it all works out the same!
 
  • Like
Likes   Reactions: FactChecker and PeroK

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 9 ·
Replies
9
Views
1K
Replies
2
Views
1K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
2
Views
2K