Chain Rule (partial derivatives): basic interpretation question

Click For Summary

Discussion Overview

The discussion revolves around the interpretation and understanding of the chain rule in the context of partial derivatives, particularly focusing on its geometric and intuitive aspects. Participants explore the implications of the formula for the total differential and its application in multivariable calculus.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant expresses confusion about the intuition behind the chain rule, particularly regarding the interpretation of changes in variables and their relationship to the Pythagorean theorem.
  • Another participant suggests using a first-order Taylor series expansion to clarify the relationship between changes in the variables and the resulting change in the function.
  • A different participant emphasizes the geometric understanding of the chain rule, discussing the tangent bundle of surfaces and how tangent planes map through functions.
  • Some participants propose viewing the total differential as a vector in cotangent space, linking it to the gradient and paths through the space.
  • There are multiple references to the need for visual aids to better understand the geometric interpretations discussed.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the best way to intuitively understand the chain rule. Various interpretations and methods are proposed, indicating that multiple competing views remain on the topic.

Contextual Notes

Some limitations include the reliance on geometric intuition, which may not be universally accessible, and the potential for differing interpretations of mathematical concepts such as tangent bundles and total differentials.

nomadreid
Gold Member
Messages
1,771
Reaction score
255
TL;DR
If z is a a continuous function of x,y, then dz=(∂z/∂x)dx+(∂z/∂y)dy is a basic formula whose intuition escapes me, unless one treats dx and dy as vectors, which doesn't seem right.
The proof for the above ubiquitous formula (as in the summary) in "Chain rule for one independent variable" at the beginning of
https://math.libretexts.org/Bookshe...5:_The_Chain_Rule_for_Multivariable_Functions
is something that I need to work through, but I don't see the forest for all the trees: that is, along with the formal proof I would like to have a rough intuition here, which fails me when I consider that
(∂z/∂x)dx seems to be the (infinitesimal) change in z in the xz plane, and similarly
(∂z/∂y)dy seems to be the change in z in the yz plane,
which would seem to indicate that the corresponding change in z would be (if dx, dy and dz are scalars) the Pythagorean combination rather than the simple addition as if they were vectors.

Obviously I am looking at it wrongly; any pointers to correct this would be greatly appreciated.
 
Physics news on Phys.org
You can see it from a first-order Taylor series expansion:
$$z(x + \Delta x, y + \Delta y) \approx z(x, y + \Delta y) + \frac{\partial z}{\partial x}\Delta x \approx z(x, y) + \frac{\partial z}{\partial y}\Delta y + \frac{\partial z}{\partial x}\Delta x$$So:
$$\Delta z = z(x + \Delta x, y + \Delta y) - z(x, y) \approx \frac{\partial z}{\partial x}\Delta x + \frac{\partial z}{\partial y}\Delta y$$
 
  • Like
Likes   Reactions: nomadreid
nomadreid said:
TL;DR Summary: If z is a a continuous function of x,y, then dz=(∂z/∂x)dx+(∂z/∂y)dy is a basic formula whose intuition escapes me, unless one treats dx and dy as vectors, which doesn't seem right.

Obviously I am looking at it wrongly; any pointers to correct this would be greatly appreciated.
What makes you think you are wrong? The total differential is vector in the cotangent space ##T^*_pU=\{T_pU\longrightarrow \mathbb{R}\}## and ##\dfrac{\partial z}{\partial x_k}## its coordinates in the standard basis ##dx_k.## You can look at it as
$$
(dz)_p(X) =\left. \dfrac{d}{dt}\right|_{t=0} z(\gamma(t))=\bigl\langle \operatorname{grad}z,X \bigr\rangle
$$
with a path ##\gamma(t)## through ##p## along the vector field ##X, ## i.e. ##\gamma(0)=p\, , \,\dot\gamma (0)=X.##
 
  • Like
Likes   Reactions: mathwonk and nomadreid
PeroK said:
You can see it from a first-order Taylor series expansion:
$$z(x + \Delta x, y + \Delta y) \approx z(x, y + \Delta y) + \frac{\partial z}{\partial x}\Delta x \approx z(x, y) + \frac{\partial z}{\partial y}\Delta y + \frac{\partial z}{\partial x}\Delta x$$So:
$$\Delta z = z(x + \Delta x, y + \Delta y) - z(x, y) \approx \frac{\partial z}{\partial x}\Delta x + \frac{\partial z}{\partial y}\Delta y$$
PeroK: That is a much nicer proof than the one I cited from the website, thank you.

fresh_42 said:
What makes you think you are wrong? The total differential is vector in the cotangent space ##T^*_pU=\{T_pU\longrightarrow \mathbb{R}\}## and ##\dfrac{\partial z}{\partial x_k}## its coordinates in the standard basis ##dx_k.## You can look at it as
$$
(dz)_p(X) =\left. \dfrac{d}{dt}\right|_{t=0} z(\gamma(t))=\bigl\langle \operatorname{grad}z,X \bigr\rangle
$$
with a path ##\gamma(t)## through ##p## along the vector field ##X, ## i.e. ##\gamma(0)=p\, , \,\dot\gamma (0)=X.##
Super! Thanks, fresh_42.
 
  • Like
Likes   Reactions: PeroK
The chain rule is best understood geometrically. To avoid thinking about higher dimensions, we will just consider 2d surfaces. However, the reasoning that follows goes through with minor adjustments for higher dimensions.

Assume we have a surface ##M##, then we can attach to it all its tangent planes. We call this the tangent bundle of the surface and denote it as ##TM##. Now if we have a map ##f : M \rightarrow N## from this surface ##M## to another surface ##N##, then it is intuitively obvious that all the tangent planes of the first shape maps to the tangent planes of the second. We call this map ##Tf## and in symbols, we expect ##Tf: TM \rightarrow TN##. Now it is also geometrically obvious that if we had a further map ##g: N \rightarrow P## to another surface ##P##, then composing the maps ##Tg\circ Tf## should equal ##T(g\circ f)##. This is best understood through a picture, which unfortunately I can't do here. Anyway, this means that ##T## is what is called a functor. Now if we work locally, that is using charts, then it turns out this functorial rule is precisely the chain rule.
 
  • Like
Likes   Reactions: dextercioby and nomadreid
Mozibur Rahman Ullah said:
The chain rule is best understood geometrically. To avoid thinking about higher dimensions, we will just consider 2d surfaces. However, the reasoning that follows goes through with minor adjustments for higher dimensions.

Assume we have a surface ##M##, then we can attach to it all its tangent planes. We call this the tangent bundle of the surface and denote it as ##TM##. Now if we have a map ##f : M \rightarrow N## from this surface ##M## to another surface ##N##, then it is intuitively obvious that all the tangent planes of the first shape maps to the tangent planes of the second. We call this map ##Tf## and in symbols, we expect ##Tf: TM \rightarrow TN##. Now it is also geometrically obvious that if we had a further map ##g: N \rightarrow P## to another surface ##P##, then composing the maps ##Tg\circ Tf## should equal ##T(g\circ f)##. This is best understood through a picture, which unfortunately I can't do here. Anyway, this means that ##T## is what is called a functor. Now if we work locally, that is using charts, then it turns out this functorial rule is precisely the chain rule.
Interesting approach, thanks.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 18 ·
Replies
18
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 15 ·
Replies
15
Views
6K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
5
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K