Chain Rule (partial derivatives): basic interpretation question

Click For Summary
SUMMARY

The discussion centers on the interpretation of the chain rule in the context of partial derivatives, specifically the formula dz = (∂z/∂x)dx + (∂z/∂y)dy. Participants explore the geometric intuition behind this formula, emphasizing that the total differential is a vector in the cotangent space. The conversation highlights the importance of understanding the chain rule through geometric mappings of tangent planes between surfaces, reinforcing that the chain rule can be viewed as a functorial relationship between these mappings.

PREREQUISITES
  • Understanding of partial derivatives and their notation
  • Familiarity with the concept of total differentials
  • Basic knowledge of tangent spaces and bundles
  • Experience with geometric interpretations of calculus
NEXT STEPS
  • Study the geometric interpretation of the chain rule in multivariable calculus
  • Explore the concept of tangent bundles and their applications
  • Learn about functorial relationships in differential geometry
  • Review first-order Taylor series expansions in multiple dimensions
USEFUL FOR

Mathematicians, physics students, and anyone studying multivariable calculus who seeks a deeper understanding of the chain rule and its geometric implications.

nomadreid
Gold Member
Messages
1,762
Reaction score
248
TL;DR
If z is a a continuous function of x,y, then dz=(∂z/∂x)dx+(∂z/∂y)dy is a basic formula whose intuition escapes me, unless one treats dx and dy as vectors, which doesn't seem right.
The proof for the above ubiquitous formula (as in the summary) in "Chain rule for one independent variable" at the beginning of
https://math.libretexts.org/Bookshe...5:_The_Chain_Rule_for_Multivariable_Functions
is something that I need to work through, but I don't see the forest for all the trees: that is, along with the formal proof I would like to have a rough intuition here, which fails me when I consider that
(∂z/∂x)dx seems to be the (infinitesimal) change in z in the xz plane, and similarly
(∂z/∂y)dy seems to be the change in z in the yz plane,
which would seem to indicate that the corresponding change in z would be (if dx, dy and dz are scalars) the Pythagorean combination rather than the simple addition as if they were vectors.

Obviously I am looking at it wrongly; any pointers to correct this would be greatly appreciated.
 
Physics news on Phys.org
You can see it from a first-order Taylor series expansion:
$$z(x + \Delta x, y + \Delta y) \approx z(x, y + \Delta y) + \frac{\partial z}{\partial x}\Delta x \approx z(x, y) + \frac{\partial z}{\partial y}\Delta y + \frac{\partial z}{\partial x}\Delta x$$So:
$$\Delta z = z(x + \Delta x, y + \Delta y) - z(x, y) \approx \frac{\partial z}{\partial x}\Delta x + \frac{\partial z}{\partial y}\Delta y$$
 
  • Like
Likes   Reactions: nomadreid
nomadreid said:
TL;DR Summary: If z is a a continuous function of x,y, then dz=(∂z/∂x)dx+(∂z/∂y)dy is a basic formula whose intuition escapes me, unless one treats dx and dy as vectors, which doesn't seem right.

Obviously I am looking at it wrongly; any pointers to correct this would be greatly appreciated.
What makes you think you are wrong? The total differential is vector in the cotangent space ##T^*_pU=\{T_pU\longrightarrow \mathbb{R}\}## and ##\dfrac{\partial z}{\partial x_k}## its coordinates in the standard basis ##dx_k.## You can look at it as
$$
(dz)_p(X) =\left. \dfrac{d}{dt}\right|_{t=0} z(\gamma(t))=\bigl\langle \operatorname{grad}z,X \bigr\rangle
$$
with a path ##\gamma(t)## through ##p## along the vector field ##X, ## i.e. ##\gamma(0)=p\, , \,\dot\gamma (0)=X.##
 
  • Like
Likes   Reactions: mathwonk and nomadreid
PeroK said:
You can see it from a first-order Taylor series expansion:
$$z(x + \Delta x, y + \Delta y) \approx z(x, y + \Delta y) + \frac{\partial z}{\partial x}\Delta x \approx z(x, y) + \frac{\partial z}{\partial y}\Delta y + \frac{\partial z}{\partial x}\Delta x$$So:
$$\Delta z = z(x + \Delta x, y + \Delta y) - z(x, y) \approx \frac{\partial z}{\partial x}\Delta x + \frac{\partial z}{\partial y}\Delta y$$
PeroK: That is a much nicer proof than the one I cited from the website, thank you.

fresh_42 said:
What makes you think you are wrong? The total differential is vector in the cotangent space ##T^*_pU=\{T_pU\longrightarrow \mathbb{R}\}## and ##\dfrac{\partial z}{\partial x_k}## its coordinates in the standard basis ##dx_k.## You can look at it as
$$
(dz)_p(X) =\left. \dfrac{d}{dt}\right|_{t=0} z(\gamma(t))=\bigl\langle \operatorname{grad}z,X \bigr\rangle
$$
with a path ##\gamma(t)## through ##p## along the vector field ##X, ## i.e. ##\gamma(0)=p\, , \,\dot\gamma (0)=X.##
Super! Thanks, fresh_42.
 
  • Like
Likes   Reactions: PeroK
The chain rule is best understood geometrically. To avoid thinking about higher dimensions, we will just consider 2d surfaces. However, the reasoning that follows goes through with minor adjustments for higher dimensions.

Assume we have a surface ##M##, then we can attach to it all its tangent planes. We call this the tangent bundle of the surface and denote it as ##TM##. Now if we have a map ##f : M \rightarrow N## from this surface ##M## to another surface ##N##, then it is intuitively obvious that all the tangent planes of the first shape maps to the tangent planes of the second. We call this map ##Tf## and in symbols, we expect ##Tf: TM \rightarrow TN##. Now it is also geometrically obvious that if we had a further map ##g: N \rightarrow P## to another surface ##P##, then composing the maps ##Tg\circ Tf## should equal ##T(g\circ f)##. This is best understood through a picture, which unfortunately I can't do here. Anyway, this means that ##T## is what is called a functor. Now if we work locally, that is using charts, then it turns out this functorial rule is precisely the chain rule.
 
  • Like
Likes   Reactions: dextercioby and nomadreid
Mozibur Rahman Ullah said:
The chain rule is best understood geometrically. To avoid thinking about higher dimensions, we will just consider 2d surfaces. However, the reasoning that follows goes through with minor adjustments for higher dimensions.

Assume we have a surface ##M##, then we can attach to it all its tangent planes. We call this the tangent bundle of the surface and denote it as ##TM##. Now if we have a map ##f : M \rightarrow N## from this surface ##M## to another surface ##N##, then it is intuitively obvious that all the tangent planes of the first shape maps to the tangent planes of the second. We call this map ##Tf## and in symbols, we expect ##Tf: TM \rightarrow TN##. Now it is also geometrically obvious that if we had a further map ##g: N \rightarrow P## to another surface ##P##, then composing the maps ##Tg\circ Tf## should equal ##T(g\circ f)##. This is best understood through a picture, which unfortunately I can't do here. Anyway, this means that ##T## is what is called a functor. Now if we work locally, that is using charts, then it turns out this functorial rule is precisely the chain rule.
Interesting approach, thanks.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 18 ·
Replies
18
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 15 ·
Replies
15
Views
6K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
5
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K