Chain Rule in Multiple Variables

Click For Summary

Discussion Overview

The discussion revolves around understanding the chain rule in the context of functions of multiple variables. Participants explore how the chain rule applies when functions depend on other functions that themselves depend on different sets of variables. The scope includes theoretical aspects of calculus and mathematical reasoning.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant seeks clarification on the chain rule, particularly how it applies when a function is composed of other functions that depend on different variables.
  • Another participant suggests that a more accessible tutorial or video might be beneficial for understanding the chain rule, rather than focusing on a specific Stack Exchange question.
  • A participant explains that if \( f = f(x, y) \), where \( x = x(u, v) \) and \( y = y(a, b) \), then a new function \( g(u, v, a, b) \) can be defined, leading to a set of partial derivatives that can be calculated using the chain rule.
  • Examples are provided to illustrate the application of the chain rule in this context, including specific functions and their derivatives.
  • Additional resources on the multivariable chain rule are shared for further exploration.

Areas of Agreement / Disagreement

Participants express varying levels of understanding and suggest different resources for learning. While some participants provide explanations and examples, there is no explicit consensus on the best approach to understanding the chain rule in this context.

Contextual Notes

Some participants note that the discussion may benefit from a more foundational understanding of analysis and linear algebra, which could affect comprehension of the chain rule's application in multiple variables.

cwill53
Messages
220
Reaction score
40
TL;DR
I need help understanding the chain rule.
The following link leads to a question I asked on the mathematics Stack Exchange site.

https://math.stackexchange.com/ques...s/3791017?noredirect=1#comment7809514_3791017

I want to understand how the chain rule works, and what exactly the person who answered my question was saying. I haven't taken analysis and I know very little linear algebra, so it went over my head.
 
Physics news on Phys.org
jedishrfu said:
Rather than discuss a Stack Exchange article, perhaps a more down-to-earth tutorial would be better:

https://tutorial.math.lamar.edu/classes/calciii/chainrule.aspx

or this video from Khan Academy:

https://www.khanacademy.org/math/mu...ariable-chain-rule/v/multivariable-chain-rule
It's not a stack exchange article, it's a specific question I have about how the chain rule changes if I have a function composed of different variables, where each variable is composed of its own set of variables.

My question is, how does the chain rule change when say

f = f(x,y) and x = x(u,v) and y = y(u,b)?

The variabes x and y are both functions of the variable u, but x is also a function of v while y is a function of b.

How about if say f = f(x,y) and x = x(u,v) while y = y(a,b)?

Now f depends on x and y but x and y depend on a completely different set of variables.
 
cwill53 said:
Summary:: I need help understanding the chain rule.

The following link leads to a question I asked on the mathematics Stack Exchange site.

https://math.stackexchange.com/ques...s/3791017?noredirect=1#comment7809514_3791017

I want to understand how the chain rule works, and what exactly the person who answered my question was saying. I haven't taken analysis and I know very little linear algebra, so it went over my head.
To answer your specific question. If you have ##f = f(x, y)## then you are defining ##f## as a function of two variables. There is some implicit rule for taking an ordered pair ##(x, y)## and outputing a number ##f(x, y)##.

If additionally you define ##x = x(u, v)## and ##y = y(a, b)##, you have two additional functions of two variables. We now have three different functions of two variables. This allows us to define a function of four variables (let's call it ##g##) where: ##g(u, v, a, b) = f(x(u, v), y(a, b))##.

The function ##g##, like any function of four variables, has four partial derivatives:
$$\frac{\partial g}{\partial u} = \frac{\partial f}{\partial x}\frac{\partial x}{\partial u}, \ \ \frac{\partial g}{\partial v} = \frac{\partial f}{\partial x}\frac{\partial x}{\partial v}, \ \ \frac{\partial g}{\partial a} = \frac{\partial f}{\partial y}\frac{\partial y}{\partial a}, \ \ \frac{\partial g}{\partial b} = \frac{\partial f}{\partial y}\frac{\partial y}{\partial b}$$
The next thing you should do is to test this out with an example. E.g.
$$f(x, y) = \cos(x)\sin(y), \ \ x(u, v) = 2u + 3v, \ \ y(a.b) = 2a^2 + b^3$$ which gives
$$g(u, v, a, b) = \cos(2u + 3v)\sin(2a^2 + b^3)$$
You can partially differentiate ##g## directly and then check the partial derivatives match the above formulas.

You may also be interested in these Insights on the multivariable chain rule:

https://www.physicsforums.com/insights/demystifying-chain-rule-calculus/

https://www.physicsforums.com/insights/how-to-solve-second-order-partial-derivatives/
 
  • Like
Likes   Reactions: hutchphd, etotheipi, cwill53 and 1 other person
PeroK said:
To answer your specific question. If you have ##f = f(x, y)## then you are defining ##f## as a function of two variables. There is some implicit rule for taking an ordered pair ##(x, y)## and outputing a number ##f(x, y)##.

If additionally you define ##x = x(u, v)## and ##y = y(a, b)##, you have two additional functions of two variables. We now have three different functions of two variables. This allows us to define a function of four variables (let's call it ##g##) where: ##g(u, v, a, b) = f(x(u, v), y(a, b))##.

The function ##g##, like any function of four variables, has four partial derivatives:
$$\frac{\partial g}{\partial u} = \frac{\partial f}{\partial x}\frac{\partial x}{\partial u}, \ \ \frac{\partial g}{\partial v} = \frac{\partial f}{\partial x}\frac{\partial x}{\partial v}, \ \ \frac{\partial g}{\partial a} = \frac{\partial f}{\partial y}\frac{\partial y}{\partial a}, \ \ \frac{\partial g}{\partial b} = \frac{\partial f}{\partial y}\frac{\partial y}{\partial b}$$
The next thing you should do is to test this out with an example. E.g.
$$f(x, y) = \cos(x)\sin(y), \ \ x(u, v) = 2u + 3v, \ \ y(a.b) = 2a^2 + b^3$$ which gives
$$g(u, v, a, b) = \cos(2u + 3v)\sin(2a^2 + b^3)$$
You can partially differentiate ##g## directly and then check the partial derivatives match the above formulas.

You may also be interested in these Insights on the multivariable chain rule:

https://www.physicsforums.com/insights/demystifying-chain-rule-calculus/

https://www.physicsforums.com/insights/how-to-solve-second-order-partial-derivatives/
This was perfect! I understood it all. Thank you so much.
 
  • Like
Likes   Reactions: PeroK

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K