How Do You Apply the Multidimensional Chain Rule in Variable Transformations?

Click For Summary

Discussion Overview

The discussion revolves around the application of the multidimensional chain rule in the context of variable transformations, specifically how to compute partial derivatives of a function when changing variables. Participants explore the notation and methodology involved in expressing these derivatives correctly.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant proposes that for a function f(x, y) transformed to f(z, y), the partial derivative with respect to x can be expressed as $$\frac{\partial f}{\partial x} = \frac{\partial f}{\partial z}\frac{\partial z}{\partial x}$$ and suggests a similar form for the second derivative.
  • Another participant recommends changing the notation to use primed variables (x', y') to clarify the transformation and avoid confusion, suggesting that the derivatives should be expressed in terms of these new variables.
  • A third participant points out the potential confusion in using the same symbol "y" for both the original function and the transformed variable, advocating for clearer notation that distinguishes between the variables involved in the transformation.
  • This participant provides an example to illustrate how to express the function in terms of the new variable z and either x or y, emphasizing the need to understand the relationships between the variables correctly.

Areas of Agreement / Disagreement

Participants express differing views on the appropriate notation and approach to take when applying the chain rule in this context. There is no consensus on the best method or notation to use, indicating an ongoing debate about clarity and correctness in variable transformations.

Contextual Notes

Participants highlight limitations in the current notation and the potential for confusion when variables are not clearly distinguished. The discussion does not resolve these issues but rather emphasizes the need for careful consideration of variable relationships.

member 428835
hey pf!

suppose i have a function ##f( x , y)##. i make a change of variables such that ##z(x,y)## in such a way that now ##f( z , y)##. how do i find $$\frac{\partial f}{\partial y}$$ $$\frac{\partial f}{\partial x}$$ $$\frac{\partial^2 f}{\partial y^2}$$ $$\frac{\partial^2 f}{\partial x}$$

i think $$\frac{\partial f}{\partial x} = \frac{\partial f}{\partial z}\frac{\partial z}{\partial x}$$ and $$\frac{\partial^2 f}{\partial x^2} = \frac{\partial^2 f}{\partial z^2} \frac{\partial z}{\partial x} + \frac{\partial f}{\partial z}\frac{\partial^2 z}{\partial x^2}$$

i also think $$\frac{\partial f}{\partial y} = \frac{\partial f}{\partial z} \frac{\partial z}{\partial y} + \frac{\partial f}{\partial y}$$ but something is wrong here. i feel that i need some new notation or something to fully represent what is happening.

i have no idea how to express $$\frac{\partial^2 f}{\partial y^2}$$

any help is greatly appreciated
 
Physics news on Phys.org
Great inquiry and along your investigation you even noted the vulgarity of the notation!

To free yourself from the corruption the present notation is proving, I recommend the following:
you started from a function of x and y and then switched to new coordinates x' and y' by the transformation

x'=z(x,y)
y'=y

This cleanses the notation. Redo your analysis where derivatives are dressed either primed or unprimed. You will see that your results are correct so long as you change some of their form.

(Hint: write down the f as a function of the unprimed coordinates and a take a partial of some primed coordinate, say x'. Carry out chain rules.)
 
Last edited:
The notation confusion comes where you chose to use "y" as both a variable in the first function and a variable in the definition of "x". If you are going to have two variables it would be better to use two new symbols.

That is, you initially have f(x, y) and you have a new variable, z, that is, in some way, a function of both x and y so you write f(z, y). That notation makes no sense to me. If this new z is a function of x and y wouldn't it make equal sense to say "f(x, z)"? The difficulty is that you want to replace x or y with a new variable so that instead of knowing that z is a function of x and y, you need to know how x, and/or y, is a function of z.

For example, if we have f(x,y)= x+ y and are given z= x/y, we can write x= yz and so can write f(z, y)= yz+ z= z(y+ 1). But we could as well write y= x/z and write f(x, z)= x+ (x/z)= x(z+ 1)/z.
 
Hey thanks to you both!
 

Similar threads

  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 17 ·
Replies
17
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K