Making Sense of the Chain Rule: Can I Multiply to Find dy/dx?

  • Context: Undergrad 
  • Thread starter Thread starter Bashyboy
  • Start date Start date
  • Tags Tags
    Chain Chain rule
Click For Summary
SUMMARY

The discussion centers on the chain rule in calculus, specifically the interpretation of dy/dx as a limit rather than a fraction. Participants clarify that while dy/dx can be conceptually treated like a fraction for understanding, it fundamentally arises from limits of Δy/Δx. The conversation emphasizes that dy, dx, and related terms are not numbers but results of limiting processes. Additionally, the use of fixed delta values is suggested as a naive guide to understanding the chain rule and total differentials.

PREREQUISITES
  • Understanding of basic calculus concepts, including derivatives and limits.
  • Familiarity with the chain rule and its application in differentiation.
  • Knowledge of total differentials and their significance in multivariable calculus.
  • Basic grasp of infinitesimals and their role in calculus.
NEXT STEPS
  • Study the formal proof of the chain rule in calculus.
  • Explore the concept of total differentials in multivariable calculus.
  • Learn about the application of limits in calculus, focusing on ε-δ definitions.
  • Investigate the historical context and development of infinitesimals in calculus.
USEFUL FOR

Students of calculus, mathematics educators, and anyone seeking a deeper understanding of the chain rule and its implications in differentiation and limits.

Bashyboy
Messages
1,419
Reaction score
5
I have read a few sources regarding the chain rule, and a pervasive explanation that most of the sources share is this, which is way to sort of make sense of it:

"Regard du/dx as the rate of change of u with respect to x, dy/du as the rate of change of y with respect to u, and dy/dx as the rate of change of y with respect to x. If u changes twice as fast as x and y changes three times as fast as u, then it seems reasonable that y changes six times as fast as x, and so we expect that dy/dx = dy/du * du/dx."

I don't understand why it is a simple operation of multiplication to find how fast y changes compared to x. Maybe I am missing something. I'd like to mention, though, that I do understand the chain rule; but when I read this description of it, I just don't seem to understand.
 
Physics news on Phys.org
The simplest way of looking at it (though not as rigorous as the proof of the chain rule) is that, while dy/dx is not a fraction, it is the limit of a fraction- go back before the limit, to \Delta y/\Delta x, and with x a function of u, \Delta y/\Delta x= (\Delta y/\Delta u)(\Delta u/\Delta x). You can "cancel" in that fraction and take the limit.
 
HallsofIvy said:
The simplest way of looking at it (though not as rigorous as the proof of the chain rule) is that, while dy/dx is not a fraction, it is the limit of a fraction- go back before the limit, to \Delta y/\Delta x, and with x a function of u, \Delta y/\Delta x= (\Delta y/\Delta u)(\Delta u/\Delta x). You can "cancel" in that fraction and take the limit.

In rigorous mathematics, I heard you can't treat dy/dx like a fraction. But I don't think there's anything wrong by doing that, since dy/dx is like (as HallsoIvy said above) Δy/Δx, so isn't there sort of like a value in Δx and Δy? So why can't you treat it like a fraction?
 
Byron Chen said:
In rigorous mathematics, I heard you can't treat dy/dx like a fraction. But I don't think there's anything wrong by doing that, since dy/dx is like (as HallsoIvy said above) Δy/Δx, so isn't there sort of like a value in Δx and Δy? So why can't you treat it like a fraction?

The simple reason is that you are not dealing with a number so to speak: dy, dx and all the others are not numbers but the results of limits.

The proofs for calculus show when they can be treated like fractions, but remember that in general, they are the result of limiting processes.

One thing you can do however, is to get an understanding by using fixed delta values instead of the infinitesimal ones and see how they affect calculations.

I stress that these are not the same things and only to be used as a naive guide to what's happening, but never the less they can be useful.

Consider the following: the total differential. Let's say you have a function f of two variables x and y that are independent.

Then you can obtain the total differential df = df/dx*dx + df/dy*dy.

If you change the infinitesimals to deltas, then you can see that what this is doing is it's taking the rate of change with respect to each variable and then incrementing it in the right way.

Lets simplify it and let triangle_x = triangle_y = 1 (in place of dx and dy). Also triangle_f is the distance between successive measurements.

So for the change in f with respect to deltas of 1, we get the change in the x and y direction.

If you add these together as vectors you get the result of a right angled triangle where the change in f is the hypotenuse of the triangle and the other sides are the rates of change with a step-size of 1 (not infinitesimal).

But because we are dealing with infinitesimals and limits as opposed to constant numbers we can't just simply "cancel" terms because again they are not numbers.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K