Quincy
- 228
- 0
Chain Rule - intuitive "Proof"
Suppose y = f(u), and u = g(x), then dy/dx = dy/du * du/dx.
In an intuitive "proof" of the chain rule, it has this step: dy/dx = \lim_{\Delta x \to 0} \frac {\Delta y}{\Delta x} = \lim_{\Delta x \to 0} \frac {\Delta y}{\Delta u} * \frac {\Delta u}{\Delta x}
My question is, why multiply by \frac {\Delta u}{\Delta u}? I know mathematically, it's because \frac {\Delta u}{\Delta u} = 1, and multiplying by 1 doesn't change the function, but I'm looking for a philosophical reason. I found a quote by the user mathwonk in an old thread, which says: "it seems plausible that the best linear approximation to a composite function, is obtained by composing the best approximations to the component functions. On the other hand for a linear function, composing means simply multiplying." Can someone expand on this?...
Suppose y = f(u), and u = g(x), then dy/dx = dy/du * du/dx.
In an intuitive "proof" of the chain rule, it has this step: dy/dx = \lim_{\Delta x \to 0} \frac {\Delta y}{\Delta x} = \lim_{\Delta x \to 0} \frac {\Delta y}{\Delta u} * \frac {\Delta u}{\Delta x}
My question is, why multiply by \frac {\Delta u}{\Delta u}? I know mathematically, it's because \frac {\Delta u}{\Delta u} = 1, and multiplying by 1 doesn't change the function, but I'm looking for a philosophical reason. I found a quote by the user mathwonk in an old thread, which says: "it seems plausible that the best linear approximation to a composite function, is obtained by composing the best approximations to the component functions. On the other hand for a linear function, composing means simply multiplying." Can someone expand on this?...