Understanding the Chain Rule: When to Use and Why

  • Context: Undergrad 
  • Thread starter Thread starter LinearAlgebra
  • Start date Start date
  • Tags Tags
    Chain Chain rule
Click For Summary

Discussion Overview

The discussion revolves around the chain rule in calculus, specifically addressing when and why it is used in differentiation. Participants explore the application of the chain rule to functions that are compositions of other functions, and they examine examples to clarify understanding. The scope includes conceptual clarification and technical explanation of differentiation techniques.

Discussion Character

  • Conceptual clarification
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants express confusion about when to use the chain rule, particularly in the context of differentiating composite functions.
  • One participant explains that the chain rule is applicable when one function is the argument of another, using the example of differentiating f(x) = (sin(x) + x^2)^3.
  • Another participant describes the chain rule as a method to find the total rate of change when dealing with two functions, illustrating this with a walking and driving analogy.
  • Several participants question the necessity of the chain rule, suggesting that one could differentiate components normally, but others clarify that this approach still involves the chain rule implicitly.
  • A participant mentions the Weierstrass definition of the derivative as a more complex alternative to using the chain rule, emphasizing the latter's efficiency.
  • There is a discussion about the terminology used in defining derivatives, with some participants referencing different historical figures associated with derivative definitions.
  • One participant interprets the original poster's question about differentiating "normally" as referring to the standard derivative of cosine functions, regardless of the inner function.

Areas of Agreement / Disagreement

Participants generally agree on the utility of the chain rule but express differing views on the necessity of its application versus differentiating components directly. The discussion remains unresolved regarding the clarity of the original poster's question and the implications of differentiating "normally."

Contextual Notes

Some participants reference specific examples and definitions that may depend on prior knowledge of calculus concepts, which could limit understanding for those unfamiliar with the terminology or methods discussed.

LinearAlgebra
Messages
22
Reaction score
0
When do you know to use the chain rule??

I really don't understand what this means: dr/dt = dr/ds * ds/dt

Like in one example in the book they have:
r(s) = cos2si + sin2sj + e^-3sk where s = t^4

Then they find dr/dt using the chain rule. My question is why can't you just differentiate it normally by the individual components? How would that be different?
 
Physics news on Phys.org
You use the chain rule when one function is the argument of the other:

For example: let's say we have the function f(x) = (sin(x) + x^2)^3

We differentiate this by asking our selves a simpler question. How would we differentiate x^3. Using the power rule we would get 3x^2. Sadly though f(x) isn’t x^3 it’s (sin(x) + x^2)^3. What the chain rule tells us is that we can differentiate this the same we would differentiate x^3 (pretending (sin(x)+x^2) is x), we just need to tack the derivative of the inside stuff on the end.

So the derivative of sin(x) + x^2 is cos(x) +2x

So we get the derivate of (sin(x) + x^2)^3 as [3(sin(x) + x^2)^2]*( cos(x) +2x)
 
What JonF said is pretty much correct. Heres a way to understand why

[tex]\frac{dy}{dx}= \frac{dy}{du} \cdot \frac{du}{dx}[/tex].

If we want the rate of change of something, and it actually involves 2 functions, we use the chain rule. Sometimes it is also called the function of a function rule.

There are 2 functions, each with their own rates of change. One is the inner function, eg (2si). That is the function f(s)=2si. Then the outer function is cos(2si), or cos(f(s)). If we want the TOTAL rate of change, we take the product of the two rates of change. so we let 2si=u. Then the function can be said to be cos u. dy/du= -sin u. du/dx= 2i. Replace the u with 2si again and your done.

We can see the rule works this way: I'm walking at a certain speed. Someone else is driving 2 times as fast as me. A plane is going 2 times as fast as the car. How do we find the total rate of change, basically how fast the plane is compared to the walker, we multiply the rates of change! This is the chain rule in action!

Have fun :p
 
LinearAlgebra said:
When do you know to use the chain rule??

I really don't understand what this means: dr/dt = dr/ds * ds/dt

Like in one example in the book they have:
r(s) = cos2si + sin2sj + e^-3sk where s = t^4

Then they find dr/dt using the chain rule. My question is why can't you just differentiate it normally by the individual components? How would that be different?
What do you mean by "differentiate it normally by the individual components"? Using the chain rule has nothing to do with the components. They suggest using the chain rule because of that "2s" in the trig functions and the "-3s" in the exponential.

How would YOU differentiate cos 2s "normally"? If you would think "the derivative of cosine is negative sine and then I multiply the the derivative of 2s which is 2 to get -2cos 2x, then you Are using the chain rule on each individual component.
 
LinearAlgebra said:
When do you know to use the chain rule??
...
My question is why can't you just differentiate it normally by the individual components? How would that be different?

Ok, how do you differentiate cos(t^4)? You don't know how to do this without the chain rule. How do you know to use the chain rule? Because it is the composition of functions that you do know how to differentiate: f(x)=x^4 and g(x)=cos(x).
 
matt grime said:
Ok, how do you differentiate cos(t^4)? You don't know how to do this without the chain rule.

Sure we do. Use the Weierstrass definition of the derivative as a limit (Cringe):

[tex]\frac d{dt}\left(\cos(t^4)\right) =<br /> \lim_{\Delta t\to 0} \frac{\cos((t+\Delta t)^4)-\cos(t^4)}{\Delta t}[/tex]

After way too much work (trig identities, binomial expansion, small angle approximation), one arrives at

[tex]\frac d{dt}\left(cos(t^4)\right) = -4t^3 \sin(t^4)[/tex]

Of course, one could just use the chain rule and get the same answer quickly. We use the chain rule because it is a lot easier than using the formal, limit-based definition of the derivative.
 
Was the 'you' in my sentence confusing you, D H?
 
No. I was simply expounding on why the chain rule is useful. Who would ever want to revert to limit-based calculation of a derivative when a few short-hand techniques like the chain rule, product rule, quotient rule, etc. make the job of computing derivatives easy?
 
I never knew that Weierstrass made that definition, I've always been told that the expression was known as Newton's Quotient Expression or something like that, and that Newton made that one.
 
  • #10
BTW everyone seems to be confused when the OP said differently normally, I am pretty sure he meant that if he saw cos(BLAH) his derivative is -sin(BLAH), no matter what his BLAH is.
 

Similar threads

  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 12 ·
Replies
12
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K