How can we choose y and k to express f(g(x+h)) in the desired form?

Bashyboy
Messages
1,419
Reaction score
5
Hello everyone,

I am reading a proof of the chain rule given in this link: http://kruel.co/math/chainrule.pdf

Here is the portion I am troubled with:

"We know use these equations to rewrite f(g(x+h)). In particular, use the first equation to obtain

f(g(x+h)) = f(g(x) + [g'(x) + v]h),

and use the second equation applied to the right-hand-side with k = [g'(x) + v]h..."


How do they arrive at this, k = [g'(x) + v]h. Based above previous equations and definitions, I don't see how it is possible to write k in terms of the derivative of g(x), v, and h.

Could someone help me?
 
Physics news on Phys.org
Bashyboy said:
Hello everyone,

I am reading a proof of the chain rule given in this link: http://kruel.co/math/chainrule.pdf

Here is the portion I am troubled with:

"We know use these equations to rewrite f(g(x+h)). In particular, use the first equation to obtain

f(g(x+h)) = f(g(x) + [g'(x) + v]h),

and use the second equation applied to the right-hand-side with k = [g'(x) + v]h..."How do they arrive at this, k = [g'(x) + v]h. Based above previous equations and definitions, I don't see how it is possible to write k in terms of the derivative of g(x), v, and h.

Could someone help me?

Start with:
<br /> f(y + k) = f(y) + k(f&#039;(y) + w) \\<br /> g(x + h) = g(x) + h(g&#039;(x) + v)<br />

We want to calculate f(g(x+h)) - f(g(x)), so first we need f(g(x+h)). From the second equation,
<br /> f(g(x + h)) = f(g(x) + h(g&#039;(x) + v))<br />
and now we apply the first equation with y = g(x) and k = h(g&#039;(x) + v).
 
Last edited:
Bashyboy said:
Hello everyone,

I am reading a proof of the chain rule given in this link: http://kruel.co/math/chainrule.pdf

Here is the portion I am troubled with:

"We know use these equations to rewrite f(g(x+h)). In particular, use the first equation to obtain

f(g(x+h)) = f(g(x) + [g'(x) + v]h),

and use the second equation applied to the right-hand-side with k = [g'(x) + v]h..."


How do they arrive at this, k = [g'(x) + v]h. Based above previous equations and definitions, I don't see how it is possible to write k in terms of the derivative of g(x), v, and h.

Could someone help me?
What "previous" equation or definition did they have involving k? It looks to me like they are defining k to be [g'(x)+ v]h. Had they already defined it as something else?
 
Well, if they are defining k as [g'(x) + v]h, that would seem awfully arbitrary. What is the justification for such a definition?
 
Bashyboy said:
Well, if they are defining k as [g'(x) + v]h, that would seem awfully arbitrary. What is the justification for such a definition?

Go back to here:
<br /> f(g(x+h))=f(g(x)+h(g′(x)+v))<br />
We also have
<br /> f(y + k) = f(y) + k(f&#039;(y) + w))<br />
which holds for all y and for all k.

Thus, to express f(g(x+h)) in the form f(g(x)) + (\mbox{something involving $x$ and $h$}), which is what we must do to attain our ultimate goal of finding f(g(x+h))- f(g(x)), we need to choose y and k subject to
y + k = g(x) + h(g&#039;(x) + v).
What choice for y and k would you make here?
 
Back
Top