SpY]
- 63
- 0
Homework Statement
Let \textbf{F}: \textbf{R}^m \rightarrow \textbf{R}^n and \textbf{G}: \textbf{R}^p \rightarrow \textbf{R}^m
Prove that ({\textbf{F} \circ \textbf{G}})'(x) = {\textbf{F}}'(\textbf{G}(\textbf{x})) {\textbf{G}}'(\textbf{x})
Homework Equations
Assume the single variable chain rule, that is for
f, g: \textbf{R} \rightarrow \textbf{R}
\frac {d(f \circ g)}{dt}(t) = \frac {df}{dt} \big]_{g(t)} \frac {dg}{dt}(t)
The Attempt at a Solution
I figured using the single variable result by extending it to \textbf{R}^2 first, a sort of subproof which uses the mean value theorem:
Let f: \textbf{R}^2 \rightarrow \textbf{R} and \textbf{G}: \textbf{R} \rightarrow \textbf{R}^2
Then
f(\textbf{G}(t+h)) - f(\textbf{G}(t)) = f(G_1(t+h), G_2(t+h)) - f(G_1(t), G_2(t+h)) + f(G_1(t), G_2(t+h)) - f(G_1(t), G_2(t))
The second and third terms change nothing, I will use them later
Then by the first mean value theorem,
\exists k_1, k_2 \in (0,h) such that
G_1 (t+h) - G_1 (t) = h{G_1}'(t+k_1)
G_2 (t+h) - G_2 (t) = h{G_2}'(t+k_2)
Expanding the first two terms previously by substituting G_1(t+h)
f(G_1(t+h), G_2(t+h)) - f(G_1(t), G_2(t+h))
= f(h{G_1}'(t+k_1) + G_1(t), G_2(t+h))- f(G_1(t), G_2(t+h))
= h{G_1}'(t+k_1) \frac {\partial df}{\partial dx_1} \big]_{(p_1 + G_1(t), G_2(t+h))}
Where p_1 \in (0, h{G_1}'(t+k_1))
Similarly for the next two terms substituting G_2(t+h)
f(G_1(t), G_2(t+h)) - f(G_1(t), G_2(t))
f(G_1(t), h{G_2}'(t+k_2) + G_2(t)) - f(G_1(t), G_2(t))
= h{G_2}'(t+k_2) \frac {\partial df}{\partial dx_2} \big]_{(G_1(t), p_2 + G_2(t))}
Where p_2 \in (0, h{G_1}'(t+k_2))
Combining this all together and dividing by h:
\frac {f(\textbf{G}(t+h)) - f(\textbf{G}(t))}{h}
= {G_1}'(t+k_1) \frac {\partial df}{\partial dx_1} \big]_{(p_1 + G_1(t), G_2(t+h))} + {G_2}'(t+k_2) \frac {\partial df}{\partial dx_2} \big]_{(G_1(t), p_2 + G_2(t))}
Now as h \rightarrow 0, k_1, k_2, p_1, p_2 \rightarrow 0 since they are contained in intervals up to h. The LHS is now the chain derivative
{(f \circ \textbf{G})}'(t) =\lim_{h \to 0} \frac {f(\textbf{G}(t+h)) - f(\textbf{G}(t))}{h}
= {G_1}'(t+k_1) \frac {\partial df}{\partial dx_1} \big]_{(p_1 + G_1(t), G_2(t+h))} + {G_2}'(t+k_2) \frac {\partial df}{\partial dx_2} \big]_{(G_1(t), p_2 + G_2(t))}
= {f}'(\textbf{G} (t)) { \textbf{G}}'(t)
I've tried generalizing this for any n, but it gets rather long so I'm not sure how to put in concisely. After that, I don't know how to take it to the general proof (any m,n) as required.
Thanks
Last edited: