Test of Second Partials - Proof
circa415
Nov4-04, 09:47 PM
Can anyone give me a proof or some kind of general explanation of why there is a local max at (a,b) if D>0 and fxx<0, a min if D>0 and fxx>0, etc.? My textbook doesn't give any kind of explanation at all and I'm just a little curious as to why it works.
Second Derivatives Test:
Suppose the second partial derivatives of f are continuous on a disk with center (a,b), and suppose that f_x(a,b)=0 and f_y(a,b)=0 [that is, (a,b) is a critical point of f]. Let
D=D(a,b)=f_{xx}(a,b)f_{yy}(a,b)-[f_{xy}(a,b)]^2
(a) If D>0 and f_{xx}(a,b)>0, then f(a,b) is a local minimum.
(b) If D>0 and f_{xx}(a,b)<0, then f(a,b) is a local maximum.
(c) If D<0, then f(a,b) is not a local maximum or minimum.
Proof of part (a):
We compute the second-order directional derivative of f in the direction of \vec u = \langle h, k \rangle. The first-order derivative is given by:
D_uf=f_xh+f_yk \quad \mbox{(from a different theorem)}
Applying this theorem a second time, we have:
\begin{eqnarray}
D^2_uf & = & D_u(D_uf)=\frac{\partial}{\partial x}(D_uf)h+\frac{\partial}{\partial y}(D_uf)k \nonumber \\
& = & (f_{xx}h+f_{yx}k)h+(f_{xy}h+f_{yy}k)k \nonumber\\
& = & f_{xx}h^2+2f_{xy}hk+f_{yy}k^2 \quad \mbox{(by Clairaut's theorem)}\nonumber
\end{eqnarray}
If we complete the square in this expression, we obtain:
D_u^2f=f_{xx}\left(h+\frac{f_{xy}}{f_{xx}}k\right)^2+\frac{k^2}{f_{xx}}(f_{xx}f_{yy}-f^2_{xy}) \quad \mbox{(Equation 1)}
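As a quick check of the completed square (a sketch that assumes f_{xx}\neq 0, which holds near (a,b) in part (a)), expanding the two terms separately and adding them recovers the quadratic form from the previous step:
\begin{eqnarray}
f_{xx}\left(h+\frac{f_{xy}}{f_{xx}}k\right)^2 & = & f_{xx}h^2+2f_{xy}hk+\frac{f^2_{xy}}{f_{xx}}k^2 \nonumber \\
\frac{k^2}{f_{xx}}(f_{xx}f_{yy}-f^2_{xy}) & = & f_{yy}k^2-\frac{f^2_{xy}}{f_{xx}}k^2 \nonumber \\
\mbox{sum} & = & f_{xx}h^2+2f_{xy}hk+f_{yy}k^2 \nonumber
\end{eqnarray}
The f^2_{xy}k^2/f_{xx} terms cancel, so the sign of D_u^2f is controlled entirely by the signs of f_{xx} and D.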
We are given that f_{xx}(a,b)>0 and D(a,b)>0. But f_{xx} and D=f_{xx}f_{yy}-f^2_{xy} are continuous functions, so there is a disk B with center (a,b) and radius \delta>0 such that f_{xx}>0 and D>0 whenever (x,y) is in B. Therefore, by looking at Equation 1, we see that D_u^2f(x,y)>0 whenever (x,y) is in B. This means that if C is the curve obtained by intersecting the graph of f with the vertical plane through P(a,b,f(a,b)) in the direction of \vec u, then C is concave upward on an interval of length 2\delta. This is true in the direction of every vector \vec u, so if we restrict (x,y) to lie in B, the graph lies above its horizontal tangent plane at P. Thus f(x,y)\geq f(a,b) whenever (x,y) is in B. This shows that f(a,b) is a local minimum.
Parts (b) and (c) have similar proofs.
HallsofIvy
Nov5-04, 07:15 AM
Notice that what Galileo is really doing is expanding f(x,y) in a Taylor series in two variables. The first derivatives at a critical point are 0, so there are no first-power terms. For x, y very close to the critical point, we can ignore higher powers, so we have only the constant term (the value of f AT the critical point) and the quadratic terms. We can always change the coordinate system to eliminate any "xy" term and have left ax^2 + by^2 (with a, b positive, negative, or 0). If both a and b are positive, that's a minimum; if both are negative, a maximum; if they have opposite signs, neither. If either is 0, we need to look at higher powers.
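Written out, the expansion being described would look roughly like this. Here (a,b) is the critical point as in the statement of the theorem (not the coefficients a, b just mentioned), h and k are the small offsets from it, and all partials are evaluated at (a,b):
f(a+h,b+k) \approx f(a,b)+\frac{1}{2}\left(f_{xx}h^2+2f_{xy}hk+f_{yy}k^2\right)
The first-order terms f_xh+f_yk are missing because both partials vanish at a critical point, and the bracket is the same quadratic form that appears as D_u^2f in the proof above.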
What's REALLY happening is that, for a real-valued function from R^2 to R, the second derivative is a linear function from R^2 to R^2 which can be represented as a 2 by 2 matrix (the entries are: first row f_{xx}, f_{xy}; second row f_{yx}, f_{yy}).
Since that is a symmetric matrix, we can always change the coordinate system to make it a diagonal matrix, resulting in the "ax^2 + by^2" above (a, b are the diagonal elements). Of course the determinant of such a matrix is ab, so everything depends upon whether that determinant is positive or negative. But the determinant is independent of the choice of coordinate system, so we can just look at the determinant of the original matrix: D = f_{xx}f_{yy} - f_{xy}^2.
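A small numerical sketch of the diagonalization argument, for illustration only. The function name `hessian_eigenvalues` and the sample function f(x,y) = x^2 + 3xy + y^2 are assumptions made for this example and do not come from the thread. The diagonal entries a, b after the change of coordinates are the eigenvalues of the matrix, and their product should equal the determinant D of the original matrix.

```python
import math


def hessian_eigenvalues(fxx, fxy, fyy):
    """Eigenvalues of the symmetric 2x2 matrix [[fxx, fxy], [fxy, fyy]].

    Uses the closed form for a symmetric 2x2 matrix: the eigenvalues are
    the mean of the diagonal plus or minus a non-negative "radius".
    """
    mean = (fxx + fyy) / 2.0
    radius = math.hypot((fxx - fyy) / 2.0, fxy)
    return mean - radius, mean + radius


# Hypothetical example: f(x, y) = x^2 + 3xy + y^2 at the critical point (0, 0),
# where fxx = 2, fxy = 3, fyy = 2.
fxx, fxy, fyy = 2.0, 3.0, 2.0
a, b = hessian_eigenvalues(fxx, fxy, fyy)
D = fxx * fyy - fxy ** 2

print(a, b)      # the diagonal entries after diagonalizing
print(a * b, D)  # product of eigenvalues vs. determinant of the original matrix
```

Here the eigenvalues have opposite signs, which matches D < 0: the point is a saddle even though f_{xx} and f_{yy} are both positive.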
The second derivative test can seem rather tricky and abstract to begin with, but, as HallsofIvy said, what you really get out of it is one of the following:
1) Locally, your graph looks like a paraboloid opening upwards (i.e., you've got positive curvatures and a minimum point)
2) Locally, your graph looks like a paraboloid opening downwards (i.e., you've got negative curvatures and a maximum point)
3) Locally, your graph looks like a "saddle", neither maximum nor minimum.
4) The second derivatives test is insufficient to determine the local behaviour (i.e., you must look at higher derivatives, assuming those exist)
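The four outcomes above can be sketched as a small decision procedure. This is an illustrative sketch, not code from the thread: the function name `classify_critical_point` and the four sample functions are assumptions chosen for the example, and the inputs are taken to be the second partials already evaluated at a critical point.

```python
def classify_critical_point(fxx, fxy, fyy):
    """Apply the second derivatives test to the second partials at a critical point."""
    D = fxx * fyy - fxy ** 2
    if D > 0:
        # D > 0 forces fxx != 0, so only its sign matters here.
        return "local minimum" if fxx > 0 else "local maximum"
    if D < 0:
        return "saddle"
    return "inconclusive"  # D == 0: the test gives no information


# Hypothetical sample functions, each with a critical point at (0, 0);
# the tuples are (fxx, fxy, fyy) evaluated there.
examples = {
    "x^2 + y^2": (2.0, 0.0, 2.0),
    "-(x^2 + y^2)": (-2.0, 0.0, -2.0),
    "x^2 - y^2": (2.0, 0.0, -2.0),
    "x^4 + y^4": (0.0, 0.0, 0.0),
}

for name, partials in examples.items():
    print(name, "->", classify_critical_point(*partials))
```

The last example shows why case 4 matters: x^4 + y^4 clearly has a minimum at the origin, but every second partial vanishes there, so the test cannot detect it.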