Critical points of functions

In summary, the quadratic approximation is used to determine the nature of a critical point near a when the second derivatives are not 0. The higher derivatives are used to help determine the nature of the critical point.
  • #1
Benny
584
0
Hi, I posted a related question (about the 2nd derivative test) a while ago but now that I've read through some relevant theory I'm in a better position to find out about the following.

In the notes I have, the behaviour of a function [itex]f:U \subset R^n \to R[/itex] near a critical point point [itex]\mathop a\limits^ \to [/itex] is considered. To do this, a 'quadratic' approximation to f near a is considered.

[tex]
f\left( {\mathop a\limits^ \to + \mathop h\limits^ \to } \right) \approx f\left( {\mathop a\limits^ \to } \right) + \frac{1}{2}\left( {D_{\mathop h\limits^ \to } } \right)^2 f\left( {\mathop a\limits^ \to } \right)
[/tex]

Where D_h is the directional derivative of f in the direction of h.

Then there's some stuff on matrix representation, eigenvalues and Hessians. The important point is when the determinant of the Hessian is equal to zero in which case the test of the nature of the critical point is inconclusive. Consequently, there is a need to look at higher derivatives.

My questions are; why is a 'quadratic' approximation considered in stead of for instance a 'linear' or a 'quartic' approximation? Is it because the 2nd derivative test is supposed to use 2nd derivatives? Also, how does looking at higher derivatives (when the 2nd derivative test is inconclusive) help?

Does the math just work out? If the 2nd derivative test is inconclusive then how does considering higher derivatives help? If I use a 3rd degree taylor polynomial to approximate f near a then that's just a very rough approximation. How can I be sure about the nature of the stationary point from the approximation? Any help would be good thanks.
 
Physics news on Phys.org
  • #2
A 'quadratic' approximation is considered instead of 'linear' approximation precisely because in this case the first derivatives are 0. The 'linear' approximation is no different from the constant value and gives no more information. As long as the second derivatives are not 0 then the 'quadratic' approximation is the next best. Of course, it might happen that both first and second derivatives are 0- then you would have to go to a cubic approximation.

1 dimensional examples: y= x2 has derivative 0 at x= 0 and so a critical point there. The second derivative is 2 which is positive and so x= 0 is a minimum. The "Taylor's polynomial" expansion about x= 0 is, of course, just 0+ 0x+ x2. If y= x3, again the first derivative is 0 and so there is a critical point there. Checking the second derivative, we see that it is also 0 so we need to go to the 3rd derivative. Here we see that the Taylor's polynomial about x= 0 is
0+ 0x+ 0x2+ x3 and so x= 0 is a "saddle point".

You use the derivative where you finally get something other than 0.
 
  • #3
Thanks for the help. But I'm not sure how you concluded that f(x) = x^3 has saddle point at x = 0 from the fact that you need to differentiate 3 times to get to a derivative which is non-zero at x = 0. If f(x) = x^4 then then I would need to differentiate 4 times to get to a derivative which is non-zero. But in this case x = 0 corresponds to a local minimum.
 
  • #4
In these cases, when the first non-zero derivative is of odd order, then we have a point of inflection (if I'm not mistaking) and not an extremum.
 
  • #5
I would imagine the conclusion was reached because it's obvious what that cubic looks like, ie use your common sense: if it locally looks like y=x^3 then it is a saddle, if locally it looks like x^4 then it will be a maximum.
 
  • #6
I asked the question because I am not sure what to do with a function of two variables when the Hessian matrix corresponding to a critical point is the zero matrix. I got the impression that there was a reason for using f(x) = x^3 as an example. Usually, I don't find it to be all that easy to visualise a 3 dimensional surface.

Of course its obvious what the nature of the critical point of f(x) = x^3 at x = 0 is. The nature of the critical point of f(x,y) = sin(x) + sin(y) + sin(x+y) at (x,y) = (pi,pi) is not so obvious. I could 'plug' that point into the equation and I'll find that f = 0 at that point. But what do I do then? Consider the behaviour of f along lines like x = 0, y = 0, y = x? I don't know how to utilise that method and even then it seems like a fudge method to determine the nature of the critical point.

By the way, if f(x,y) is continuous in all of R^2 and I have a critical point which isn't a local maximum or minimum then is that point necessarily a saddle?
 
  • #7
Look at the graphs! f(x,y)= a+ x2+ y2 has a minimum at (0,0), f(x,y)= a- x2- y2 has a maximum at (0,0), f(x,y)= a- x2+ y2 or f(x,y)= a+ x2- y[/sup]2[/sup] has a saddle point at (0,0). For a more complicated function, write out the Taylor's series. At a critical point, the linear terms will be 0 so look at the quadratic terms- they must be one of the above. If the second derivatives are also 0 then the quadratic terms are also 0 so you need to look at the third derivatives. But of course x3 or x2y or xy2 or y3 are all less than 0 on one side and greater than 0 on the other- so saddle point. If it happens that all first, second, and third go on to the fourth derivatives to look at the fourth powers.

By the way, if f(x,y) is continuous in all of R^2 and I have a critical point which isn't a local maximum or minimum then is that point necessarily a saddle?
Except, of course, if f(x,y) is a constant.
 
  • #8
Ok thanks for the help.
 

1. What is a critical point of a function?

A critical point of a function is a point on the graph where the slope of the tangent line is zero, or where the derivative of the function is undefined. It is also known as a stationary point.

2. How do you find critical points of a function?

To find the critical points of a function, set the derivative of the function equal to zero and solve for the variable. The resulting values are the critical points. Additionally, you can also plot the function on a graph and visually identify the points where the slope is zero.

3. What is the significance of critical points in calculus?

Critical points are important in calculus because they give us information about the behavior of a function. They can tell us where the function is increasing, decreasing, or changing direction. They are also used to find the maximum and minimum values of a function.

4. Can a function have more than one critical point?

Yes, a function can have multiple critical points. This can happen when the function has multiple local maximum or minimum points, or when the function has a point of inflection.

5. How do critical points relate to optimization problems?

In optimization problems, we are trying to find the maximum or minimum value of a function. Critical points are used to determine these values, as the maximum or minimum points of a function occur at the critical points. Therefore, identifying the critical points of a function is an important step in solving optimization problems.

Similar threads

  • Calculus and Beyond Homework Help
Replies
4
Views
286
  • Calculus and Beyond Homework Help
Replies
8
Views
455
  • Calculus and Beyond Homework Help
Replies
14
Views
941
  • Calculus and Beyond Homework Help
Replies
3
Views
264
  • Calculus and Beyond Homework Help
Replies
5
Views
996
  • Calculus and Beyond Homework Help
Replies
2
Views
1K
  • Calculus and Beyond Homework Help
Replies
4
Views
2K
  • Calculus and Beyond Homework Help
Replies
6
Views
2K
Replies
3
Views
1K
  • Calculus and Beyond Homework Help
Replies
9
Views
3K
Back
Top