Sign confusion when taking gradient (Newton's Method)

Click For Summary
SUMMARY

The discussion centers on the application of Newton's Method in n-dimensional optimization, specifically regarding the gradient calculation of the function g(x,y) = 1 - (x-1)^4 - (y-1)^4. The confusion arises from the absence of negative signs in the gradient expression, where the correct gradient is given by F(x,y) = [4(x-1)^3, 4(y-1)^3]ᵀ. It is clarified that the gradient vector indicates the direction of steepest ascent, and to find a minimum, one must move in the opposite direction, hence the need for the negative gradient -∇f.

PREREQUISITES
  • Understanding of gradient vectors in multivariable calculus
  • Familiarity with Newton's Method for optimization
  • Knowledge of critical points and local maxima
  • Basic proficiency in mathematical notation and functions
NEXT STEPS
  • Study the derivation of gradients for multivariable functions
  • Learn about the implications of critical points in optimization problems
  • Explore the application of Newton's Method in higher dimensions
  • Investigate the role of negative gradients in finding minima
USEFUL FOR

Students and professionals in mathematics, engineering, and data science who are working with optimization techniques, particularly those utilizing Newton's Method in multivariable contexts.

zmalone
Messages
10
Reaction score
0
I'm watching a lecture on Newton's method with n-dimensions but I am kind of hung up on why the professor did not use the negative sign while taking the first gradient? Is there a rule that explains this or something that I'm forgetting? The rest makes sense but highlighted in red is the part I am confused on if anyone can clear that up I'd appreciate it, thanks!

Where g(x,y) = 1-(x-1)^4-(y-1)^4

local maximum at (1,1) ; critical point at (1,1)

Gradient of g(x,y):

F(x,y,) = [Dg(x,y,)]transpose = [4(x-1)^3 4(y-1)^3]transpose
Why not [-4(x-1)^3 -4(y-1)^3]?

Gradient of F(x,y):

DF(x,y) =
12(x-1)^2 0
0 12(y-1)^2

Screen shot which is probably easier to read:
 

Attachments

  • NewtonMethodQuestionGradient.jpg
    NewtonMethodQuestionGradient.jpg
    14 KB · Views: 516
Physics news on Phys.org
zmalone said:
I'm watching a lecture on Newton's method with n-dimensions but I am kind of hung up on why the professor did not use the negative sign while taking the first gradient? Is there a rule that explains this or something that I'm forgetting? The rest makes sense but highlighted in red is the part I am confused on if anyone can clear that up I'd appreciate it, thanks!

Where g(x,y) = 1-(x-1)^4-(y-1)^4

local maximum at (1,1) ; critical point at (1,1)

Gradient of g(x,y):

F(x,y,) = [Dg(x,y,)]transpose = [4(x-1)^3 4(y-1)^3]transpose
Why not [-4(x-1)^3 -4(y-1)^3]?

Gradient of F(x,y):

DF(x,y) =
12(x-1)^2 0
0 12(y-1)^2

Screen shot which is probably easier to read:

You are right, the minus signs are missing. Anyway, D=0 at the critical point, so you have to investigate the original function.


ehild
 
You are using Newton's method to find what? The gradient vector points in the direction of fastest increase. If you are "following the gradient" to (numerically) find a minimum point, you want to go in the opposite direction, i.e. -\nabla f
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
2K
Replies
2
Views
2K
  • · Replies 24 ·
Replies
24
Views
2K
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
12
Views
2K