Gradient Descent and Cauchy Method in Differential Equations

Click For Summary
The discussion centers on the use of the variable theta (Θ) in the context of gradient descent and the Cauchy method applied to differential equations. Theta is defined as greater than zero, indicating a positive step size in the direction of the gradient. The negative value of alpha (α = -θX) represents a slight change along the tangent plane, moving towards the point defined by the differential equation. This negative aspect is crucial as it allows the point to approach the original coordinates as theta approaches zero. Overall, the conversation emphasizes the mathematical rationale behind these choices in the context of optimization methods.
kidsasd987
Messages
142
Reaction score
4
http://www.math.uiuc.edu/documenta/vol-ismp/40_lemarechal-claude.pdf



I don't understand why we use theta for equation (1)

Θ>0

but why α=-θX?



Thanks.
 
Physics news on Phys.org
It was already given that X is the derivative with respect to x and so \theta X is gives a slight change along the tangent plane in the x direction. I presume it is negative because they want to think of \theta going to 0 as the point, (x_0- \theta X, y_0- \theta Y, z_0-\theta Z), moving toward (x_0, y_0, z_0) along the path defined by the differential equation.
 
  • Like
Likes 1 person

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K