Proving Convexity of a Function with Directional Derivative

  • Context: Graduate 
  • Thread starter Thread starter stanley.st
  • Start date Start date
Click For Summary
SUMMARY

The discussion centers on proving the convexity of a function using the directional derivative. It establishes that a function f is convex if and only if for all x, y, the inequality f(x) - f(y) ≥ ∇f(y)ᵀ(x - y) holds. The proof begins with the definition of convexity and leads to the conclusion by taking the limit as λ approaches 0. A key point raised is the necessity of using a unit vector in the definition of the directional derivative, which is addressed by normalizing the vector (x - y).

PREREQUISITES
  • Understanding of convex functions and their properties
  • Familiarity with directional derivatives and their definitions
  • Knowledge of gradient notation and vector calculus
  • Basic principles of limits in calculus
NEXT STEPS
  • Study the formal definition of convex functions in mathematical analysis
  • Learn about directional derivatives and their applications in optimization
  • Explore the relationship between gradients and convexity in multivariable calculus
  • Investigate advanced topics in calculus, such as the implications of limits and continuity
USEFUL FOR

Mathematicians, students studying calculus or optimization, and anyone interested in the properties of convex functions and their applications in various fields.

stanley.st
Messages
30
Reaction score
0
I'm reading book and there's proposition with convex function

Function f is convex if and only if for all x,y
(*)\quad f(x)-f(y)\ge\nabla f(y)^T(x-y)

It's proven in this way: From definition of convexity

f(\lambda x+(1-\lambda)x)\le \lambda f(x)+(1-\lambda)f(y)

we have

\frac{f(y+\lambda(x-y))-f(y)}{\lambda}\le f(x)-f(y)

Setting \lambda\to0 we have (*).

-------------------------------------------------

My problem is in last sentece. I understand formula on left-hand side as directional derivative. But in definition of directional derivative is needed to (x-y) be an unit vector. It is not. So it is not a directional derivative. If I define

\lambda:=\frac{\mu}{\Vert x-y\Vert}

I have directional derivative on left hand side

\frac{f(y+\mu\frac{(x-y)}{\Vert x-y\Vert})-f(y)}{\mu}\le\frac{f(x)-f(y)}{\Vert x-y\Vert}

But in this way I don't obtain result (*) but I obtain this

\nabla f(y)^T(x-y)\le\frac{f(x)-f(y)}{\Vert x-y\Vert}
 
Physics news on Phys.org

Similar threads

  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
Replies
3
Views
2K