How Does Strict Convexity Influence the Gradient Inequality?

  • Thread starter Thread starter kingwinner
  • Start date Start date
  • Tags Tags
    Convex Functions
Click For Summary

Homework Help Overview

The discussion revolves around the implications of strict convexity in the context of a differentiable function defined on an open convex set. The original poster seeks to prove that for strictly convex functions, a specific gradient inequality holds between two distinct points in the domain.

Discussion Character

  • Exploratory, Assumption checking, Mathematical reasoning

Approaches and Questions Raised

  • The original poster attempts to manipulate the definition of strict convexity to establish the desired inequality. They express uncertainty about transitioning from a weak inequality to a strict one and seek clarification on justifying this rigorously.
  • Another participant suggests considering the case where the equality holds to explore potential contradictions, indicating a method of proof by contradiction.
  • Further, a participant introduces the idea of analyzing a univariate function derived from the original function to investigate the behavior of the gradient and its implications for strict convexity.

Discussion Status

Contextual Notes

Participants note the challenge of proving the strict inequality given the conditions of the problem, with emphasis on the subtleties involved in the limit process and the implications of strict convexity.

kingwinner
Messages
1,266
Reaction score
0

Homework Statement


Let S C Rd be open and convex.
Let f be C1(S).
Prove that if f is strictly convex, then f(y) > f(x) + grad f(x) o (y-x) for all x,y in S such that x≠y.
(note: "o" means dot product)


Homework Equations


Strictly convex functions

The Attempt at a Solution


Suppose f is stirctly convex.
Then for all x,y in S such that x≠y, 0<s<1,
f[(1-s)x+sy] < (1-s)f(x) + s f(y)
=> f[x+s(y-x)] < f(x) + s[f(y)-f(x)]
=> [f(x+s(y-x)) - f(x)] /s < f(y)-f(x)
=> lim [f(x+s(y-x)) - f(x)] /s lim [f(y)-f(x)]
s->0------------------------s->0
(recall from calculus, we know that a strict inequality, IN THE LIMIT, becomes a loose inequality. And that's why I must write ≤ when I take the limit)

=> grad f(x) o (y-x) ≤ f(y)-f(x)
=> f(y) f(x) + grad f(x) o (y-x) for all x,y in S such that x≠y.

BUT what I actually want to prove is >, not ≥. How can I rigorously justify that it should be >? I'm really running out of ideas regarding proving this subtle point.

Any help will be much appreciated!
 
Physics news on Phys.org
kingwinner said:

Homework Statement


Let S C Rd be open and convex.
Let f be C1(S).
Prove that if f is strictly convex, then f(y) > f(x) + grad f(x) o (y-x) for all x,y in S such that x≠y.
(note: "o" means dot product)

Homework Equations


Strictly convex functions

The Attempt at a Solution


Suppose f is stirctly convex.
Then for all x,y in S such that x≠y, 0<s<1,
f[(1-s)x+sy] < (1-s)f(x) + s f(y)
=> f[x+s(y-x)] < f(x) + s[f(y)-f(x)]
=> [f(x+s(y-x)) - f(x)] /s < f(y)-f(x)
=> lim [f(x+s(y-x)) - f(x)] /s lim [f(y)-f(x)]
s->0------------------------s->0
(recall from calculus, we know that a strict inequality, IN THE LIMIT, becomes a loose inequality. And that's why I must write ≤ when I take the limit)

=> grad f(x) o (y-x) ≤ f(y)-f(x)
=> f(y) f(x) + grad f(x) o (y-x) for all x,y in S such that x≠y.

BUT what I actually want to prove is >, not ≥. How can I rigorously justify that it should be >? I'm really running out of ideas regarding proving this subtle point.

Any help will be much appreciated!

Suppose x and y are given points in S, with x ≠ y. What happens if you assume f(y) = f(x) + grad f(x) o (y-x)?

RGV
 
Last edited:
Ray Vickson said:
Suppose x and y are given points in S, with x ≠ y. What happens if you assume f(y) = f(x) + grad f(x) o (y-x)?

RGV

I was trying to set up something like this and I need to try to reach a contradiction. But the problem is I can't find where the contradiction is. (or in other words, I need to show that f(y) = f(x) + grad f(x) o (y-x) => x=y.[/color] But I can't think of a way of showing it...)
What kind of manipluations do I have to do with f(y) = f(x) + grad f(x) o (y-x)? I have no idea what to do next after assuming f(y) = f(x) + grad f(x) o (y-x).
 
kingwinner said:
I was trying to set up something like this and I need to try to reach a contradiction. But the problem is I can't find where the contradiction is. (or in other words, I need to show that f(y) = f(x) + grad f(x) o (y-x) => x=y.[/color] But I can't think of a way of showing it...)
What kind of manipluations do I have to do with f(y) = f(x) + grad f(x) o (y-x)? I have no idea what to do next after assuming f(y) = f(x) + grad f(x) o (y-x).

Let p = y-x in R^d and consider the univariate function h(t) = f(x + t*p) for real t. We have h'(t) = grad f(x + t*p) ° p, so h'(0) = grad f(x) ° p . One can show from strict convexity (just by manipulating the convexity-defining property) that the ratio r(s) = \frac{h(s)-h(0)}{s} is strictly increasing in s > 0, so for small positive s we have r(1) > r(s). Fix small s0 > 0 and look at the limit of r(s) as s --> 0+. We have r(1) > r(s0) >= limit r(s) = h'(0) = grad f(x) ° p .

Note: I leave it to you to show that r(s) is a strictly increasing function.

RGV
 
Last edited:

Similar threads

Replies
2
Views
2K
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
5
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
6
Views
3K
Replies
2
Views
1K