Solve Jensen's Inequality: Proof & Troubleshooting

  • Context: Graduate 
  • Thread starter Thread starter GregA
  • Start date Start date
  • Tags Tags
    Inequality
Click For Summary
SUMMARY

The discussion focuses on the proof of Jensen's Inequality, which states that for a random variable X with finite expectation E(X) and a convex function f, the inequality f(E(X)) ≤ E(f(X)) holds. The user encounters difficulty transitioning from the expression E(f(X)) ≥ E(λ(X - E(X)) + f(E(X))) to the conclusion f(E(X)). The confusion arises from the incorrect application of linearity in expectation, particularly in handling the term E(λ(X - E(X))). The user ultimately resolves the misunderstanding regarding the treatment of the constant λ.

PREREQUISITES
  • Understanding of convex functions in mathematics
  • Familiarity with the properties of expectation in probability theory
  • Knowledge of linearity of expectation
  • Basic proficiency in mathematical proofs and inequalities
NEXT STEPS
  • Study the properties of convex functions and their implications in probability theory
  • Learn about Jensen's Inequality and its applications in statistics
  • Explore the concept of expectation and its linearity in depth
  • Practice mathematical proofs involving inequalities and expectations
USEFUL FOR

Mathematicians, statisticians, and students studying probability theory who seek to deepen their understanding of inequalities and expectations in random variables.

GregA
Messages
210
Reaction score
0
I am having trouble with the following proof of Jensen's Inequality. I'll post the statement of the theorem, it's proof, and where I'm having problems:

Let X be a random variable
with E(X) < \infty, and let f : \mathbb{R}\rightarrow\mathbb{R} be a convex function. Then
\begin{equation*}f(E(X))\leq E(f(X))\end{equation*} [1]​

where a function is convex if \forall x_0\in \mathbb{R},\ \exists \lambda \in \mathbb{R}: f(x)\geq \lambda(x-x_0)+f(x_0)

Proof: Let f be convex and let \lambda \in \mathbb{R} be such that
f(X)\geq \lambda(x-E(X))+f(E(X)) [2]​
then
E(f(x))\geq E(\lambda(x-E(X))+f(E(X))) [3]
=f(E(X)) [4]​
Q.E.D




As is probably clear from my having problems with this, probability and dealing with expectations isn't my strong point but getting from [3] to [4] isn't looking obvious to me at all since if I expand RHS of [3] (and assume x is meant to be X, a typo) then unless I'm wrong I get:

E(f(X))\geq E(\lambda(X-E(X))+f(E(X)))=E(\lambda(X-E(X)))+E(f(E(X))) (using E(g(X)+h(X))=E(h(X))+E(f(X)))
=\lambda E(X)-E(X)+f(E(X)) (using E(aX+b)=aE(X)+b and E(a)=a (where a,b are constants))
=(\lambda-1)E(X)+f(E(X))
and this isn't [4] :confused:
Where am I going wrong?
 
Last edited:
Physics news on Phys.org
E is a linear function. Therefore E(λ(x-E(x))=λ(E(x-E(x))=λ(E(x)-E(x))=0.
 
aggh...Now I see where I've pulled the lambda out incorrectly it's obvious! cheers :)
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K