Minimizing cost function Statistics

colstat · Jan 27, 2011

Homework Statement

Let X and Y be two unknown variables with E(Y)=[tex]\mu[/tex] and EY² < [tex]\infty[/tex].

Homework Equations

a. Show that the constant c that minimizes E(Y-c)² is c=[tex]\mu[/tex].
b. Deduce that the random variable f(X) that minimizes E[(Y-f(X))²|X] is f(X)= E[Y|X].
c. Deduce that the random variable f(X) that minimizes E[(Y-f(X))²] is also f(X)= E[Y|X].

The Attempt at a Solution

a. E(Y-c)² = E[Y²-2Y*c+c²]
other ideas: Can I do the derivative w.r.t. Y, set it equal to zero. But there is an expectation operator, how do you take expectation through the operator.
I can't use a posterior mean, the problem did not specify distr.of the r.v.

b, c. no ideas yet

LCKurtz · Jan 27, 2011

colstat said:

Homework Statement

Let X and Y be two unknown variables with E(Y)=[tex]\mu[/tex] and EY² < [tex]\infty[/tex].
Homework Equations

a. Show that the constant c that minimizes E(Y-c)² is c=[tex]\mu[/tex].
b. Deduce that the random variable f(X) that minimizes E[(Y-f(X))²|X] is f(X)= E[Y|X].
c. Deduce that the random variable f(X) that minimizes E[(Y-f(X))²] is also f(X)= E[Y|X].

The Attempt at a Solution

a. E(Y-c)² = E[Y²-2Y*c+c²]
other ideas: Can I do the derivative w.r.t. Y, set it equal to zero. But there is an expectation operator, how do you take expectation through the operator.
I can't use a posterior mean, the problem did not specify distr.of the r.v.

b, c. no ideas yet

For (a) Remember, the variable here is c, not Y. Start by using linearity to expand
E[Y²-2Y*c+c²] and remember that.

colstat · Jan 28, 2011

LCKurtz,
E[Y²-2Y*c+c²]=E[Y²]-2E[Y]E[c]+E[c²]],
is that what you meant by linearity to expand?
But how do you solve for c, really!

correction from my post: "how do you take derivative through operator"

LCKurtz · Jan 28, 2011

colstat said:

LCKurtz,
E[Y²-2Y*c+c²]=E[Y²]-2E[Y]E[c]+E[c²]],
is that what you meant by linearity to expand?
But how do you solve for c, really!

correction from my post: "how do you take derivative through operator"

Close, but the way you have written the middle term leads me to believe you don't quite understand. The expected value operation is linear means that if X and Y are random variables and c is a constant then

1. E(X + Y) = E(X) + E(Y)

and

2. E(cX) = cE(X)

Use those two properties (carefully) on E[Y²-2Y*c+c²].

And remember, E(Y²) and μ = E(Y) are just numbers. You are trying to minimize a function of c, just like you maximized and minimized functions of x in calculus.

colstat · Jan 28, 2011

Yes, I know about the expectation part wait, I still don't get it. Do I use derivative to do it?

LCKurtz · Jan 28, 2011

colstat said:

Yes, I know about the expectation part wait, I still don't get it. Do I use derivative to do it?

Show me the function of c that you got when you expanded it.

colstat · Jan 29, 2011

E[Y²-2Y*c+c²]
=E[Y²]-2E[Y]E[c]+E[c²]]
=E[Y²]-2cE[Y]+c²
=E[Y²]-2c[tex]\mu[/tex]+c²

statdad · Jan 29, 2011

OK, so, if you have

[tex] E[(Y-\mu)^2] = E[Y^2-2c\mu + c^2[/tex]

as a function of [itex]c[/itex], how would you minimize it?

LCKurtz · Jan 29, 2011

colstat said:

E[Y²-2Y*c+c²]
=E[Y²]-2E[Y]E[c]+E[c²]]
=E[Y²]-2cE[Y]+c²
=E[Y²]-2c[tex]\mu[/tex]+c²

While I wouldn't quibble with the result, my objection to your expansion is your writing

E(-2Yc) = -2E(Y)E(c)

with no additional commentary. It looks like you are treating c as a random variable and using the "fact" that E(XY) = E(X)E(Y), which is not generally true and certainly not one of the two linearity properties. Perhaps you understand what you are doing at that step, but you haven't convinced me of it yet.

On an additional note, we have company for the next week so I am going to let StatDad take it from here if he is willing.

colstat · Jan 29, 2011

so, what do you do next?

Minimizing cost function Statistics

Homework Help Overview

Discussion Character

Approaches and Questions Raised

Discussion Status

Contextual Notes

Homework Statement

Homework Equations

The Attempt at a Solution

Homework Statement

Homework Equations

The Attempt at a Solution

Similar threads

Distance between a Clock's hands when the distance is increasing most rapidly

Polar integral

Deriving spatial derivatives

Is this the correct general solution of the given PDE?

J_1(x) = (x^2/10)*(J_1(x) + J_3(x)) How to solve?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect