Minimizing cost function Statistics

Click For Summary

Homework Help Overview

The discussion revolves around minimizing a cost function in the context of statistics, specifically focusing on the expected value of random variables X and Y. The original poster presents a problem involving the minimization of E(Y-c)² and related expressions, with an emphasis on deriving properties of the expected value.

Discussion Character

  • Exploratory, Mathematical reasoning, Assumption checking

Approaches and Questions Raised

  • Participants explore the expansion of the expression E(Y-c)² and question how to take derivatives through the expectation operator. There are discussions about the linearity of expectation and how to properly apply it in this context. Some participants express uncertainty about the treatment of constants versus random variables in their calculations.

Discussion Status

Participants are actively engaging with the problem, attempting to clarify the mathematical steps involved in the minimization process. There is a mix of attempts to derive the necessary expressions and questions about the validity of certain manipulations. Some guidance has been offered regarding the properties of expectation, but no consensus has been reached on the next steps.

Contextual Notes

There is a noted lack of specification regarding the distribution of the random variables involved, which may affect the approach to the problem. Participants are also grappling with the implications of treating c as a constant versus a random variable in their calculations.

colstat
Messages
56
Reaction score
0

Homework Statement


Let X and Y be two unknown variables with E(Y)=\mu and EY2 < \infty.

Homework Equations


a. Show that the constant c that minimizes E(Y-c)2 is c=\mu.
b. Deduce that the random variable f(X) that minimizes E[(Y-f(X))2|X] is f(X)= E[Y|X].
c. Deduce that the random variable f(X) that minimizes E[(Y-f(X))2] is also f(X)= E[Y|X].

The Attempt at a Solution


a. E(Y-c)2 = E[Y2-2Y*c+c2]
other ideas: Can I do the derivative w.r.t. Y, set it equal to zero. But there is an expectation operator, how do you take expectation through the operator.
I can't use a posterior mean, the problem did not specify distr.of the r.v.

b, c. no ideas yet
 
Last edited:
Physics news on Phys.org
colstat said:

Homework Statement


Let X and Y be two unknown variables with E(Y)=\mu and EY2 < \infty.

Homework Equations


a. Show that the constant c that minimizes E(Y-c)2 is c=\mu.
b. Deduce that the random variable f(X) that minimizes E[(Y-f(X))2|X] is f(X)= E[Y|X].
c. Deduce that the random variable f(X) that minimizes E[(Y-f(X))2] is also f(X)= E[Y|X].

The Attempt at a Solution


a. E(Y-c)2 = E[Y2-2Y*c+c2]
other ideas: Can I do the derivative w.r.t. Y, set it equal to zero. But there is an expectation operator, how do you take expectation through the operator.
I can't use a posterior mean, the problem did not specify distr.of the r.v.

b, c. no ideas yet

For (a) Remember, the variable here is c, not Y. Start by using linearity to expand
E[Y2-2Y*c+c2] and remember that.
 
LCKurtz,
E[Y2-2Y*c+c2]=E[Y2]-2E[Y]E[c]+E[c2]],
is that what you meant by linearity to expand?
But how do you solve for c, really!

correction from my post: "how do you take derivative through operator"
 
colstat said:
LCKurtz,
E[Y2-2Y*c+c2]=E[Y2]-2E[Y]E[c]+E[c2]],
is that what you meant by linearity to expand?
But how do you solve for c, really!

correction from my post: "how do you take derivative through operator"

Close, but the way you have written the middle term leads me to believe you don't quite understand. The expected value operation is linear means that if X and Y are random variables and c is a constant then

1. E(X + Y) = E(X) + E(Y)

and

2. E(cX) = cE(X)

Use those two properties (carefully) on E[Y2-2Y*c+c2].

And remember, E(Y2) and μ = E(Y) are just numbers. You are trying to minimize a function of c, just like you maximized and minimized functions of x in calculus.
 
Yes, I know about the expectation part wait, I still don't get it. Do I use derivative to do it?
 
colstat said:
Yes, I know about the expectation part wait, I still don't get it. Do I use derivative to do it?

Show me the function of c that you got when you expanded it.
 
E[Y2-2Y*c+c2]
=E[Y2]-2E[Y]E[c]+E[c2]]
=E[Y2]-2cE[Y]+c2
=E[Y2]-2c\mu+c2
 
OK, so, if you have

<br /> E[(Y-\mu)^2] = E[Y^2-2c\mu + c^2<br />

as a function of c, how would you minimize it?
 
colstat said:
E[Y2-2Y*c+c2]
=E[Y2]-2E[Y]E[c]+E[c2]]
=E[Y2]-2cE[Y]+c2
=E[Y2]-2c\mu+c2

While I wouldn't quibble with the result, my objection to your expansion is your writing

E(-2Yc) = -2E(Y)E(c)

with no additional commentary. It looks like you are treating c as a random variable and using the "fact" that E(XY) = E(X)E(Y), which is not generally true and certainly not one of the two linearity properties. Perhaps you understand what you are doing at that step, but you haven't convinced me of it yet.

On an additional note, we have company for the next week so I am going to let StatDad take it from here if he is willing.
 
  • #10
so, what do you do next?
 

Similar threads

  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
Replies
3
Views
2K
Replies
10
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 21 ·
Replies
21
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K