Expected value of sample variance

Click For Summary

Discussion Overview

The discussion revolves around the expected value of sample variance, specifically the derivation of the equation E(S²) = (n-1)/n σ² from the sample variance formula S² = (1/n) Σ(Xᵢ - X̄)². Participants seek clarification on the steps involved in this derivation and the implications of using n versus n-1 in the denominator.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant requests clarification on the derivation of the expected value of sample variance from the sample variance equation.
  • Another participant notes that the conventional sample variance divides by n-1 to avoid bias, suggesting that the definition on the referenced page is specific to that context.
  • A participant explains that the use of n-1 relates to the sample mean, not the theoretical mean, and provides a mathematical breakdown of the expected value calculation.
  • Further elaboration includes a detailed step-by-step derivation leading to the conclusion that E(S²) = (n-1)/n σ².
  • One participant expresses gratitude for the clarification but indicates some steps remain unclear, prompting further discussion on specific calculations.
  • A later reply addresses a question about deriving E(X²) = σ² + μ² from the variance definition.

Areas of Agreement / Disagreement

Participants generally agree on the mathematical principles involved, but there are differing views on the implications of using n versus n-1 in the sample variance calculation. The discussion remains unresolved regarding the clarity of certain steps in the derivation.

Contextual Notes

Some steps in the derivation are noted to be unclear to participants, indicating potential limitations in understanding the mathematical transitions involved.

Who May Find This Useful

Readers interested in statistical estimation, variance calculations, and the mathematical foundations of sample statistics may find this discussion beneficial.

musicgold
Messages
303
Reaction score
19
Hi,

My question is related to this web page. http://en.wikipedia.org/wiki/Estimator_bias

In the Examples section, note the equation for the expected value of sample variance.

[tex]{E}(S^2)=\frac{n-1}{n} \sigma^2[/tex]


Could anybody please show me the steps to go from the sample variance equation (given below) to the above equation?

[tex]S^2=\frac{1}{n}\sum_{i=1}^n(X_i-\overline{X}\,)^2[/tex]


Thanks

MG.
 
Physics news on Phys.org
well, that "sample variance" was defined for the purposes of that page. The usual sample variance divides by n-1 instead of by n, so it is not biased. This page includes a derivation of that fact.
 
The essential point for the use of n-1 rather than n is that the sample variance makes use of the sample mean, not the theoretical mean.

Specifically, let x be one sample, m the theoretical mean and a the statistical average.
Then E(x-a)2=E(x-m+m-a)2=E(x-m)2+E(m-a)2+2E((x-m)(m-a)).
When you plow through the details, the factor shows up.
 
Thanks folks. However, my question is not about the use of n-1 in the denominator. I understand the concept of the degrees of freedom.

I wish to know the operations/steps I need to perform on the Sample Variance equation to get the expected value equation.

Thanks again,

MG.
 
I gave you the answer.
 
Is this what you're looking for?

First consider (I'll bring in the 1/n later)

[tex] \sum (x_i - \bar x)^2 = \sum x_i^2 - n\bar{x}^2[/tex]

The expected value of this expression is

[tex] \begin{align*}<br /> E\left(\sum(x_i - \bar x^2)^2\right) &= \sum E(x_i^2) - n E\left( \bar{x}^2\right)\\<br /> & = \sum \left(\mu^2 + \sigma^2\right) - n \frac 1 {n^2} \left(\sum E(x_i^2) + \sum_{i<j} x_i x_j \right) \\<br /> & = n\mu^2 + n \sigma^2 - \frac 1 n \left( n \mu^2 + n \sigma^2 + n(n-1) \mu^2 \right) \\<br /> & = n\mu^2 + n \sigma^2 - \mu^2 - \sigma^2 - (n-1) \mu^2 \\<br /> & = n\mu^2 + n\sigma^2 - n \mu^2 - \sigma^2 \\<br /> & = (n-1) \sigma^2<br /> \end{align*}[/tex]

Now
[tex] \begin{align*}<br /> S^2 & = \frac 1 n \sum (x_i - \bar{x})^2) \\<br /> E(S^2) & = \frac 1 n E\left(\sum (x_i - \bar{x}^2) \right) \\<br /> & = \left(\frac 1 n \right) (n-1) \sigma^2 = \frac{n-1} n \sigma^2<br /> \end{align*}[/tex]

and from this last line we see that in order to obtain an unbiased estimate of [tex]\sigma^2[/tex], the maximum likelihood (for normal distributions) estimator [tex]S^2[/tex] needs to be multiplied by (n)/(n-1) to get

[tex] \frac 1 {n-1} \sum (x_i - \bar{x})^2)[/tex]
 
Statdad

I am not clear about just one step.

How do I get

[tex](\left(\mu^2 + \sigma^2\right)[/tex] from [tex]E (x_i^2)[/tex]Thanks

MG.

P.S. How do you manage to write so many equations efficiently using LaTex? Do you have an advanced editor?
 
First:
Since
[tex] Var(X) = \sigma^2 = E(X - \mu)^2 = E(X^2) - \mu^2[/tex]

a simple re-arrangement gives

[tex] E(X^2) = \sigma^2 + \mu^2[/tex]

Second question: if you want to have several equations nicely aligned inside a display, use the \begin{align*} and \end{align*} pair inside the tex delimiters. Without the tex info, if i have

f(x) & = x^2 + 5x + 6 \\
& = (x+3)(x+2)

inside the delimiters, the compiled result is

[tex] \begin{align*}<br /> f(x) & = x^2 + 5x + 6 \\<br /> & = (x+3)(x+2)<br /> \end{align*}[/tex]

* the "&" sign causes the equations to be aligned at the start of the next symbol ("=" in my
example)
* the "\\" terminates a line and tells tex to begin a new line

If you click on any displayed formula you should see, in a pop-up window, the underlying code.

Edited to note: some older tex manuals will discuss the use of the "eqarray" (I think I have the name correct, but since I don't use it I'm not going to claim 100% accuracy here) environment for doing what I've done
with align*. Don't use eqarray - the spacing is (to state it as nicely as possible) horrific.
 
  • #10
Statdad,

Thanks a lot. I really appreciate your help.

Also,

[tex] \begin{align*}<br /> <br /> Var (X) & = E [ X - E (X) ]^2 \\<br /> & = E [ X^2 - 2X E(X) + E(X)^2] \\<br /> & = E(X^2) - 2 E(X) E(X) + E(X)^2 \\<br /> & = E(X^2) - 2 E(X)^2 + E(X)^2 \\<br /> & = E (X^2) - E(X)^2.\end{align*}<br /> [/tex]
 
Last edited:

Similar threads

  • · Replies 42 ·
2
Replies
42
Views
6K
Replies
1
Views
4K
  • · Replies 28 ·
Replies
28
Views
3K
Replies
5
Views
6K
  • · Replies 7 ·
Replies
7
Views
6K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 14 ·
Replies
14
Views
2K