Help with proof of sample variance

  • Context: Graduate 
  • Thread starter Thread starter fadecomic
  • Start date Start date
  • Tags Tags
    Proof Variance
Click For Summary
SUMMARY

The discussion centers on the proof of the sample variance formula, specifically s² = (1/(n-1))Σ(y_i - μ_y)², as an unbiased estimator of the population variance σ². The user initially confuses the variance of the random variable y_i, which represents a draw from the population, with the sample variance. Upon clarification, it is established that since y_i is drawn from the population, its variance is indeed the population variance, not the sample variance. This distinction is crucial for understanding the unbiased nature of the sample variance estimator.

PREREQUISITES
  • Understanding of statistical concepts such as variance and unbiased estimators.
  • Familiarity with the notation and terminology used in statistics, including symbols like s² and σ².
  • Knowledge of the difference between population and sample statistics.
  • Basic proficiency in mathematical expectations and summations.
NEXT STEPS
  • Study the derivation of the sample variance formula in detail.
  • Learn about the properties of unbiased estimators in statistics.
  • Explore the concept of expected value and its application in variance calculations.
  • Investigate the differences between population parameters and sample statistics in statistical analysis.
USEFUL FOR

Statisticians, data analysts, and students studying statistics who seek to deepen their understanding of variance and its estimators.

fadecomic
Messages
10
Reaction score
0
Hi,

I'm trying to prove to myself that the formulation of sample variance as [tex]s^2=\frac{1}{n-1}\sum_{i=1}^{n}(y_i - \mu_y)^2[/tex] is an unbiased estimator of the population variance [tex]\sigma^2[/tex]. Of course, I proceed by checking the expected value of the sample variance, which flows smoothly until I get to [tex]\frac{n-1}{n}E[s^2] = E[y_i^2]-E[\mu_y^2][/tex]. Fine. Now, from the definition of variance: [tex]V[y_i^2]=E[y_i^2]-(E[y_i])^2[/tex]. Most of the sources I've checked insert the population variance here for [tex]V[y_i][/tex] (which is what you get to if you sub in the above). That make absolutely no sense to me. Shouldn't this be the sample variance since [tex]y_i[/tex] is the sample? Calling it the population variance seems circular to me. Can someone explain to me why this variance is the population variance and not the sample variance?

Thanks.
 
Physics news on Phys.org
Oh, never mind. I figured it out. Above, y_i is a random draw from the population, not the sample, so by definition, its variance is the population variance, not the sample variance.
 

Similar threads

Replies
1
Views
4K
Replies
5
Views
6K
Replies
1
Views
3K
  • · Replies 7 ·
Replies
7
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 28 ·
Replies
28
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K