Undergrad Standard Deviation Versus Sample Size & T-Distribution

Click For Summary
SUMMARY

The standard deviation of a t-distribution decreases as the degrees of freedom increase, which correlates with an increase in sample size. This phenomenon occurs because larger sample sizes provide a more accurate estimate of the true population standard deviation. When calculating the sample standard deviation, using the sample mean and dividing by n leads to underestimation, while dividing by (n-1) applies Bessel's correction, which adjusts for bias. The population standard deviation, calculated as the square root of the sum of squared deviations from the mean divided by (n-1), is inherently a biased estimate due to Jensen's inequality.

PREREQUISITES
  • Understanding of t-distribution and its properties
  • Knowledge of sample mean and sample standard deviation calculations
  • Familiarity with Bessel's correction
  • Basic concepts of bias in statistical estimators
NEXT STEPS
  • Study the implications of Bessel's correction in statistical analysis
  • Learn about unbiased estimators for standard deviation in normal distributions
  • Explore the relationship between sample size and statistical accuracy
  • Investigate Jensen's inequality and its effects on statistical estimations
USEFUL FOR

Statisticians, data analysts, and researchers who require a deeper understanding of t-distribution properties and the implications of sample size on statistical estimations.

OpheliaM
Messages
7
Reaction score
1
I don't understand why does the standard deviation of a t-Distribution decreases as the degree of freedom (and, thus, also the sample size) increases when the sample standard deviation underestimates the population standard deviation?
 
Physics news on Phys.org
OpheliaM said:
I don't understand why does the standard deviation of a t-Distribution decreases as the degree of freedom (and, thus, also the sample size) increases
More data tends to give a more accurate estimate of the true population standard deviation.
when the sample standard deviation underestimates the population standard deviation?
The sample standard deviation underestimates the population standard deviation if you use the sample mean and divide by n. If you use the true population mean and divide by n or use the sample mean and divide by (n-1) that is not true.(CORRECTION: it is still under-estimated. See @Number Nine 's post below) For the degree of the t-distribution, you should use the n or (n-1) that you divided by.

PS. Just to be more clear. The sample mean should always be the sum of the sample divided by n. When I say "use the sample mean and divide by (n-1)", I mean that the sum of squares of deviations from the sample mean are divided by (n-1). That is Bessel's correction. (see https://en.wikipedia.org/wiki/Bessel's_correction )
 
Last edited:
FactChecker said:
More data tends to give a more accurate estimate of the true population standard deviation.The sample standard deviation underestimates the population standard deviation if you use the sample mean and divide by n. If you use the true population mean and divide by n or use the sample mean and divide by (n-1) that is not true. For the degree of the t-distribution, you should use the n or (n-1) that you divided by.

PS. Just to be more clear. The sample mean should always be the sum of the sample divided by n. When I say "use the sample mean and divide by (n-1)", I mean that the sum of squares of deviations from the sample mean are divided by (n-1). That is Bessel's correction. (see https://en.wikipedia.org/wiki/Bessel's_correction )

A minor point: the "population standard deviation" (i.e. the square root of the sum of squared deviations from the mean, divided by n-1) is actually a biased estimate of the standard deviation. This follows from Jensen's inequality, since the square root is a concave function. It's fairly difficult to find an unbiased estimator of a normal standard deviation, and the corrections have no closed form -- see https://en.wikipedia.org/wiki/Unbia...deviation#Results_for_the_normal_distribution
 
  • Like
Likes FactChecker
Number Nine said:
A minor point: the "population standard deviation" (i.e. the square root of the sum of squared deviations from the mean, divided by n-1) is actually a biased estimate of the standard deviation. This follows from Jensen's inequality, since the square root is a concave function. It's fairly difficult to find an unbiased estimator of a normal standard deviation, and the corrections have no closed form -- see https://en.wikipedia.org/wiki/Unbia...deviation#Results_for_the_normal_distribution
I stand corrected. Thanks. I will correct my prior post.
 
If there are an infinite number of natural numbers, and an infinite number of fractions in between any two natural numbers, and an infinite number of fractions in between any two of those fractions, and an infinite number of fractions in between any two of those fractions, and an infinite number of fractions in between any two of those fractions, and... then that must mean that there are not only infinite infinities, but an infinite number of those infinities. and an infinite number of those...

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 28 ·
Replies
28
Views
3K
  • · Replies 31 ·
2
Replies
31
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
1K