I Standard Deviation Versus Sample Size & T-Distribution

Click For Summary
The discussion centers on the relationship between standard deviation, sample size, and the t-distribution. As the degree of freedom increases with larger sample sizes, the standard deviation of the t-distribution decreases, leading to more accurate estimates of the population standard deviation. Using the sample mean and dividing by n can result in an underestimation of the population standard deviation, while using (n-1) applies Bessel's correction to provide a better estimate. Additionally, the population standard deviation calculated using n-1 is still considered biased due to Jensen's inequality, complicating the search for an unbiased estimator. Overall, understanding these nuances is crucial for accurate statistical analysis.
OpheliaM
Messages
7
Reaction score
1
I don't understand why does the standard deviation of a t-Distribution decreases as the degree of freedom (and, thus, also the sample size) increases when the sample standard deviation underestimates the population standard deviation?
 
Physics news on Phys.org
OpheliaM said:
I don't understand why does the standard deviation of a t-Distribution decreases as the degree of freedom (and, thus, also the sample size) increases
More data tends to give a more accurate estimate of the true population standard deviation.
when the sample standard deviation underestimates the population standard deviation?
The sample standard deviation underestimates the population standard deviation if you use the sample mean and divide by n. If you use the true population mean and divide by n or use the sample mean and divide by (n-1) that is not true.(CORRECTION: it is still under-estimated. See @Number Nine 's post below) For the degree of the t-distribution, you should use the n or (n-1) that you divided by.

PS. Just to be more clear. The sample mean should always be the sum of the sample divided by n. When I say "use the sample mean and divide by (n-1)", I mean that the sum of squares of deviations from the sample mean are divided by (n-1). That is Bessel's correction. (see https://en.wikipedia.org/wiki/Bessel's_correction )
 
Last edited:
FactChecker said:
More data tends to give a more accurate estimate of the true population standard deviation.The sample standard deviation underestimates the population standard deviation if you use the sample mean and divide by n. If you use the true population mean and divide by n or use the sample mean and divide by (n-1) that is not true. For the degree of the t-distribution, you should use the n or (n-1) that you divided by.

PS. Just to be more clear. The sample mean should always be the sum of the sample divided by n. When I say "use the sample mean and divide by (n-1)", I mean that the sum of squares of deviations from the sample mean are divided by (n-1). That is Bessel's correction. (see https://en.wikipedia.org/wiki/Bessel's_correction )

A minor point: the "population standard deviation" (i.e. the square root of the sum of squared deviations from the mean, divided by n-1) is actually a biased estimate of the standard deviation. This follows from Jensen's inequality, since the square root is a concave function. It's fairly difficult to find an unbiased estimator of a normal standard deviation, and the corrections have no closed form -- see https://en.wikipedia.org/wiki/Unbia...deviation#Results_for_the_normal_distribution
 
  • Like
Likes FactChecker
Number Nine said:
A minor point: the "population standard deviation" (i.e. the square root of the sum of squared deviations from the mean, divided by n-1) is actually a biased estimate of the standard deviation. This follows from Jensen's inequality, since the square root is a concave function. It's fairly difficult to find an unbiased estimator of a normal standard deviation, and the corrections have no closed form -- see https://en.wikipedia.org/wiki/Unbia...deviation#Results_for_the_normal_distribution
I stand corrected. Thanks. I will correct my prior post.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
Replies
4
Views
3K
  • · Replies 24 ·
Replies
24
Views
5K
  • · Replies 5 ·
Replies
5
Views
3K
Replies
18
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
31
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 28 ·
Replies
28
Views
3K