Finding or estimating confidence interval for populaion mean

Click For Summary
SUMMARY

The discussion centers on methods for estimating confidence intervals for the population mean, focusing on two algorithms presented by Koosis and Sanders. If the population standard deviation (sigma) is known, the exact confidence interval can be calculated using a normal distribution. If sigma is unknown and the population is normal, a Student's t distribution should be used, particularly with degrees of freedom calculated as n - 1. For sample sizes n ≥ 30, both algorithms suggest using the normal distribution, but Koosis emphasizes the importance of using sigma when known for accuracy.

PREREQUISITES
  • Understanding of confidence intervals and their significance in statistics
  • Familiarity with normal distribution and Student's t distribution
  • Knowledge of sample size implications on statistical analysis
  • Basic statistical terminology, including mean, standard deviation, and degrees of freedom
NEXT STEPS
  • Study the differences between normal distribution and Student's t distribution in depth
  • Learn how to calculate confidence intervals using R or Python libraries
  • Explore the implications of sample size on statistical inference
  • Investigate the concept of selective bias in sampling and its impact on results
USEFUL FOR

Statisticians, data analysts, researchers, and students seeking to understand confidence interval estimation methods and their applications in statistical analysis.

Rasalhague
Messages
1,383
Reaction score
2
From Koosis, I pieced together the following algorithm.

Is sigma known?

Yes? Then calculate the exact confidence interval using a normal distribution to estimate that of the sample means, with mean = the mean of sample means = the mean of the population, \mu_{\overline{x}}=\mu, and standard deviation \sigma_{\overline{x}}=\sigma/\sqrt{n}.

No? Then is the poplation normal?

Yes? Then (a) estimate the confidence interval with a Student's t distribution for the sample means, using degrees of freedom dof = n - 1, and standard deviation s_{\overline{x}}=s\sqrt{n}, or (b) for a slightly inferior result, and only if n\geq 30, estimate the confidence interval using the normal distribution with mean \mu_{\overline{x}}=\mu, and standard deviation s_{\overline{x}}=s\sqrt{n}.

No or don't know? Then is n\geq 30?

Yes? Then estimate the confidence interval, using a normal distribution to estimate that of the sample means, with mean \mu_{\overline{x}}=\mu, and standard deviation s_{\overline{x}}=s\sqrt{n}.

No? Then can't do.

*

But Sanders has the following, somewhat different algorithm.

Is n\geq 30?

Yes? Then use z values in computations.

No? Then are population values known to be normally distributed?

Yes? If the population standard deviation of the population is known, use z values in computations. Otherwise, use t values in computations.

No? Cannot use z or t values in computations.

*

Any comments on which is the best procedure? Actually Koosis presented the z test first, as if he, like Sanders, assumed that one would choose this over the t test wherever possible, even though he said it wasn't as good when both choices were possible. I wonder why z beats t in that case? Is it because the difference in accuracy is negligible then and the computations for t potentially more time consuming than those for z? (And if so, is this still the case with current software; both books are a few years old.)
 
Physics news on Phys.org
Hi Rasalhague! :smile:

The more information you use the more accurate the result.

If you know sigma beforehand, you have to use it to get the most accurate results.
However, in practice, sigma is often not known, so an estimate has to be made using a sample.
This is the reason the t-distribution has been thought up in the first place.

Whenever you use a sample to estimate the population mean or the variance, there is a significant risk on a selective bias, so whenever possible this should be avoided.

If your n is large enough the difference between the normal distribution and the t-distribution becomes negligible, so you can choose.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 22 ·
Replies
22
Views
4K
Replies
4
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
Replies
4
Views
2K