Confidence Intervals: t-distribution or normal distribution?

Richard_R · Oct 19, 2010

Hi all,

When working out confidence intervals based on population samples are you supposed to always use t-distributions, standard normal (z) distributions, or do you make a choice based on the sample size?

Up until now I've been lucky enough to have large sample sizes (for some work I'm doing) so have been using the z-distribution. However I now have some data sets which range from n=1 (lol) to n=29 so am not sure if I should now be using t-distributions to define confidence intervals, or how I'd make that decision (e.g. use t-distribution if n<30, for example?)

Thanks
-Rob

SW VandeCarr · Oct 22, 2010

Richard_R said:

Hi all,

When working out confidence intervals based on population samples are you supposed to always use t-distributions, standard normal (z) distributions, or do you make a choice based on the sample size?
Thanks
-Rob

Assuming the normal assumption is valid, the general rule is to use the t-distribution to calculate confidence intervals where the number of degrees of freedom (df=n-1) is less then 30, The Z and t scores are similar around this value. Skewed data, particularly in small samples, make CIs fairly useless. In larger samples, normalizing transformations can be useful for constructing CIs..

statdad · Oct 22, 2010

Actually the notion of using the sample size as the determining factor is being (as it should be) tossed out. It is a remnant of the days before computing power was so readily available.

IF the assumption of normality can be made, when you know [tex]\sigma[/tex] (population standard deviation) use the Z-interval. When you don't know sigma (so you have only the sample standard deviation) use the t-interval.

If your data is badly skewed, it is debatable whether the mean is the appropriate parameter to measure central tendency.

SW VandeCarr · Oct 22, 2010

statdad said:

Actually the notion of using the sample size as the determining factor is being (as it should be) tossed out. It is a remnant of the days before computing power was so readily available.

IF the assumption of normality can be made, when you know [tex]\sigma[/tex] (population standard deviation) use the Z-interval. When you don't know sigma (so you have only the sample standard deviation) use the t-interval.

If your data is badly skewed, it is debatable whether the mean is the appropriate parameter to measure central tendency.

Well I am retired and involved in other things, but I have researched the t distribution recently and I've not run across this. However, my research was mostly on the math and not the application.

What you say makes sense. Would you use the Z value for very small samples, say n=5, if you did know sigma?

EDIT: In most of my experience sigma is not known.

statdad · Oct 22, 2010

If the sample size is only 5 i would be hesitant to do any confidence interval but, if pushed, if sigma were known, and if told that the data were known to be normally distributed, the Z-interval would be appropriate.

Confidence Intervals: t-distribution or normal distribution?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

Undergrad My basic understanding of set theory

Undergrad The problem of points

Graduate Expected numbers of cards of a last color remaining

Undergrad How does axiom of foundation prevent infinite sequence of elements?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect