| New Reply |
confidence interval: |
Share Thread | Thread Tools |
| Nov4-10, 05:17 PM | #1 |
|
|
confidence interval:
when we calculate a value, say
@95% we get 9.99+- .002 this value is describing a parameter of the true population right? it is not saying 95% of the time we get samples, it will fall in this range correct? We never infer anything about statistics, only parameters of populations right? I'm pretty sure that is correct, I just came across something that interprets this differently. |
| Nov5-10, 06:20 AM | #2 |
|
Recognitions:
|
The [tex] 9.99 \pm .002 [/tex] is an estimate of some parameter. The 95% is the confidence level , also called the confidence coefficient . We typically say that we have 95% confidence that the given confidence interval contains the parameter .
The construction of one of these intervals is but one use of a procedure that, in the long run, produces result that will fall around the parameter 95% of the time. it does not mean that 95% of the time we take samples anything will fall inside this particular interval. |
| Nov5-10, 07:34 AM | #3 |
|
Mentor
|
Confidence intervals are closely allied with the concept of statistical significance. Suppose that a finite sample size experiment yields 9.99 as an estimate of some parameter. Whether the true value of that parameter is 9.991, 9.992, 9.993, or 200 thousand, we don't know.
That 200 thousand value can easily be rejected if one has any idea of the underlying random process. That the true parameter value is 200 thousand will not be credible at any reasonable level of significance. How about a true value of 9.993 value? In this case, we can reject the possibility that the true value of the parameter exceeds 9.99+0.002 at the 5% (100%-95%) significance level. The purpose of statistical inference is to infer information about and from statistics. |
| Nov5-10, 08:06 AM | #4 |
|
Recognitions:
|
confidence interval:
The situation is even more complicated: When you repeat the experiment, you will in general get both a different estimate, e.g. 9.97 instead of 9.99 and different confidence interval e.g. 9.97+-0.003 instead of 9.99+-0.002. Of all this different confidence intervals from different experiments, 95% will cover the true value.
|
| Nov5-10, 12:00 PM | #5 |
|
Recognitions:
|
"These equations implicitly assume that the underlying data is normally distributed. What if it isn't? Those estimates might be very wrong if the random process is not normal. "
? The formula for variance doesn't assume normality at all - many distributions have a variance. Second - nothing is truly normally distributed - the normal distribution, like every distribution, is an idealized description of population behavior. the "test about the assumption" really indicates whether things differ enough from this idealized behavior to make the assumption of normality a poor one. |
| Nov5-10, 12:28 PM | #6 |
|
Mentor
|
[tex]s^2 = \frac 1 N \sum_{i=1}^N (x_i-\bar x)[/tex] is a biased estimator. Removing the bias leads to [tex]s^2 = \frac 1 {N-1} \sum_{i=1}^N (x_i-\bar x)[/tex] But this is still a biased estimator of the standard deviation. The UMVU estimator for one distribution will not be the same as the UMVU estimator for another. |
| Nov5-10, 12:36 PM | #7 |
|
Recognitions:
|
[tex] s^2 [/tex] is not an estimator of the standard deviation - you misspoke.
The sample variance (your second version) is unbiased for the population variance regardless of the distribution, as long as second moments exist. When you begin speaking about other properties then distributional assumptions may come forward: but that isn't the original comment: that comment was "the formula for variance doesn't assume normality at all" Also, remember it isn't simply distributional assumptions: even with normality, if [tex] \mu [/tex] is known [tex] s^2 [/tex] is no longer UMVU for [tex] \sigma^2 [/tex]. |
| Nov5-10, 12:49 PM | #8 |
|
Mentor
|
|
| Nov5-10, 01:05 PM | #9 |
|
Recognitions:
|
I understand all this, and I said you simply misspoke. I simply pointed out that
1) The formulae for variance and standard deviation do not depend on the assumption of normality 2) Properties of those estimators can change when the distributional assumptions change. The result you state about the sample standard deviation not being an unbiased estimator of [tex] \sigma [/tex] has nothing to do with normality either: it is because [tex] E\bigg[\sqrt{s^2}\bigg] \ne \sqrt{E[s^2]} [/tex] This isn't a big problem since [tex] \sqrt{\frac{n-2}2} \frac{\Gamma\left(\frac{n-1}2\right)}{\Gamma\left(\frac n 2\right)} s [/tex] is UMVU for [tex] \sigma [/tex] in the normally distributed setting. |
| New Reply |
| Thread Tools | |
Similar Threads for: confidence interval:
|
||||
| Thread | Forum | Replies | ||
| Confidence Interval | Calculus & Beyond Homework | 3 | ||
| Confidence interval | Calculus & Beyond Homework | 1 | ||
| confidence and prediction interval | Set Theory, Logic, Probability, Statistics | 1 | ||
| value of za/2 for 85% confidence interval? | Set Theory, Logic, Probability, Statistics | 0 | ||
| confidence interval | Set Theory, Logic, Probability, Statistics | 1 | ||