Formula For Confidence Interval of Two Samples With Unequal Variances

Click For Summary

Discussion Overview

The discussion centers around the formula for calculating the confidence interval for the mean difference of two independent samples with unequal variances, particularly in relation to the t-test. Participants seek clarification on the variables involved and the calculation of standard error, as well as the relationship between confidence intervals and t-test results.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant requests online sources that define all variables and explain how to calculate standard error for the confidence interval related to the t-test for two independent samples with unequal variances.
  • Another participant suggests combining the two samples using a weighted average based on inverse variances, providing formulas for the mean and variance of the combined samples.
  • A participant reiterates the request for clarification on the confidence interval, specifically asking if it pertains to the difference of sample means.
  • A later reply clarifies that the confidence interval in question is indeed for the mean difference when population variances are unknown, and provides the formula for the t-score in this context.
  • One participant mentions that confidence intervals are multiples of the standard deviation, assuming Gaussian distributions.
  • Another participant, reflecting on their past experience, agrees with the earlier claims and states that the variance of the difference in sample means equals the sum of the two sample variances, leading to a specific formula for the 95% confidence interval.

Areas of Agreement / Disagreement

There is no consensus on the best sources for the requested information, and participants express differing views on the calculation methods and interpretations of the confidence interval. The discussion remains unresolved regarding the optimal approach to defining and calculating the confidence interval.

Contextual Notes

Some participants express uncertainty about the formulas and calculations, and there are references to assumptions about Gaussian distributions that may not be universally applicable. The discussion includes varying levels of familiarity with the topic, which may affect the clarity of the contributions.

Soaring Crane
Messages
461
Reaction score
0
Are there any online sources that would note (and define all variables, especially on how to calculate standard error) in the formula for calculating the confidence interval, a complement to the t-test, of two independent samples with unequal variances? (I want to see exactly how software/graphing calculators are giving me upper and lower limit values.)

Thank you.
 
Physics news on Phys.org
You want to combine those two samples?
Let ##\mu_i##, ##\sigma_i^2## be the mean and variance of measurement i.
The optimal combination can be obtained as a weighted average with the inverse variance as weights:

$$\mu = \frac{1}{\sigma_1^{-2}+\sigma_2^{-2}}\left(\frac{\mu_1}{\sigma_1^2} +\frac{\mu_2}{\sigma_2^2}\right)$$
$$\sigma^2 = \frac{1}{\sigma_1^{-2}+\sigma_2^{-2}}$$

This is just regular error propagation (assuming Gaussian distributions), and the weights are chosen to minimize ##\sigma##. You can check this yourself, if you like, there is no need to look for any reference. I did not check the second formula, but I think it is correct.

If you want to compare the samples, calculate the difference, and see if it is compatible with 0.
 
Last edited:
Soaring Crane said:
Are there any online sources that would note (and define all variables, especially on how to calculate standard error) in the formula for calculating the confidence interval, a complement to the t-test, of two independent samples with unequal variances? (I want to see exactly how software/graphing calculators are giving me upper and lower limit values.)

Thank you.

Confidence interval for what? Difference of the sample means? It could be anything.
 
ImaLooser,

I apologize. It would be the confidence interval for the mean difference (difference of the sample means) in the case where the population variances are unknown. For example, I know the formula for calulating the t-score for the t-test with unequal variances:

t = (mean difference)/(standard error),

where standard error = sqrt{[(s_1)^2/(n_1)^2] + [(s_2)^2/(n_2)^2]}

s = standard deviation
n = sample size

I wanted the formula for the confidence interval as a check for the t-test results.

Thank you.
 
Those confidence intervals are just multiples of the standard deviation - again, assuming Gaussian distributions.
 
Soaring Crane said:
ImaLooser,

I apologize. It would be the confidence interval for the mean difference (difference of the sample means) in the case where the population variances are unknown. For example, I know the formula for calulating the t-score for the t-test with unequal variances:

t = (mean difference)/(standard error),

where standard error = sqrt{[(s_1)^2/(n_1)^2] + [(s_2)^2/(n_2)^2]}

s = standard deviation
n = sample size

I wanted the formula for the confidence interval as a check for the t-test results.

Thank you.

I haven't done any of this for over fifteen years, but I believe you are correct. The variance of the difference in sample means is equal to the sum of the two sample variances. Take the square root to get the standard error. A 95% confidence interval is then [u - 1.97s, u + 1.97s] where u is the difference in sample means.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 22 ·
Replies
22
Views
4K
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 4 ·
Replies
4
Views
5K