Tests for Difference in Mean VERSUS Tests for Difference in Median

Click For Summary

Discussion Overview

The discussion revolves around the appropriate use of statistical tests for comparing two sample observations, specifically focusing on when to use tests for differences in mean versus tests for differences in median. The scope includes theoretical considerations and practical applications in statistics.

Discussion Character

  • Debate/contested
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • Some participants propose using a test for the difference in mean if both samples are normally distributed, while suggesting a test for median if the normality assumption is not met.
  • One participant emphasizes that the choice of test depends on the assumptions made and the specific context of the analysis, indicating that there is no one-size-fits-all answer.
  • Concerns are raised about the small sample size (16 observations total, 8 per category), with suggestions that Bayesian techniques might be more appropriate given the limited data.
  • There is mention of using a t-test if the data is normally distributed with an unknown population variance, along with considerations regarding the equality of variances and the potential use of a paired t-test if observations are linked.
  • Participants discuss the importance of checking normality assumptions, with the Shapiro-Wilk test highlighted as a common method for this purpose.
  • One participant suggests that if the assumptions for standard tests are not met, more complex tests may be necessary, though they express uncertainty about the specifics of these alternatives.
  • There is a call for further context regarding the statistical background of the original poster to better tailor advice to their situation.

Areas of Agreement / Disagreement

Participants express differing views on the appropriate tests to use based on the distribution of the data and the sample size. There is no consensus on a definitive approach, as various conditions and assumptions are acknowledged.

Contextual Notes

Limitations include the small sample size and the dependence on the assumptions of normality and variance equality. The discussion also highlights the need for further information about the data and the context of the analysis.

number0
Messages
102
Reaction score
0
Sup everyone,

Assume I have two sample observations.

I am wondering when I should use a test for a difference in mean and when I should use a test for a difference in median.

Should I test for mean if both the distribution of both the samples are normal?
Should I test for median otherwise?

I am confused!

Any help would be greatly appreciated.
 
Last edited:
Physics news on Phys.org
number0 said:
Sup everyone,

Assume I have two samples.

I am wondering when I should use a test for a difference in mean and when I should use a test for a difference in median.

Should I test for mean if both the distribution of both the samples are normal?
Should I test for median otherwise?

I am confused!

Any help would be greatly appreciated.

If you have sufficient evidence that your underlying distribution has a particular distribution of the Normal variety, it would probably be better to use a sampling distribution of the mean, which in your case is also a normal distribution.

There is no silver bullet answer for your problem because it depends on the assumptions you have and what you are trying to do: it's not completely a plug and chug mechanical process.

Also you have to realize that your sample size is not great no matter what test you are doing. When you have a low amount of samples like you have, its probably be better to use prior information techniques like those found in Bayesian statistics.

Also I think I have misunderstood you: when you say two 'samples' do you mean two distinct collections of observations or do you mean two observations only? If its the first answer, how many observations in each sample?
 
chiro said:
If you have sufficient evidence that your underlying distribution has a particular distribution of the Normal variety, it would probably be better to use a sampling distribution of the mean, which in your case is also a normal distribution.

There is no silver bullet answer for your problem because it depends on the assumptions you have and what you are trying to do: it's not completely a plug and chug mechanical process.

Also you have to realize that your sample size is not great no matter what test you are doing. When you have a low amount of samples like you have, its probably be better to use prior information techniques like those found in Bayesian statistics.

Also I think I have misunderstood you: when you say two 'samples' do you mean two distinct collections of observations or do you mean two observations only? If its the first answer, how many observations in each sample?

Oh, my apologies! I meant to say that I have two distinct collections of observations (16 observations total and 8 observation per each category). Other than that information (aside from the actual data), I am not given any conditions to work with.
 
number0 said:
Oh, my apologies! I meant to say that I have two distinct collections of observations (16 observations total and 8 observation per each category). Other than that information (aside from the actual data), I am not given any conditions to work with.

Well above you mentioned an assumption namely that your data is normally approximated.

If the data is (or is approximately) normally distributed with an unknown population variance, a good test would be to use a t-test.

Now again, more assumptions enter the equation. If the two variances are not statistically significant you would use a pooled variance. If not, you don't. Also if the different processes are linked (or are thought to be anyway) in a kind of "cause-effect" manner between pairs of observations, then you would consider a paired t-test.

All of the above tests also have distributional assumptions, and if these are not met, you may need to use tests that are lot more complicated.

If you want to test normal approximations, there are different tests for this but the main test is the Shapiro-Wilk test. Any decent statistical software package will do this very easily and quickly.

There are other tests, but I am a) not familiar with them and b) don't understand enough about their differences to give specific advice.

For this problem, I would check normality assumptions for both samples and then do a two-sample t-test. In saying this, if you want to draw conclusions that are statistically significant and useful, I would take a bit of time to either learn the statistics or to ask a statistician for advice.

If this is for some kind of research, I strongly recommend you get some advice. If this is a homework question, I would be interested in telling us what course this is in and what statistical background you have so that I can put your problem into the proper context.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
Replies
20
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 4 ·
Replies
4
Views
22K