Separate tests vs Simultaneous tests

  • Context: Graduate 
  • Thread starter Thread starter FallenApple
  • Start date Start date
Click For Summary

Discussion Overview

The discussion revolves around the advantages and disadvantages of testing multiple hypotheses separately versus simultaneously, particularly in the context of statistical analysis of the effects of a medication across different genders. Participants explore the implications of using paired t-tests and simultaneous inference, as well as the role of domain knowledge in determining the order of tests.

Discussion Character

  • Debate/contested
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant questions the advantages of testing hypotheses separately compared to simultaneously, using a medication example to illustrate the scenario.
  • Another participant notes that testing separately requires correction for multiple comparisons, which can lead to incorrect raw p-values, while simultaneous testing automatically corrects for this.
  • A participant suggests that multiple regression might be a preferable method, although they acknowledge that results should be similar regardless of the approach.
  • Some participants emphasize the importance of domain knowledge in deciding the order of tests, particularly when hypotheses are dependent.
  • There is a suggestion that if the treatment effect differs across genders, it may be more efficient to focus on the differences first rather than conducting all tests.
  • Participants discuss the importance of constructing hypotheses based on the significance of the facts being tested, indicating a structured approach to hypothesis testing.
  • One participant expresses familiarity with regression and questions whether it resolves the ordering issue, noting that regression coefficients are interpreted with other variables held constant.

Areas of Agreement / Disagreement

Participants express differing views on the best approach to hypothesis testing, with some advocating for simultaneous testing due to automatic corrections, while others highlight the importance of domain knowledge and the context of the tests. The discussion remains unresolved regarding the optimal method for testing multiple hypotheses.

Contextual Notes

Participants mention the need for corrections in multiple comparisons and the potential influence of dependency among hypotheses, but do not resolve these complexities or provide definitive solutions.

FallenApple
Messages
564
Reaction score
61
Ok so let's say I have multiple hypothesis that I want to test. Is there an advantage to testing them separately compared to all at once?

Here is an example. Say there's a medication. We want to see how it affects males, how it affects females, and if the effect of the medication differs across gender.

Say I record baseline health and health after the treatment for each individual.

I could do a paired t test for the females. Then a paired t test for the males. Then I can do a two sample t test on the differences between males and females.

Or I can do them all at once using simultaneous inference.

Is there any draw backs for one of them vs the other?
 
Physics news on Phys.org
If you do them separately then you need to correct for multiple comparisons, and the raw p-values will be wrong. If you do them together then they will automatically be corrected.
 
  • Like
Likes   Reactions: FallenApple
Dale said:
If you do them separately then you need to correct for multiple comparisons, and the raw p-values will be wrong. If you do them together then they will automatically be corrected.

Ah ok. So then the idea is to use multiple regression?
 
I prefer it, but in the end it should come out pretty similar.
 
Hey FallenApple.

If they are dependent then order should be based on knowledge of what's important to test first or later on.

If they are independent then you can test them in any order.

The choice for doing so requires a bit of knowledge about both statistics and domain level knowledge.
 
  • Like
Likes   Reactions: FallenApple
chiro said:
Hey FallenApple.

If they are dependent then order should be based on knowledge of what's important to test first or later on.

If they are independent then you can test them in any order.

The choice for doing so requires a bit of knowledge about both statistics and domain level knowledge.

So for example, say that in reality the difference of treatment differs across males but that it doesn't for females. And if we suspect this, then it would be wise to just do the 3rd test, that is looking a the differences of differences first. Because doing the other two would be a waste.
 
You need to take into account what's called domain knowledge.

For example - let's say you know fact A and that leads you to think that you should do test H0 and HA because it's critical that you make sure fact A doesn't hold [or does]. Then you would do fact A.

You think about the things you know and the order they are tested in and you construct your hypotheses in the order of the facts being important to least important.

Usually though when you have multiple variables involved you do what is called a regression and you can look at specific estimates to answer things on multiple variables.

Have you ever done regression along with correlation and co-variance?
 
chiro said:
You need to take into account what's called domain knowledge.

For example - let's say you know fact A and that leads you to think that you should do test H0 and HA because it's critical that you make sure fact A doesn't hold [or does]. Then you would do fact A.

You think about the things you know and the order they are tested in and you construct your hypotheses in the order of the facts being important to least important.

Usually though when you have multiple variables involved you do what is called a regression and you can look at specific estimates to answer things on multiple variables.

Have you ever done regression along with correlation and co-variance?
Yes I'm familiar with regression, though not an expert at it. From what I know, multiple regression allows you to adjust for many variables at once. Does that take care of the ordering issue? Because in regression outputs, the coefficient is interpretated as the others held constant.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
Replies
20
Views
3K
  • · Replies 30 ·
2
Replies
30
Views
4K
Replies
1
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 16 ·
Replies
16
Views
4K
  • · Replies 20 ·
Replies
20
Views
4K