Discussion Overview
The discussion focuses on the methods and challenges of performing hypothesis testing for multivariate data, particularly in the context of binomial distributions and joint distributions. Participants explore the implications of independence and the appropriate statistical techniques for testing hypotheses involving multiple variables.
Discussion Character
- Exploratory
- Technical explanation
- Debate/contested
- Mathematical reasoning
Main Points Raised
- One participant inquires about the p-value for a pair of binomial draws (7,8) under the null hypothesis q=1/2, noting that individual p-values do not provide sufficient evidence to reject the null hypothesis.
- Another participant suggests that understanding the sampling distribution for n=2 and q=0.5 is essential for hypothesis testing, referencing a specific source for further clarification.
- There is a discussion about the necessity of knowing the joint distribution for a joint test of two variables, with a mention of the relationship F(x,y)=F(x)F(y) for independent identically distributed (iid) variables.
- A participant questions the appropriateness of using the joint distribution alone, arguing that it would lead to a probability of O(1/2^n) for independent random variables, and suggests that the Kolmogorov-Smirnov distance might be useful for independent samples.
- Another participant elaborates on the limitations of using the cumulative distribution function (cdf) alone for p-value calculations, proposing that the multivariate generalization of the Kolmogorov-Smirnov statistic could be applicable, though it would require complex calculations and possibly Monte-Carlo simulations.
- There is a query regarding the procedure for testing a sample from a multivariate normal distribution when the correlation between variables is considered, particularly in large sample sizes.
Areas of Agreement / Disagreement
Participants express differing views on the appropriate methods for hypothesis testing in multivariate contexts, particularly regarding the use of joint distributions and the implications of independence. The discussion remains unresolved with multiple competing perspectives presented.
Contextual Notes
Participants highlight limitations related to the assumptions of independence, the complexity of calculating critical values for multivariate tests, and the challenges posed by non-independent samples. There is also an acknowledgment of the lack of closed forms for certain multivariate distributions.