Multivariate hypothesis testing

Click For Summary
SUMMARY

Multivariate hypothesis testing involves evaluating the joint distribution of multiple variables, particularly when dealing with independent and identically distributed (iid) samples from a binomial distribution, such as Bin(10,q). In the discussed scenario, individual p-values for samples X1=7 and X2=8 under the null hypothesis H0:q=1/2 are approximately 0.172 and 0.055, respectively, indicating insufficient evidence to reject the null hypothesis at the 95% confidence level. For joint testing, understanding the joint distribution F(x,y) is crucial, and the Kolmogorov-Smirnov distance can be employed for independent random variables, although calculating critical values may necessitate Monte-Carlo simulations.

PREREQUISITES
  • Understanding of binomial distributions, specifically Bin(10,q)
  • Knowledge of p-value calculations and hypothesis testing
  • Familiarity with joint distributions and their properties
  • Experience with the Kolmogorov-Smirnov statistic and Monte-Carlo simulations
NEXT STEPS
  • Research the application of the Kolmogorov-Smirnov test in multivariate contexts
  • Explore Monte-Carlo simulation techniques for hypothesis testing
  • Learn about multivariate normal distributions and their properties
  • Study methods for calculating joint p-values in multivariate hypothesis testing
USEFUL FOR

Statisticians, data scientists, and researchers involved in multivariate analysis and hypothesis testing, particularly those working with binomial distributions and joint distributions.

bpet
Messages
531
Reaction score
7
How is hypothesis testing performed for multivariate data?

Say for simplicity we have two iid draws from a binomial distribution Bin(10,q) with X1=7, X2=8. Under the null hypothesis H0:q=1/2, the individual p-values (as one-tail probabilities) are approximately 0.172 and 0.055 respectively so neither data point is sufficient evidence on its own to reject the null at the 95% confidence level. What would be the p-value for the pair (7,8) ?
 
Physics news on Phys.org
One way to interpret your question is, "what is the sampling distribution generated by n=2, q=0.5?" as in http://faculty.vassar.edu/lowry/binomial.html

OTOH for a joint test of two variables you need to know their joint distribution. In the iid case that's F(x,y)=F(x)F(y).
 
Last edited:
EnumaElish said:
One way to interpret your question is, "what is the sampling distribution generated by n=2, q=0.5?" as in http://faculty.vassar.edu/lowry/binomial.html

Thanks though I don't quite understand how you mean to apply this to hypothesis testing.

OTOH for a joint test of two variables you need to know their joint distribution. In the iid case that's F(x,y)=F(x)F(y).

The joint distribution on its own isn't really appropriate because F(x1,...,xn) would be O(1/2^n). For independent rv's I guess the Kolmogorov-Smirnov distance would be useful as for a sample of size 1 it resembles a two-tail test. For non-independent samples I'm still not sure what is suitable.
 
Do you care to explain your statement below?
bpet said:
The joint distribution on its own isn't really appropriate because F(x1,...,xn) would be O(1/2^n).
 
EnumaElish said:
Do you care to explain your statement below?

Say the variables are independent, as a rough approximation you could say the values are clustered about the median so F(x1,...,xn) ~ (1/2)^n. So the cdf on its own isn't really sufficient to use as a p-value, but I guess the multivariate generalization of the KS statistic could be used - though to calculate the critical values would be quite difficult and probably require Monte-Carlo simulation.

As an example, since the multivariate normal cdf has no closed form, what would be a procedure to test a sample, say the distribution Xi ~ N(0,1) with E[XiXj]=r for i<>j, 1<=i,j<=N when N is large?
 

Similar threads

  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
3K
Replies
3
Views
12K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K