Distribution of Difference of 2 2nd Degree Non-Central Chi Squared RVs

In summary: A=m_0*m_0'/m_0'*m_0 - m_1*m_1'/m_1'*m_1. You are correct in thinking that this is the difference of two central chi-squares with n degrees of freedom.
  • #1
rhz
12
0
Distribution of difference of two second degree non central chi squared random variables.

This problem can be cast as an indefinite quadratic form for which there are a number of general numerical techniques to determine the CDF. Alternatively, it may be written as a linear combination of independent chi squared random variables.

I'm wondering if there are any simplifications when the linear combination takes the form
of a simple difference of two second degree non-central chi squared distributions.

Context: Consider a two dimensional complex normal random vector x = [x1 x2]' ~ CN(u,R).

I am interested in the distribution of:

|x1|^2 - |x2|^2


Thanks! :smile:
 
Physics news on Phys.org
  • #2
Sorry, I formulated my problem incorrectly.

Correction. Sorry, but I have an error in my problem formulation. The random vector x is real in my case, not complex. So, I'm interested in the distribution of the difference of two chi square variates which each have one degree of freedom.
 
  • #3


Anyone?
 
  • #4


Ummm...5.

Its a difficult question, wait for an expert to get to it. Don't bump your post for 24 hours, ask these questions soon so someone can get around to answering them.
 
  • #5


The original post has a misleading title vis-a-vis the problem that is stated in the last few lines. It also isn't clear what is meant by a "CN" distribution. For example, a Wikipedia article points out that in signal processing, "complex normal" often means "circular complex normal", i.e. that the components of the vector are independent. The subsequent correction is that the variables X1 and X2 are real hardly clears anything up about their joint distribution. Are they independent or not?
 
  • #6


Stephen Tashi said:
The original post has a misleading title vis-a-vis the problem that is stated in the last few lines. It also isn't clear what is meant by a "CN" distribution. For example, a Wikipedia article points out that in signal processing, "complex normal" often means "circular complex normal", i.e. that the components of the vector are independent. The subsequent correction is that the variables X1 and X2 are real hardly clears anything up about their joint distribution. Are they independent or not?

Hi,

I'm really just interested in the distribution of the difference of two non-central chi-square random variables each with one degree of freedom.

Thanks.
 
  • #7


rhz said:
Hi,

I'm really just interested in the distribution of the difference of two non-central chi-square random variables each with one degree of freedom.

Thanks.

Simple random number generation can give you a numerical approximation of the CDF. It may seem crude at first sight but has the advantage of being flexible if later you want to change the distributional or dependence assumptions.

There are lots of other methods that could be used (even under the very restrictive assumption that X1 and X2 are the squares of two independent normal variables with known parameters). To narrow it down, could you say a bit more about the purpose of the exercise?
 
  • #8


There are lots of other methods that could be used (even under the very restrictive assumption that X1 and X2 are the squares of two independent normal variables with known parameters). To narrow it down, could you say a bit more about the purpose of the exercise?[/QUOTE]


Hi,

Thanks for your interest. Here is the full context. Consider a hypothesis testing problem. I have a vector of data x which is drawn from one of two families of normal distributions:


H_i: x~N(m_i,I)

where N(m,R) is a multivariate normal distribution with mean m and covariance R, and I is the identity matrix.

Under each hypothesis, the mean is known to within a multiplicative factor:

m_i = a*mm_i where a is a deterministic unknown scale and mm_i is a known vector.

The generalized likelihood ratio test for this problem takes the following form:

L(x) = x'*A*x

A = m_0*m_0'/m_0'*m_0 - m_1*m_1'/m_1'*m_1

which is the difference of two (dependent) non-central chi square random variable each with one degreee of freedom. It is straightforward to transform this to the difference of two independent non-central chi square random variable each with one degreee of freedom.

I hope that I've explained this well. If any additional info would be useful, just let me know.

Thanks again.
 
  • #9


rhz said:
...
L(x) = x'*A*x

A = m_0*m_0'/m_0'*m_0 - m_1*m_1'/m_1'*m_1

which is the difference of two (dependent) non-central chi square random variable each with one degreee of freedom.
...

Not sure I fully understand this step - I would've thought that (1/2)*x'*A*x is the difference of two central chi-squares with n degrees of freedom (if we express the log of the likelihood ratio in terms of the residuals e=x-a*mm ) ?
 
  • #10


bpet said:
Not sure I fully understand this step - I would've thought that (1/2)*x'*A*x is the difference of two central chi-squares with n degrees of freedom (if we express the log of the likelihood ratio in terms of the residuals e=x-a*mm ) ?

The _generalized_ LRT re-inserts maximum likelihood estimates of unknown parameters back into the likelihood function under each hypothesis and then takes the ratio of these two functions. Even if all parameters were known the residuals would only be zero mean under one of the two hypotheses.

Thanks
 
  • #11


rhz said:
The _generalized_ LRT re-inserts maximum likelihood estimates of unknown parameters back into the likelihood function under each hypothesis and then takes the ratio of these two functions. Even if all parameters were known the residuals would only be zero mean under one of the two hypotheses.

Thanks

Agreed - so if the terms m_0 and m_1 themselves depend on x (being MLE estimates) does that make L more complicated than just a quadratic function of x?
 

1. What is the distribution of the difference of two 2nd degree non-central chi squared random variables?

The distribution of the difference of two 2nd degree non-central chi squared random variables is known as the central F distribution. It is a continuous probability distribution that is used in statistical tests to compare the variances of two populations. It is also commonly used in regression analysis and analysis of variance (ANOVA).

2. How is the central F distribution related to the non-central chi squared distribution?

The central F distribution is the ratio of two non-central chi squared distributions. It is obtained by dividing the numerator degrees of freedom by the denominator degrees of freedom. The non-central chi squared distribution is a generalization of the central F distribution, where the numerator and denominator degrees of freedom are equal.

3. What is the mean and variance of the central F distribution?

The mean of the central F distribution is equal to the ratio of the denominator degrees of freedom to the numerator degrees of freedom. The variance is equal to the ratio of twice the denominator degrees of freedom to the numerator degrees of freedom squared.

4. How is the central F distribution used in hypothesis testing?

The central F distribution is used to test whether the variances of two populations are significantly different. In hypothesis testing, the F statistic is calculated by taking the ratio of the sample variances of the two populations. This statistic is then compared to the critical value of the central F distribution at a given significance level. If the calculated F statistic is greater than the critical value, it is likely that the variances are significantly different.

5. Are there any assumptions or limitations when using the central F distribution?

There are a few assumptions and limitations when using the central F distribution. First, it assumes that the populations being compared are normally distributed. It also assumes that the samples are independent and the variances are homogeneous. Additionally, the central F distribution is only applicable for continuous data and cannot be used for categorical data. Lastly, it is important to note that the central F distribution is only valid for a specific range of degrees of freedom. If the degrees of freedom are too small, the distribution may not be a good approximation of the true population distribution.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
4K
  • Set Theory, Logic, Probability, Statistics
Replies
20
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
756
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
4K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
3K
Back
Top