Double integration with normal distribution

  1. 1. The problem statement, all variables and given/known data
    Given X and Y are independent, normal distribution variable. a and b are constants.


    2. Relevant equations
    The probability of P(X+Y<b,X<a)



    3. The attempt at a solution

    P(X+Y<b,X<a)=\int_{-\infty}^{a}f(x)\int_{-\infty}^{b-x}f(y)dxdy

    Is there a close-form solution for this problem?

    Thanks,
    Best,
     
  2. jcsd
  3. jambaugh

    jambaugh 1,802
    Science Advisor
    Gold Member


    Yes there is but you need to convert the area integral into polar coordinates. Actually you could probabily solve the problem geometrically if you can visualize the area over which you are integrating and take into account the symmetry.

    I have assumed in answering you that the two variables are both unit normal distributions with zero mean. Remember that:
    [tex] \exp\left(\frac{x^2}{2\sigma^2}\right)\exp\left(\frac{y^2}{2\sigma^2}\right)
    = \exp\left(\frac{x^2+y^2}{2\sigma^2}\right) = \exp(r^2/2\sigma^2)[/tex]

    So it's symmetric under rotations about the origin!!!
     
  4. statdad

    statdad 1,478
    Homework Helper

    Note that the exponents in the previous reply should have negative signs in them - for example

    [tex]
    \exp \left(\frac{-x^2}{2\sigma^2}\right)
    [/tex]

    and so on throughout.

    Also, the original poster did not state this, but this assumes that [tex] X \text{ and } Y [/tex] are identically distributed. If they are not this become much more difficult.

    On a side thought, have you tried a transformation? You can write down the joint density of the two quantities. Now try

    [tex]
    \begin{align*}
    S & = X + Y \tag{S for sum}\\
    T & = X \tag{Your other variable}
    \end{align*}
    [/tex]

    You have [tex] -\infty < S, T < \infty [/tex]. Find the jacobian and obtain the joint density of [tex] S \text{ and } T [/tex]. Setting up the double integral after will give the same answer as the other suggested method. I haven't gone through the work to see whether one is easier than the other.
     
  5. Thanks to both of you. I am glad to see your replies.

    But I still doubt the methods are feasible to solve the problem.

    We can see,


    P(X+Y<b,X<a)=\int_{-\infty}^{a}f(x)\int_{-\infty}^{b-x}f(y)dxdy
    =\int_{-\infty}^{a}f(x)erf(b-x)dx


    However, erf(b-x) does not have a close-form solution.

    If you have time, please provide me the detailed solution. I will hightly appreciate, as I have been worked on it for many days. Thanks

    BTW, how tex my codes?
     
  6. tiny-tim

    tiny-tim 26,041
    Science Advisor
    Homework Helper

    tex

    Hi nowoman! Welcome to PF! :smile:

    For:

    [tex]P(X+Y<b,X<a)\ =\ \int_{-\infty}^{a}f(x)\int_{-\infty}^{b-x}f(y)\,dxdy
    =\int_{-\infty}^{a}f(x)\,erf(b-x)\,dx[/tex]

    you have to put [noparse][tex] before, and [/tex] after …

    [tex]P(X+Y<b,X<a)\ =\ \int_{-\infty}^{a}f(x)\int_{-\infty}^{b-x}f(y)\,dxdy
    =\int_{-\infty}^{a}f(x)\,erf(b-x)\,dx[/tex][/noparse] :smile:

    (also, as you see, "\ " gives you large spaces, and "\," gives you small spaces :wink:)
     
  7. Thanks for your information, tiny-tim.

    I have a bit more thoughts bout the problem.

    [tex]P(X+Y<b, X<a)[/tex]

    Let U=X-a, V=Y-b+a, then, we have

    [tex]P(U+V<0, U<0)[/tex], In this form, the integral area is a sector, we can use polar coordinates to solve it., Such as,

    [tex]P(U+V<0, U<0)=\int_{\theta}^{3\pi /2}\int_{-\infty}^{\infty} f(u+v,u)dudv[/tex]

    But how to compute f(u+v,u)?
     
  8. If we let U and V are standard normal distributions, we have

    [tex]
    P(U+V<0, U<0)=\int_{A}f(u,v)dA=\int_{3\pi /4}^{3\pi /2}\int_{0}^{\infty} f(r cos(\theta),rsin(\theta))rdrd\theta=\int_{3\pi /4}^{3\pi /2}\int_{0}^{\infty} f(r cos(\theta)) f(rsin(\theta))rdrd\theta=3/8
    [/tex]

    Does anyone verify me?
     
  9. jambaugh

    jambaugh 1,802
    Science Advisor
    Gold Member

    Pardon my error, I somehow assumed that a and b were zero so that the area over which the integration occured was an angular wedge with vertex at the origin. My point about symmetries is not very helpful. This is a much more involved problem than I first assumed. You may disregard my post.
     
  10. Thanks.
    Yes, I think if U and V are standard normal distribution, the problem is easy to solve. If U and V are general normal distributions, it is still very hard to solve. Any ideas?
     
  11. jambaugh

    jambaugh 1,802
    Science Advisor
    Gold Member

    In looking at it further I'm pretty sure there is no closed form solution.
     
  12. yes. Do you have ideas how to get an approximate solution that is computable and accurate enough for general purpose? Thanks
     
  13. jambaugh

    jambaugh 1,802
    Science Advisor
    Gold Member

    My first thought is to use Monte Carlo methods. Find a good normal random number generator. Have it generate pairs (x,y) and count how many satisfy the condition
    [tex] x+y<b,\quad x< a[/tex]. Divide by the number of tials.

    See http://www.taygeta.com/random/gaussian.html abotu generating normal distribution random numbers.

    Also recall that a binomial distribution approaches the gaussian as the number of trials increases so you might be able to scale the problem to be approximated by a sum over a double binomial distribution which might be computationally quicker via integer arithmetic.

    Beyond that there's just the standard numerical integration packages. You might try transforming to variables with simple boundaries such as U>0, V>0. Put all the messy into the p.d.f.

    That's all I got for now.
     
  14. Hi jambaugh,
    Thanks for much for your suggestions. The problem is actually a subroutine of a larger problem. The larger problem frequently refer to this problem. Hence, Monte Carlo method may be too expensive to implement it. Thanks though.
     
  15. jambaugh

    jambaugh 1,802
    Science Advisor
    Gold Member

    How precise does the value need to be?

    You might look at Gaussian Quadrature methods which allow you to get good estimates by evaluating at a few specific points.

    Also remember you're dealing with effectively a function of two variables (a,b). You can always pre-calculate a select set of (a,b) points in a data table and then use interpolation, extrapolation in the subroutine.

    What's this for anyway?
     
  16. Thanks, jambaugh. I'll have a look at these methods. Thanks for the long discussion with you. Take care!
     
Know someone interested in this topic? Share a link to this question via email, Google+, Twitter, or Facebook

Have something to add?