Generate a Multivariate Random Variable

Click For Summary
SUMMARY

This discussion focuses on generating multivariate random variables that follow a specific extreme-value distribution defined by the cumulative distribution function (CDF) F(x_1, x_2) = exp[-(e^{-2x_1} + e^{-2x_2})^{1/2}]. The user outlines a method involving the calculation of marginal and conditional distributions, specifically F_{x_1}(x_1, x_2) and F(x_1, x_2 | x_1), to derive random variables x_1 and x_2. The results indicate a discrepancy in the average values of x_1 and x_2, suggesting a potential dependence between the variables, which is confirmed by the community's input on their correlation.

PREREQUISITES
  • Understanding of cumulative distribution functions (CDFs)
  • Familiarity with marginal and conditional distributions
  • Knowledge of extreme-value distributions
  • Proficiency in statistical simulation techniques
NEXT STEPS
  • Research methods for generating random variables from multivariate distributions
  • Study the properties of extreme-value distributions in depth
  • Learn about covariance and correlation in the context of dependent random variables
  • Explore simulation techniques in statistical software such as R or Python
USEFUL FOR

Economics graduate students, statisticians, data scientists, and researchers working with multivariate distributions and simulations in statistical modeling.

Jeffack
Messages
14
Reaction score
0
Hi,

I'm an economics graduate student doing some work on a nested logit model.

I am trying to generate random variables that follow the following CDF:
F(x_1, x_2) =\textrm{exp}[ -(e^{-2x_1}+e^{-2x_2}) ^{1/2}]
(This is an extreme-value distribution)

With a single random variable, I know that (assuming you can invert the CDF), you can just draw ##u## from the Uniform [0,1] distribution and do ##x=F^{-1}(u)## to get a random variable that follows the distribution described by ##F(x)##.

With the multivariate case, I think what I need to do is:

1) Find ## F_{x_1}(x_1, x_2)##, the marginal distribution of ##F(x_1, x_2)##. I do this by taking the limit as ##x_2## goes to infinity, so ## F_{x_1}(x_1, x_2)=\textrm{exp}[ -(e^{-2x_1}) ^{1/2}]##

2) Find ## F(x_1, x_2 | x_1)##, the conditional distribution of ##F(x_1, x_2)## given ##x_1##. This is calculated this way: ## F(x_1, x_2 | x_1)= {\frac{F(x_1, x_2)}{F_{x_1}(x_1, x_2)}} ##

3) Invert ## F_{x_1}(x_1, x_2)=u_1## to get ## F_{x_1}^{-1}(u)=x_1 ##. This gives us a random ##x_1## for an value of ##u_1 \in (0,1) ##

4) Use the value of ##x_1## generated in the previous step in this step. Invert ## F(x_1, x_2 | x_1)=u_2 ## to get ## F^{-1}(u_2)=x_2##

Here are the formulas I use for determining the random variables (Sorry they're not all pretty and Latex-y... I pulled them from Excel)

x_1=(LN((LN(u_1))^2))/-2

x_2=(LN(((LN(u_2*(EXP(-1*((EXP(-2*x_1))^(1/2))))))/-1)^2-(EXP(-2*x_1))))/-2

I did all of these steps and, at first, thought I got a decent result; As long as I pick ##u##'s that are between 0 and 1, I get a real answer; larger u's generate larger x's; and u's that are arbitrarily close to zero (one) give x's that are very small (large). However, when I ran a simulation and looked at average values of each, my ##x_1##'s tend to be much larger than my ##x_2##'s (about .66 for ##x_1## and -.1 for ##x_2##. Since the CDF is symmetric, I think that these variables should have the same average.

Any help will be much appreciated. This is my first post ever on this site!
 
Physics news on Phys.org
Hey Jeffack and welcome to the forums.

A key question you need to answer: Are x1 and x2 independent variables or are they dependent? (If the limits of integration are tied up or if you have correlation or one variable has the property of being a function of the other then you have dependence).
 
I think that, based on the CDF, the variables are dependent, correct? The conditional density of ##x_2## depends on the value of ##x_1##.
 
Yes I agree with you but once you find the relationship between X1 and f(X1) = X2 then you calculate Cov(X1,X2) = E[X1*X2] - E[X1]E[X2] = E[X1*f(X1)] - E[X1]*E[f(X1)] using only the PDF for X1.
 
Thanks for your help. The variables are definitely correlated. Now I just need to figure out how to generate the random variables.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
1K
  • · Replies 12 ·
Replies
12
Views
4K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K