# Distribution of sum of discrete random variable

by Tosh5457
Tags: discrete, distribution, random, variable
 P: 221 Edit: I have to think more about this, I'll post later.
P: 4,542
 Quote by Tosh5457 Edit: I have to think more about this, I'll post later.
If you need a hint, assuming independence, think about convolution theorem for discrete random variables.
 P: 221 Ok, my problem is about poker tourneys. Consider the random variable Y, which is the sum of N random variables X: $$Y=X+X+...+X=NX$$ X is the random variable that assigns a prize value to each in-the-money position, and assigns -1 to out-of-money positions. So: $$X(position)=\left\{\begin{matrix}-1, position = "OTM" \\w_{1}, position = "1st" \\w_{2}, position = "2nd" \\... \\w_{n}, position = "nth" \end{matrix}\right.$$ OTM means out-of-money, and here n is the number of in-the-money positions. w1, w2,..., wn are constants of course. The probability mass function of X is: $$f(x)=\left\{\begin{matrix} \beta_{1}, x = -1 \\ \beta_{2}, x = w_{1} \\ \beta_{3}, x = w_{2} \\ ... \\ \beta_{n+1}, x = w_{n} \end{matrix}\right.$$ What I want to know is the probability mass function of Y (which represents the profit/loss of N tourneys). I could find this by using the convolution theorem, but that's where my problem arises. As I understood it, Y needs to depend on a variable y, so: $$Y(y)=X(x)+X(x)+...+X(x)=NX(x)$$ But I don't know how to define y... This is why I can't use the convolution theorem, because I didn't really understand this part.
P: 5,768

## Distribution of sum of discrete random variable

Assuming all the X's are independent, it would be better to use the characteristic function (Fourier transform of distribution). Then the characteristic function for Y is the Nth power of the char. function for X. To get the distribution function for Y, take the inverse transform.
P: 221
 Quote by mathman Assuming all the X's are independent, it would be better to use the characteristic function (Fourier transform of distribution). Then the characteristic function for Y is the Nth power of the char. function for X. To get the distribution function for Y, take the inverse transform.
Isn't that only for continuous random variables (and not discrete)?

Anyway, shouldn't it be easy by convolution? I think that I'm only missing something very basic...
P: 4,542
 Quote by Tosh5457 Isn't that only for continuous random variables (and not discrete)? Anyway, shouldn't it be easy by convolution? I think that I'm only missing something very basic...
The y variable is just a dummy variable and you can call it whatever you want. As long as you are doing a discrete convolution for PDF's with univariate distributions (i.e. with PDF in form P(X = x) = blah) then your dummy variable will always correspond to this x or whatever other dummy variable you have chosen.
P: 5,768
 Quote by Tosh5457 Isn't that only for continuous random variables (and not discrete)? Anyway, shouldn't it be easy by convolution? I think that I'm only missing something very basic...
You can do it for discrete random variables. The density function is a linear combination of Dirac delta functions, so the characteristic function is a linear combination of exponential functions.
P: 221
 You can do it for discrete random variables. The density function is a linear combination of Dirac delta functions, so the characteristic function is a linear combination of exponential functions.
I'm not comfortable with that, I never even studied dirac delta function.

About the convolution, I still don't understand some things...

1st problem:

Definition of convolution for discrete r.v:
$$f_{Z}(z)=\sum_{x=-\inf}^{x=+inf}f_{Y}(z-x)f_{X}(x)$$
where x is an integer.

The problem here is that $$f_{X}(x)$$ may be always 0 in my example (except for x = -1), since there may be no integer prizes... And if I defined the r.v. X slightly differently (not assigning that -1 value) that function would always be 0 and there would be no probablity mass function for Z? Something's not right here...
P: 4,542
 Quote by Tosh5457 I'm not comfortable with that, I never even studied dirac delta function. About the convolution, I still don't understand some things... 1st problem: Definition of convolution for discrete r.v: $$f_{Z}(z)=\sum_{x=-\inf}^{x=+inf}f_{Y}(z-x)f_{X}(x)$$ where x is an integer. The problem here is that $$f_{X}(x)$$ may be always 0 in my example (except for x = -1), since there may be no integer prizes... And if I defined the r.v. X slightly differently (not assigning that -1 value) that function would always be 0 and there would be no probablity mass function for Z? Something's not right here...
It would help if you stated the discrete PDF's for f(x) and for g(y) corresponding to the distributions of X and Y, just for clarification.
 P: 221 Ok I'll write everything again with the pmf included, so everything will be on the same post. Definition of convolution of pmf's for discrete random variables X and Y: $$f_{Z}(z)=\sum_{x=-inf}^{x=+inf}f_{Y}(z-x)f_{X}(x)$$ where x is an integer. PMF of X and Y (they have the same distribution): $$f(X=x)=\left\{\begin{matrix}a_{1}, x=-1 \\ a_{2}, x=w_{1} \\ a_{3}, x=w_{2} \\ ... \\ a_{n}, x=w_{n-1} \end{matrix}\right.$$ where w1, w2, ..., wn-1 are real constants. Problem: In the series, $$f_{X}(x)$$ may be always 0 except for -1, since w1, w2, ..., wn-1 in general won't be integers.
P: 4,542
 Quote by Tosh5457 Ok I'll write everything again with the pmf included, so everything will be on the same post. Definition of convolution of pmf's for discrete random variables X and Y: $$f_{Z}(z)=\sum_{x=-inf}^{x=+inf}f_{Y}(z-x)f_{X}(x)$$ where x is an integer. PMF of X and Y (they have the same distribution): $$f(X=x)=\left\{\begin{matrix}a_{1}, x=-1 \\ a_{2}, x=w_{1} \\ a_{3}, x=w_{2} \\ ... \\ a_{n}, x=w_{n-1} \end{matrix}\right.$$ where w1, w2, ..., wn-1 are real constants. Problem: In the series, $$f_{X}(x)$$ may be always 0 except for -1, since w1, w2, ..., wn-1 in general won't be integers.
If X and Y have the same distribution, then you won't have any problem. It doesn't matter what w1, w2, etc are if they are integers, rationals, or reals: you're dealing with defining the probabilities for Z and if X and Y have the same values that correspond to some probability (remember you said that X and Y have the same distribution which implies the same a(1),w1,w2 etc, then you will find the probability distribution using the probabilities and then create the mapping to values afterwards.

Are you wondering about the mapping procedure if your values that are mapped to probabilities are not integers?
P: 221
 Quote by chiro you're dealing with defining the probabilities for Z and if X and Y have the same values that correspond to some probability (remember you said that X and Y have the same distribution which implies the same a(1),w1,w2 etc, then you will find the probability distribution using the probabilities and then create the mapping to values afterwards.
Sorry, I didn't understand this part. What do you mean by "find the probability distribution using the probabilities"? Are you referring to the convolution?

 Are you wondering about the mapping procedure if your values that are mapped to probabilities are not integers?
Yes, that's my problem.
P: 2,892
 Quote by Tosh5457 Consider the random variable Y, which is the sum of N random variables X: $$Y=X+X+...+X=NX$$
This is faulty notation if the various 'X's' can have different outcomes. You should say "is the sum of N random variables $X_1,X_2,...X_N$. If you want to indicate that the $X_i$ have the same distribution, just say that in words. Don't do it by naming them as if they all realize the same value.

 $$Y(y)=X(x)+X(x)+...+X(x)=NX(x)$$
You mean:
$$Y(y) = X_1(x_1) + ...+ X_N(x_N)$$

and you can't combine these terms.

 But I don't know how to define y... This is why I can't use the convolution theorem, because I didn't really understand this part.

The variable y represents a particular value of the random variable Y. It just like saying "Let W be the random variable that represents the value of the face on a roll of a fair die". W(w) would be the particular event that the face value was w. (e.g. W(2) says the face that came up was 2.

To find the probability of particular value of Y, such as y = $40, you must sum the probabilities of all combinations of values $x_i$ for the $X_i$ which add up$40. So you consider all possible values of the $x_i$ that meet that requirement. That is essentially what "convolving" random variables means.
P: 4,542
 Quote by Tosh5457 Yes, that's my problem.
Stephen Tashi answered your question: consider all possibilities of values given your distributions and then use the convolution algorithm (or theorem) to get the actual probability for each mapping.

The best way to do this is to do it for the first two random variables and then simplify and repeat to incorporate the rest of them. Basically you should have n x m outputs for your convoluted distribution if n is the number of outputs for your starting and m is the number of outputs for the one you are convolving with. When you do this repeatedly m because the number of mappings in the last distribution you calculated through convolution.
P: 221
Thanks for the replies

 The variable y represents a particular value of the random variable Y. It just like saying "Let W be the random variable that represents the value of the face on a roll of a fair die". W(w) would be the particular event that the face value was w. (e.g. W(2) says the face that came up was 2.
I was taught that that variable w described the event, not the value of the random variable. In your example, and in my understanding, what would say that the value of the face on a roll of a fair die was 2 is the value of W, not w. I'd represent that as W(2) = 2 or W("2 came off") = 2, depending on what's the set I define w to be in.

 To find the probability of particular value of Y, such as y = $40, you must sum the probabilities of all combinations of values xi for the Xi which add up$40. So you consider all possible values of the xi that meet that requirement. That is essentially what "convolving" random variables means.
Sum the probabilities? Don't you mean multiply? If I sum the probabilities, and if I understood right, I'll get probabilities higher than 1 in some cases.
For example, Y = w1 + w1 + w1 + ... + w1 = N*w1 is a possibility for Y. If I sum the probabilities of X=w1 N times I can easily get a number higher than 1.

This is what I understood, please tell me if I'm wrong:

For simplicity let's say Y = X1 + X2, and each of these X's can only be w1 and w2. Then Y will be like this (eta is a dummy variable, didn't even bother to write it in each value of Y):
$$Y(\eta )=\left\{\begin{matrix} \\ w_{1}+w_{1} \\ w_{1}+w_{2} \\ w_{2}+w_{1} \\ w_{2}+w_{2} \end{matrix}\right.$$

In general, for the sum of 2 random variables, it will have n x m values, where n is the number of possible values X1 can have, and m is the number of possible values X2 can have (like chiro said).

Now, if I attribute a probability of 0.7 to w2 and 0.3 to w1 for example, the probability of Y=w2+w2 would be 1.4, in my understanding.
P: 4,542
 Quote by Tosh5457 Thanks for the replies Sum the probabilities? Don't you mean multiply? If I sum the probabilities, and if I understood right, I'll get probabilities higher than 1 in some cases. For example, Y = w1 + w1 + w1 + ... + w1 = N*w1 is a possibility for Y. If I sum the probabilities of X=w1 N times I can easily get a number higher than 1.
You won't end up doing this: you have to use the calculations from the convolution algorithm which will involve multiplying probabilities and also adding (using the convolution theorem for discrete random variables).

Remember that it won't be N*w1: instead you will have to do a convolution for the first two random variables, then do a convolution with this calculated PDF with the next variable and then you keep doing this until you have a PDF for n-1 random variables with the nth random variable. If you expand this out, you'll see that it is a lot more involved than the N*w1 behaviour that you are implying: it doesn't work like that.

Convolution simply represents a way to multiply frequencies together of two functions and this idea is not only for probability but it's used for many areas of signal analysis and other areas of applied mathematics.

 Related Discussions Calculus & Beyond Homework 9 Precalculus Mathematics Homework 6 Set Theory, Logic, Probability, Statistics 3 Calculus & Beyond Homework 0 Set Theory, Logic, Probability, Statistics 37