How Do You Calculate the Probability of Sum T in a Multivariable Sample Space?

Kariege · May 20, 2015

Hi,
I'm currently having a lot of trouble with this probability problem. For example:
Suppose there are 5 balls in a bag with number 1,2,3,4,5. I pick a ball at random 20 times (with replacement).
Lets say the probability of each ball being picked is:
P(1) = 0.5
P(2) = 0.15
P(3) = 0.1
P(4) = 0.2
P(5) = 0.05
After I pick the ball 20 times, I sum it up. The sum is denoted as T.
I want to find the probability that T>=50. How do I go about doing this?

I'm not entirely sure if this actually links to sample space. Sample space is just something that I have in my mind. I've seen sample space where there are 2 variables, but what about more than 2?

Any help would be appreciated
Thanks

mathman · May 20, 2015

It looks messy. The distribution is multinomial. For each combination you need to calculate the sum and finally add up the probabilities for those where the sum is what you want.

Mutinomial term \frac{n!}{i!j!k!l!m!}p_1^ip_2^jp_3^kp_4^lp_5^m over all non-negative possibilities where i+j+k+l+m=n. (n=20).

statdad · May 20, 2015

You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)

Kariege · May 21, 2015

mathman said:

It looks messy. The distribution is multinomial. For each combination you need to calculate the sum and finally add up the probabilities for those where the sum is what you want.

Mutinomial term \frac{n!}{i!j!k!l!m!}p_1^ip_2^jp_3^kp_4^lp_5^m over all non-negative possibilities where i+j+k+l+m=n. (n=20).

Thanks. In this context, how would you approach this using the formula?

Also why does i+j+k+l+m = 20? I'm confused.

statdad · May 21, 2015

statdad said:

You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean (<-- that should be variance: sorry) found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)

Kariege · May 21, 2015

statdad said:

You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)

Thanks for the reply.
CLT is quite a new concept to me but I think this can help me to solve the problem.
Sry if this is quite a stupid question but how would you find the mean and the variance in this case? Is it similar to finding the mean and variance of a frequency table because this is a probability table?

statdad · May 21, 2015

"Is it similar to finding the mean and variance of a frequency table because this is a probability table?"

Yes - make a table, row 1 the different numbers that could be picked, row 2 the probabilities you've assigned, and work as though they were frequencies. Here is an example (I simply picked the first page that popped up in a search)

http://www.mathsisfun.com/data/random-variables-mean-variance.html

How Do You Calculate the Probability of Sum T in a Multivariable Sample Space?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Who May Find This Useful

Similar threads

Undergrad Please Explain (actually explain) The Monty Hall Problem

Undergrad A variant of the Monty Hall problem

Undergrad My basic understanding of set theory

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

High School Onto set mapping is the surjective set mapping, and into injective?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers