How Do You Calculate the Probability of Sum T in a Multivariable Sample Space?

Click For Summary

Discussion Overview

The discussion revolves around calculating the probability of achieving a sum T greater than or equal to 50 when drawing from a set of weighted balls multiple times. Participants explore the application of probability theory, particularly focusing on multinomial distributions and the Central Limit Theorem (CLT) in a multivariable sample space context.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning
  • Homework-related

Main Points Raised

  • One participant describes a scenario involving 5 balls with different probabilities and seeks to find the probability that the sum of 20 draws is at least 50.
  • Another participant identifies the distribution as multinomial and suggests calculating the sum for each combination of draws, using the multinomial coefficient.
  • Some participants propose using the Central Limit Theorem to approximate the distribution of the sum, noting that the sum can be treated as approximately normally distributed.
  • There is a discussion about how to find the mean and variance based on the provided probability distribution, with a participant questioning the similarity to frequency tables.
  • Clarifications are sought regarding the multinomial term and the condition that the sum of indices equals the total number of draws.

Areas of Agreement / Disagreement

Participants express varying approaches to the problem, with some favoring direct calculation using multinomial distributions and others advocating for approximation via the Central Limit Theorem. No consensus is reached on the best method to apply.

Contextual Notes

Participants mention the need to consider continuity corrections when applying the CLT, and there are unresolved questions about the specific calculations for mean and variance in this context.

Who May Find This Useful

This discussion may be useful for students or individuals interested in probability theory, particularly those dealing with weighted random selections and the application of the Central Limit Theorem in practical scenarios.

Kariege
Messages
15
Reaction score
0
Hi,
I'm currently having a lot of trouble with this probability problem. For example:
Suppose there are 5 balls in a bag with number 1,2,3,4,5. I pick a ball at random 20 times (with replacement).
Lets say the probability of each ball being picked is:
P(1) = 0.5
P(2) = 0.15
P(3) = 0.1
P(4) = 0.2
P(5) = 0.05
After I pick the ball 20 times, I sum it up. The sum is denoted as T.
I want to find the probability that T>=50. How do I go about doing this?

I'm not entirely sure if this actually links to sample space. Sample space is just something that I have in my mind. I've seen sample space where there are 2 variables, but what about more than 2?

Any help would be appreciated
Thanks
 
Physics news on Phys.org
It looks messy. The distribution is multinomial. For each combination you need to calculate the sum and finally add up the probabilities for those where the sum is what you want.

Mutinomial term \frac{n!}{i!j!k!l!m!}p_1^ip_2^jp_3^kp_4^lp_5^m over all non-negative possibilities where i+j+k+l+m=n. (n=20).
 
  • Like
Likes   Reactions: Kariege
You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)
 
  • Like
Likes   Reactions: Kariege
mathman said:
It looks messy. The distribution is multinomial. For each combination you need to calculate the sum and finally add up the probabilities for those where the sum is what you want.

Mutinomial term \frac{n!}{i!j!k!l!m!}p_1^ip_2^jp_3^kp_4^lp_5^m over all non-negative possibilities where i+j+k+l+m=n. (n=20).

Thanks. In this context, how would you approach this using the formula?
Also why does i+j+k+l+m = 20? I'm confused.​
 
Last edited:
statdad said:
You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean (<-- that should be variance: sorry) found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)
 
statdad said:
You could write a little routine to work out the actual sampling distribution for the sum (probably not the most efficient use of your time)
or use the Central Limit Theorem to get an approximation.
* You have the probability distribution for the number to be drawn on each selection - find the mean and variance
* The CLT says the sum is approximately normally distributed with mean = 20 times the mean found above and variance = 20 times the mean found above
Use the appropriate normal distribution for your approximation. (You may want to use a continuity correction: I haven't looked at any of the numbers)

Thanks for the reply.
CLT is quite a new concept to me but I think this can help me to solve the problem.
Sry if this is quite a stupid question but how would you find the mean and the variance in this case? Is it similar to finding the mean and variance of a frequency table because this is a probability table?
 
"Is it similar to finding the mean and variance of a frequency table because this is a probability table?"

Yes - make a table, row 1 the different numbers that could be picked, row 2 the probabilities you've assigned, and work as though they were frequencies. Here is an example (I simply picked the first page that popped up in a search)

http://www.mathsisfun.com/data/random-variables-mean-variance.html
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 36 ·
2
Replies
36
Views
5K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
1K
Replies
5
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 29 ·
Replies
29
Views
5K
  • · Replies 1 ·
Replies
1
Views
2K