Q: How to compute P(X=200 and Y<150)?

Click For Summary

Homework Help Overview

The problem involves computing probabilities related to the outcomes of rolling a fair die 1000 times, specifically focusing on the occurrences of the numbers 5 and 6. The original poster seeks to find the probability that the number 6 appears exactly 200 times while the number 5 appears less than 150 times.

Discussion Character

  • Mixed

Approaches and Questions Raised

  • Participants discuss the relationship between the occurrences of the numbers 5 and 6, questioning how the condition of having 200 sixes affects the distribution of the remaining rolls. There is exploration of using binomial and trinomial distributions to model the problem.

Discussion Status

Participants are actively engaging with the problem, offering insights into the use of discrete probability mass functions and discussing the implications of conditioning on the number of sixes. Some participants are exploring the continuity correction for approximating probabilities.

Contextual Notes

There is mention of the problem's context within a chapter that follows discussions on various probability distributions, indicating a focus on discrete outcomes and their approximations.

CAF123
Gold Member
Messages
2,918
Reaction score
87

Homework Statement


1000 independent rolls of a fair die will be made. Compute an approximation to the probability that the number 6 will appear between 150 and 200 times inclusively. If the number 6 appears exactly 200 times, find the probability that the number 5 will appear less than 150 times.

The Attempt at a Solution


I have the first part correct. For the second part, I said: If X is the number of times a 6 appears and Y is the number of times a 5 appears, then we want P(Y<150|X=200). I simplified this to P(X=200 and Y<150)/(P(X=200), from which I could compute P(X=200). Since X and Y are not independent, I can't split the intersection up. I thought about rearranging the numerator into something involving a conditional probability
(i.e like P(X=200 and Y<150) = P(X=200|Y<150)P(Y<150)) but I don't see how this helps.) I also noticed that this intersection looked like a joint RV, but I don't have the density function to do the necessary integration.
Any advice on how to compute P(X=200 and Y<150)?
 
Physics news on Phys.org
If you know the number 6 appears 200 times, what would you say about the way in which the outcomes of the remaining 800 rolls are distributed?
 
CAF123 said:

Homework Statement


1000 independent rolls of a fair die will be made. Compute an approximation to the probability that the number 6 will appear between 150 and 200 times inclusively. If the number 6 appears exactly 200 times, find the probability that the number 5 will appear less than 150 times.

The Attempt at a Solution


I have the first part correct. For the second part, I said: If X is the number of times a 6 appears and Y is the number of times a 5 appears, then we want P(Y<150|X=200). I simplified this to P(X=200 and Y<150)/(P(X=200), from which I could compute P(X=200). Since X and Y are not independent, I can't split the intersection up. I thought about rearranging the numerator into something involving a conditional probability
(i.e like P(X=200 and Y<150) = P(X=200|Y<150)P(Y<150)) but I don't see how this helps.) I also noticed that this intersection looked like a joint RV, but I don't have the density function to do the necessary integration.
Any advice on how to compute P(X=200 and Y<150)?

There are no densities and integrations involved here---only discrete probability mass functions, and summations (although later you can make a continuous approximation and THEN get a density).

Besides the suggestion that Yuu Suzumi has made, you can also look at this as involving a trinomial distribution with three outcomes per trial: 5, 6 or 'other'. In N independent trials, what is the probability that you get k5 5s, k6 6s and N-k5-k6 'others'?

RGV
 
@Yuu I know that we have n independent Bernoulli trials => we have a binomial(800,1/6). Is this what you mean? How does this help? I recognised that in part a) of the question we had binomial(1000,1/6) which since n is large, approximates to a normal RV.
@RGV I haven't come across the trinomial distribution. This problem comes from a chapter following content on the uniform, normal, exponential distributions.
 
Be careful! When the six is removed, how many numbers share the 800 trials?
 
Ok, so I should have binomial(800,1/5). I see how the condition on there being 200 sixes comes into play now. I would be able to compute the probability if there was exactly 150 fives. How to do it for less than 150 fives? I know P( x < 5), say is P(0) + P(1) + P(2)..+ P(4), but this is not efficient for larger numbers.
 
CAF123 said:
Ok, so I should have binomial(800,1/5). I see how the condition on there being 200 sixes comes into play now. I would be able to compute the probability if there was exactly 150 fives. How to do it for less than 150 fives? I know P( x < 5), say is P(0) + P(1) + P(2)..+ P(4), but this is not efficient for larger numbers.

Originally, you said you had the first part correct (that is, the probab. of having between 150 and 200 sixes). How did you do that computation? Why can't you do something similar for the second part?

RGV
 
Ray Vickson said:
Originally, you said you had the first part correct (that is, the probab. of having between 150 and 200 sixes). How did you do that computation? Why can't you do something similar for the second part?

RGV
Many thanks. I didn't think they wanted an approximation for the second part, but this gives me the correct answer. One question: to get the correct answer, I had to do P(x<149.5) rather than P(x<150.5). I understand we add some small number on to allow a continuity correction, but why do we take the 1/2 away rather than add it on (in this case)? In general, can we add any small increment on?
 
CAF123 said:
Many thanks. I didn't think they wanted an approximation for the second part, but this gives me the correct answer. One question: to get the correct answer, I had to do P(x<149.5) rather than P(x<150.5). I understand we add some small number on to allow a continuity correction, but why do we take the 1/2 away rather than add it on (in this case)? In general, can we add any small increment on?

Because for a discrete random variable taking the values {0,1,2,...,800}, the event {X < 150} is the same as {X <= 149}, which you change to {X <= 149.5} in the continuous approximation. If you put had {X <= 150.5} you would have been be looking at {X <= 150}, not {X <= 149}.

RGV
 
  • #10
Ray Vickson said:
Because for a discrete random variable taking the values {0,1,2,...,800}, the event {X < 150} is the same as {X <= 149}, which you change to {X <= 149.5} in the continuous approximation. If you put had {X <= 150.5} you would have been be looking at {X <= 150}, not {X <= 149}.

RGV
ok, thanks. Does it have to be a half that we add on(or is this just a convenience)?
 
  • #11
CAF123 said:
ok, thanks. Does it have to be a half that we add on(or is this just a convenience)?

No, there is a good reason. Say we have p(k) for k = 1,2,3,... and we are approximating this by a continuous probability density f(x). We have
[tex]p(1) \approx \int_{1/2}^{3/2} f(x) \, dx,\\<br /> p(2) \approx \int_{3/2}^{5/2} f (x) \, dx, \\<br /> \vdots \\<br /> p(k) \approx \int_{k - 1/2}^{k + 1/2} f(x) \, dx,[/tex]
so
[tex]p(a) + \cdots + p(b) \approx \int_{a-1/2}^{b+1/2} f(x) \, dx .[/tex]

In words, p(k) equals the area of a rectangle of height p(k) and with 1, which is almost the same as the area under the f(x) curve for x from k-1/2 to k+1/2. A sum of the probabilities equals the sum of the areas.

RGV
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
Replies
7
Views
2K
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 6 ·
Replies
6
Views
2K