Ryker said:
Homework Statement
What is the probability that in a class of N people you will be able to find a trio that shares the same birthday? You only need to find one such trio, and the rest can have whatever birthday they want, as long as it's not the same as that of the chosen trio. So it could be that we have a trio and everyone else has the same birthday or that we have multiple trios. The only thing that matter is that there is one such trio.
The Attempt at a Solution
{365\choose 1}{N\choose 3}\frac{1}{365^{3}}\frac{364^{N-3}}{365^{N-3}}
I don't see where I could've gone wrong, but
Mathworld says that the probability of
at least 3 people sharing a birthday tops 50% at 88. But with my formula I get a lower number than that despite the fact that what I'm asking is a subset of that. So my probability for the same number N should be lower, thereby requiring a bigger class for the probability to exceed 50%.
What am I doing wrong?
Your quantity above is merely the first term in a (finite) series; when the other terms are included, the number needed to have a probability of ≥ 50% rises to 89. In fact, for n = 88 the probability of one or more trio is 0.48935 while for n = 89 it is 0.500044. Here is how I get these.
For i = 1,2,...,365, let D_i be the event that exactly 3 people have birthday i. Then, the inclusion-exclusion principle implies that the probability P
≥1 = P{at least one event D
i occurs} is
P_{\geq 1} = S_1 - S_2 + S_3 - S_4 + \cdots .
Here
S_1 = \sum_{i=1}^{365} P(D_i) = 356 P(D_1) \\<br />
S_2 = \sum_{1 \leq i < j \leq 365} P(D_i D_j) = {365 \choose 2} P(D_1 D_2)\\<br />
S_3 = \sum_{1 \leq i < j < k \leq 365} P(D_i D_j D_k) = {365 \choose 3} P(D_1 D_2 D_3)\\<br />
\vdots<br />
where we assume each of the 365 days is equally likely and where we use the notation ##AB## instead of ##A \cap B##, ##A B C## instead of ##A \cap B \cap C##, etc. You computed ##S_1## in your post #1.
The probability P(D_1) is given by the binomial distribution with parameters N, p = 1/365:
P(D_1) = {N \choose 3} p^3 (1-p)^{N-3}.
The probability P(D_1 D_2) is given by the trinomial distribution with parameters ##N, p_1 = p_2 = p = 1/365, p_3 = 1 - 2p##:
P(D_1 D_2) = {N \choose 3, 3, N-6} p^6 (1-2p)^{N-6} = <br />
\frac{N!}{3!^2 \, (N-6)!} (1/365)^6 (363/365)^{N-6},.
Similarly,
P(D_1 D_2 \ldots D_k)<br />
= {N \choose 3,3,3, \ldots, 3, N - 3k} p^{3k} (1 - kp)^{N - 3k},
where the multinomial coefficient is
{N \choose j_1, j_2, \ldots, j_r, N-j_1 - j_2 - \cdots - j_r}<br />
= \frac{N!}{j_1! j_2! \cdots j_r! (N - j_1 - j_2 - \cdots j_r)!}
Your computation was just S_1, but the other terms are important as well. While in principle the expression for P
≥1 should include up to ##\min(365, \lfloor N/3 \rfloor)## terms, in practice it suffices to take about 8 terms because the successive S
k are decreasing and the series is alternating, so the error by stopping at S
j is less than S
j+1.