- #1
Mr.V.
- 9
- 1
Hi!
I have a data set of ~5000 unique elements.
From that set I have 2 subsets that are not mutually exclusive. For example if the elements are letters from A-Z, the first set could be A, B, C, D, E, F, and G the second set could be E, F, G, H, I, and J.
Here's the question...
The first subset has 100 elements randomly chosen from the 5000. The second subset has 100 elements randomly chosen from the 5000. Of interest is that 10 of the elements from subset1 are also in subset 2.
What is the probability of that happening?
Here's my logic so far, though I'm not sure I'm right.
If we use subset 1 as the reference:
If subset 1 had 1 element, the probability of getting any 1 element in subset 2 is:
[tex]\frac{100}{5000}[/tex] or [tex]\frac{1}{50}[/tex]. The probability of getting 2 elements in a set of 2 is: [tex]\frac{100}{5000} * \frac{99}{4999}[/tex] ... the probability of getting 10 if subset 1 were only 10 elements would be [tex]\frac{100!}{90!}*\frac{4990!}{5000!}[/tex] which is roughly [tex](\frac{1}{50})^{10} = 1.024*10^{-17}[/tex]
However since I have a set of 100, I need to include the chance of getting that set of 10 in many different ways...I think I should use combinations correct? So since I had a set of 100, and I want a subset of 10, there are [tex]\frac{100!}{10!*90!}[/tex] different ways of ordering that set...
So is the correct answer...
[tex](\frac{1}{50})^{10} * \frac{100!}{10!*90!} = 0.00018[/tex]
?
Thanks for your help!
I have a data set of ~5000 unique elements.
From that set I have 2 subsets that are not mutually exclusive. For example if the elements are letters from A-Z, the first set could be A, B, C, D, E, F, and G the second set could be E, F, G, H, I, and J.
Here's the question...
The first subset has 100 elements randomly chosen from the 5000. The second subset has 100 elements randomly chosen from the 5000. Of interest is that 10 of the elements from subset1 are also in subset 2.
What is the probability of that happening?
Here's my logic so far, though I'm not sure I'm right.
If we use subset 1 as the reference:
If subset 1 had 1 element, the probability of getting any 1 element in subset 2 is:
[tex]\frac{100}{5000}[/tex] or [tex]\frac{1}{50}[/tex]. The probability of getting 2 elements in a set of 2 is: [tex]\frac{100}{5000} * \frac{99}{4999}[/tex] ... the probability of getting 10 if subset 1 were only 10 elements would be [tex]\frac{100!}{90!}*\frac{4990!}{5000!}[/tex] which is roughly [tex](\frac{1}{50})^{10} = 1.024*10^{-17}[/tex]
However since I have a set of 100, I need to include the chance of getting that set of 10 in many different ways...I think I should use combinations correct? So since I had a set of 100, and I want a subset of 10, there are [tex]\frac{100!}{10!*90!}[/tex] different ways of ordering that set...
So is the correct answer...
[tex](\frac{1}{50})^{10} * \frac{100!}{10!*90!} = 0.00018[/tex]
?
Thanks for your help!
Last edited: