Question about determining probability

In summary, the conversation discusses the probability of getting a certain number of elements that overlap between two subsets of a larger data set. The probability is calculated using combinations and is approximately 0.00018.
  • #1
Mr.V.
9
1
Hi!
I have a data set of ~5000 unique elements.
From that set I have 2 subsets that are not mutually exclusive. For example if the elements are letters from A-Z, the first set could be A, B, C, D, E, F, and G the second set could be E, F, G, H, I, and J.
Here's the question...
The first subset has 100 elements randomly chosen from the 5000. The second subset has 100 elements randomly chosen from the 5000. Of interest is that 10 of the elements from subset1 are also in subset 2.
What is the probability of that happening?
Here's my logic so far, though I'm not sure I'm right.
If we use subset 1 as the reference:
If subset 1 had 1 element, the probability of getting any 1 element in subset 2 is:
[tex]\frac{100}{5000}[/tex] or [tex]\frac{1}{50}[/tex]. The probability of getting 2 elements in a set of 2 is: [tex]\frac{100}{5000} * \frac{99}{4999}[/tex] ... the probability of getting 10 if subset 1 were only 10 elements would be [tex]\frac{100!}{90!}*\frac{4990!}{5000!}[/tex] which is roughly [tex](\frac{1}{50})^{10} = 1.024*10^{-17}[/tex]
However since I have a set of 100, I need to include the chance of getting that set of 10 in many different ways...I think I should use combinations correct? So since I had a set of 100, and I want a subset of 10, there are [tex]\frac{100!}{10!*90!}[/tex] different ways of ordering that set...
So is the correct answer...
[tex](\frac{1}{50})^{10} * \frac{100!}{10!*90!} = 0.00018[/tex]
?
Thanks for your help!
 
Last edited:
Physics news on Phys.org
  • #2
Hmm.

Lets say we have 1000 objects, and 20 are in a particular subset - call it set A, and we want to know what the odds are of picking a subset B of 20 elements at random so that exactly 2 of these elements is in A.

Clearly, there are 1000 choose 20 ways to choose B.
Now, if B is to have two elements from A, then there are 20 choose 2 ways to pick those two elements, and, since B will also have 18 elements from the remaining 1980 objects, there are 1980 chose 18 ways to pick those.

Thus, the probablity of getting exactly 2 elements would be:
[tex]\frac{20 \rm{C}2 \times 1980 \rm{C}18}{2000 \rm{C} 20}[/tex]
 
  • #3


I would approach this question by first clarifying the goal of determining the probability. Is the goal to find the probability of getting exactly 10 elements that are also in subset 2, or is it to find the probability of getting at least 10 elements that are also in subset 2? This distinction is important because the probability will be different for each scenario.

Assuming the goal is to find the probability of getting exactly 10 elements that are also in subset 2, your logic and calculations seem to be correct. You have correctly used the concept of combinations to account for the different ways in which the 10 elements can be chosen from the set of 100 elements in subset 1.

However, if the goal is to find the probability of getting at least 10 elements that are also in subset 2, then the calculation would be slightly different. In this case, you would need to calculate the probability of getting 10, 11, 12,...,100 elements that are also in subset 2, and then add all these probabilities together to get the overall probability. This would require more calculations, but the concept of combinations can still be applied to each scenario.

In conclusion, your approach and calculations seem to be correct, but it is important to clarify the goal of determining the probability before proceeding with the calculations.
 

1. What is probability?

Probability is a measure of the likelihood of an event occurring. It is expressed as a number between 0 and 1, where 0 represents an impossible event and 1 represents a certain event.

2. How is probability determined?

Probability is determined by dividing the number of favorable outcomes by the total number of possible outcomes. For example, if you roll a six-sided die, the probability of rolling a 3 would be 1/6 because there is only one favorable outcome out of six possible outcomes.

3. What is the difference between theoretical and experimental probability?

Theoretical probability is based on mathematical calculations and assumes that all outcomes are equally likely. Experimental probability is based on actual observations or experiments and may differ from theoretical probability due to chance or other factors.

4. How does sample size affect probability?

The larger the sample size, the more accurate the probability calculation will be. This is because a larger sample size reduces the impact of chance and provides a more representative sample of the population.

5. Can probability be greater than 1?

No, probability cannot be greater than 1. A probability of 1 represents a certain event, while a probability greater than 1 would imply that an event is more than certain, which is not possible.

Similar threads

  • Precalculus Mathematics Homework Help
Replies
6
Views
815
  • Precalculus Mathematics Homework Help
Replies
11
Views
1K
  • Precalculus Mathematics Homework Help
2
Replies
36
Views
2K
  • Precalculus Mathematics Homework Help
Replies
3
Views
864
  • Precalculus Mathematics Homework Help
Replies
14
Views
2K
  • Precalculus Mathematics Homework Help
Replies
2
Views
1K
  • Precalculus Mathematics Homework Help
Replies
3
Views
911
  • Precalculus Mathematics Homework Help
Replies
10
Views
820
  • Precalculus Mathematics Homework Help
Replies
12
Views
2K
  • Precalculus Mathematics Homework Help
Replies
16
Views
2K
Back
Top