Finding a sample size that would give a certain probability of sucess

Click For Summary

Homework Help Overview

The problem involves determining the sample size needed to achieve a specific probability of selecting at least one individual with a disease from a population. The context is rooted in probability theory, specifically focusing on sampling without replacement and the implications of hypergeometric distribution.

Discussion Character

  • Exploratory, Conceptual clarification, Mathematical reasoning, Problem interpretation

Approaches and Questions Raised

  • Participants discuss the probability of selecting individuals without the disease and how to express this in terms of the sample size. There is exploration of using binomial coefficients and hypergeometric distribution to set up the problem. Questions arise about the correct application of these distributions and how to derive the sample size needed for a probability threshold.

Discussion Status

The discussion has seen various attempts to clarify the setup of the problem, with some participants providing insights into the use of binomial and hypergeometric distributions. There is acknowledgment of the complexity in deriving a function solely in terms of the sample size, and some participants have reported progress in their understanding and approaches.

Contextual Notes

Participants note the challenge of working without replacement and the implications this has on the choice of statistical methods. There is also mention of constraints related to the lack of a closed-form solution and the need for computational methods to find the sample size.

ascapoccia
Messages
21
Reaction score
0

Homework Statement



a population of n people contain k individuals with a disease. a sample of m people without replacement are chosen. how big should m be so that the probability of having at least one person with the disease is approximately .9?

find m when n= 1000 and k= 10

hint. find the proabability of there being no one in the sample with the disease.


Homework Equations



professor states there is no closed form solution for this answer, but that we are supposed to find it via R. i, and my other ecn grad friends, can't figure out how to get an answer for either that, or excel or matlab, as we kept being told the answer doesn't exist.

The Attempt at a Solution



first, i tackled the hint and thought the probability of no one having the disease could be m choose 0 divided by n chose k, which gave me k!/n!. supposing this is right, or at least on the right path, i don't understand how knowing what the probability that no one get the disease gives me for finding the size of m needed.

i then thought about the idea that to be absolutely sure that you get a sample size large enough to get at least one person with the disease, it would need to be n-k+1. however, i don't know how to extend this idea to a probability of .9 (except multiplying it by .9, which even i know is totally wrong).

i don't really need help with the artihematic, but setting up the problem. thanks for any help you can give me.
 
Physics news on Phys.org
The number of ways of choosing m people without the disease is C(990,m), right? (Where the 'C's are binomial coefficients). The total number of ways of choosing m people is C(1000,m). What does that make the probability of choosing m people without the disease? You want that less than 0.1, correct? How large does m have to be?
 
Last edited:
that definitely helps, thank you.
 
hmm, okay, so i looked at the problem again and saw that i can't use the binomial dist, since i an NOT replacing my elements. so i tried to use the hyper geometric, such that:

{(k choose 1)(n-k choose m-1)} / (n choose m), and simplifying, got :

km(n-k)! / n!(n-k-m-1)! = .90

however, even by plugging in number for n and k, i can't get the function in terms soley of m.
 
Yes, you CAN use the binomial distribution to find the probability of choosing m people who aren't sick. Of course you aren't replacing anything. But you are only choosing once.
 
alright, so i finally did the problem by using the hyper geometric and finding how bug the sample size m should be by the very good old guess anad check standby. thanks y'all the help.
 

Similar threads

  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
Replies
1
Views
1K
  • · Replies 9 ·
Replies
9
Views
6K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
12
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K