De Moivre-Laplace theorem help

  • Thread starter Thread starter 2sin54
  • Start date Start date
  • Tags Tags
    Theorem
Click For Summary
SUMMARY

The discussion centers on calculating the probability that more voters for party B than party A will be selected from a random sample of 20,000 voters, given that 5,500,000 out of 10,000,000 total voters support party A. Using the de Moivre-Laplace theorem, the probability is expressed as P = Θ(λ2) - Θ(λ1), with λ1 calculated to be approximately 14.23. The resulting probability is extremely small, approximately 0.2999 × 10-45, indicating that such an outcome is highly unlikely. The discussion highlights the use of normal approximation techniques for large sample sizes in probability theory.

PREREQUISITES
  • Understanding of the de Moivre-Laplace theorem
  • Familiarity with probability distributions, specifically normal distribution
  • Basic knowledge of statistical notation and calculations
  • Experience with asymptotic analysis in probability theory
NEXT STEPS
  • Study the Central Limit Theorem and its applications in probability
  • Learn about normal approximation techniques for binomial distributions
  • Explore advanced statistical methods for calculating probabilities in large samples
  • Investigate the implications of extreme values in probability distributions
USEFUL FOR

Students and professionals in statistics, data science, and mathematics who are interested in probability theory, particularly those dealing with large sample sizes and normal approximations.

2sin54
Messages
109
Reaction score
1

Homework Statement


There are 10 000 000 people who are going to vote. 5 500 000 will vote for the party A, the rest will vote for the party B.
20 000 voters are randomly chosen. What is the probability that there will be more party B voters than there will be party A voters (sorry for poor translation).

2. The attempt at a solution

P(party A voter) = 0.55
P(party B voter) = 0.45

There have to be at least 10 001 party B voters for the condition to be satisfied. This number can range from 10 001 to 20 000. Then, according to de Moivre-Laplace theorem the probability which I need to find:

P = Θ(λ2) - Θ(λ1) ,

where λ1 = (10 001 - 20 000*0.45)/√(20 000*0.45*(1-0.45))

Unfortunately, λ1 ≈ 14.23. λ2 is even bigger. Both, when put under the Θ function asymptotically approximate to 1. Do I have to use a different approach here?
 
Physics news on Phys.org
You should expect the probability to be extremely small. The probability distribution for the number of B voters is approximately normal with mean ##9000## and standard deviation ##\sigma = \sqrt{npq} \approx 70.4##. The probability of exceeding ##3\sigma## above the mean is already only about one in a thousand, and you are looking at the probability that it exceeds around ##14\sigma## above the mean. So I don't necessarily think you are doing anything wrong, but the answer is going to be tiny.
 
Last edited:
Thank you for the clarification. I guess it is just a bit counter-intuitive to me.
 
Gytax said:
Thank you for the clarification. I guess it is just a bit counter-intuitive to me.
To get some intuition, consider flipping a coin 20000 times. Surely if the coin is fair (##p = q = 0.5##), it should be extremely unlikely that you would see, say, 11000 heads or more. Probably the only surprise is exactly how tiny the probability is, but you would expect it to be quite small. Your problem is similar, but with slightly different ##p## and ##q##.
 
Gytax said:

Homework Statement


There are 10 000 000 people who are going to vote. 5 500 000 will vote for the party A, the rest will vote for the party B.
20 000 voters are randomly chosen. What is the probability that there will be more party B voters than there will be party A voters (sorry for poor translation).

2. The attempt at a solution

P(party A voter) = 0.55
P(party B voter) = 0.45

There have to be at least 10 001 party B voters for the condition to be satisfied. This number can range from 10 001 to 20 000. Then, according to de Moivre-Laplace theorem the probability which I need to find:

P = Θ(λ2) - Θ(λ1) ,

where λ1 = (10 001 - 20 000*0.45)/√(20 000*0.45*(1-0.45))

Unfortunately, λ1 ≈ 14.23. λ2 is even bigger. Both, when put under the Θ function asymptotically approximate to 1. Do I have to use a different approach here?

If you want an actual numerical approximation, use the fact that for the standard normal pdf
\phi(z) = \frac{1}{\sqrt{2 \pi}} e^{-z^2/2}
we have, for large ##z > 0##:
\Phi_{>}(z) \equiv \int_{z}^{\infty} \phi(t) \, dt \sim \frac{1}{z} \phi(z).
This follows from integration by parts: in ##\int \phi(t) \, dt,## take ##u = 1/t,
: dv = t \phi(t) \, dt = -d (\phi(t))##. Thus
\int_z^{\infty} \phi(t) \, dt = \phi(z)/z - \int_z^{\infty} \frac{1}{t^2} \phi(t) \, dt,
and for large ##z > 0## we can drop the second term (its magnitude is ##< \phi(z)/z^2##). If you want better accuracy, get another term using integration by parts again, etc.

Anyway, for ##z = 14.23## we have ##\Phi_{>}(z) \approx 0.2999 \times 10^{-45}##.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
3K
Replies
2
Views
1K
  • · Replies 4 ·
Replies
4
Views
8K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 15 ·
Replies
15
Views
4K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 4 ·
Replies
4
Views
5K
Replies
4
Views
5K