Qualitative Stats question (central limit theorem)

Click For Summary
SUMMARY

The discussion centers on the Central Limit Theorem (CLT) and its implications for sampling distributions. Specifically, it concludes that statements II and III are valid consequences of the CLT, while I, IV, and V are not applicable due to sample size and proportion considerations. The discussion highlights that for a sample to yield an approximately normal distribution, the sample size and the product of sample size and proportion (Np) must meet certain thresholds, specifically Np > 10. This ensures that the approximation to the normal distribution is reliable.

PREREQUISITES
  • Understanding of Central Limit Theorem (CLT)
  • Knowledge of Simple Random Sampling (SRS)
  • Familiarity with Binomial Distribution
  • Concept of sample size and its impact on distribution
NEXT STEPS
  • Study the implications of Central Limit Theorem in statistical analysis
  • Learn about the conditions for normal approximation in Binomial distributions
  • Explore statistical sampling techniques and their applications
  • Investigate the significance of sample size in inferential statistics
USEFUL FOR

Statisticians, data analysts, students of statistics, and anyone involved in research requiring an understanding of sampling distributions and the Central Limit Theorem.

habman_6
Messages
16
Reaction score
0
3) Which of the following are consequences of the Central Limit Theorem?
I) A SRS of resale house prices for 100 randomly selected transactions from all sale transactions in 2001 (in Toronto) will be obtained. Since the sample is large, we should expect the histogram for the sample to be nearly normal.
II) We will draw a SRS (simple random sample) of 100 students from all University of Toronto students, and measure each person’s cholesterol level. The average cholesterol level for the sample should be approximately normally distributed.
III) We want to estimate the proportion of Ontario voters who intend to vote for the Liberal party in the next election, and decide to draw a SRS of 400 voters. The percentage of the people in the sample who will say that they intend to vote Liberal is approximately normally distributed.
IV) We will draw a SRS of 100 adults from the Canadian military, and count the number who have the AIDS virus. The number of individuals in the sample who will be found to have the AIDS virus should be approximately normally distributed.
V) We are interested in the average income for all Canadian families for 2001. The mean income for all Canadian families should be approximately normal, due to the large number of families in the population.


The answer is II and III. I understand why I is wrong, and I understand why V is wrong. However, to me, IV seems exactly the same as III. Apparently its because the proportion is too low, but that does not make sense to me in terms of CLT. What should it matter what the proportion is, as long as the sample means have that same proportion?
 
Physics news on Phys.org
If the sample size or the proportion is too small the approximation is too poor to be useful. As an example, consider the Binomial distribution(N,p). The CLT applies to this case. The rule of thumb is that the normal approximation to the Binomial is poor unless Np > 10.

http://www.stat.yale.edu/Courses/1997-98/101/binom.htm

To convince yourself, look at the pdf of some binomial distributions that don't satisfy Np > 10, e.g. N = 100, p = 0.01.
 
Thanks Aleph, I understand now.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 31 ·
2
Replies
31
Views
4K
Replies
1
Views
2K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 7 ·
Replies
7
Views
8K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K