Does the Central Limit Theorem Mean Sample Means Form a Normal Distribution?

Click For Summary

Discussion Overview

The discussion revolves around the Central Limit Theorem (CLT) and its implications regarding the distribution of sample means. Participants explore the meaning of the theorem, its application to sampling, and the behavior of sample means in relation to population means. The conversation includes conceptual clarifications and technical reasoning about probability distributions and sample sizes.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • Some participants express confusion about the phrase "the mean is normally approximated," questioning its implications regarding the relationship between sample means and population means.
  • One participant emphasizes the importance of understanding the precise mathematical statement of the CLT rather than relying on popular summaries.
  • Another participant explains that the mean of an independent random sample has an approximately normal distribution, with the standard deviation decreasing as sample size increases.
  • There is a discussion about how plotting the means of samples of increasing size leads to a smoother and more normal-looking distribution compared to plotting individual sample values.
  • One participant raises a question about the terminology, seeking clarification on how a single sample can have a distribution when it has only one mean.
  • Another participant discusses the effect of averaging multiple independent samples, noting that extreme values are less likely to skew the mean when larger sample sizes are used.
  • There is mention of the potential misunderstanding that grouping samples into larger batches can lead to greater certainty about the mean, which is challenged by the nature of the CLT.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the interpretation of the Central Limit Theorem and its implications. Multiple competing views and uncertainties remain regarding the terminology and the application of the theorem to sampling distributions.

Contextual Notes

Participants highlight limitations in understanding the CLT, including confusion over terminology and the nature of distributions derived from sample means versus individual sample values. There is also an acknowledgment of the dependence on sample size and the implications for the behavior of sample means.

ampakine
Messages
60
Reaction score
0
I keep reading explanations that say things like "the mean is normally approximated" but I don't know what that means. Are they saying that if you take a load of samples then plot the means of every one of those samples on a graph that the mean of that graph will be approximately the population mean of the population that you took the samples from? For example let's say I want to know the probability that I can catapult a gypsy 15 metres or more so once a year I go around the world randomly picking out gypsies and seeing how far I can catapult them. The population is how far I can catapult every gypsy in the world but on my yearly I only catapult about 20 gypsies. Does the central limit theorem say that if I take the average catapult distances obtained from each of these yearly gypsy catapulting expeditions and plot them on a graph that this graph will keep becoming more and more normal every year as I add a new mean to it? Is that the idea or have I got it wrong?
 
Physics news on Phys.org
ampakine said:
I keep reading explanations that say things like "the mean is normally approximated" but I don't know what that means.

You should read and ask questions about the precise mathematical statement of the theorem, not about popularized summaries of it. Then people will offer you popularized summaries of it as explanations and you can demand to know what they are talking about.

Are they saying that if you take a load of samples then plot the means of every one of those samples on a graph that the mean of that graph will be approximately the population mean of the population that you took the samples from?

Theorems about probability rarely say anything definite about actual outcomes, approximate or otherwise. Instead they talk about the probabilities of outcomes and approximations to probability distributions. The mean of an independent random sample of N things from a population has an approximately normal distribution. The standard deviation of this approximately normal distribution gets smaller as N increases. So it would be correct conclusion that the mean of a large sample is "probably" close to the mean of the population. The Central Limit Theorem implies this, but it wouldn't be correct to say that they Central Limit Theorem "is" that statement.

For example let's say I want to know the probability that I can catapult a gypsy 15 metres or more so once a year I go around the world randomly picking out gypsies and seeing how far I can catapult them. The population is how far I can catapult every gypsy in the world but on my yearly I only catapult about 20 gypsies. Does the central limit theorem say that if I take the average catapult distances obtained from each of these yearly gypsy catapulting expeditions and plot them on a graph that this graph will keep becoming more and more normal every year as I add a new mean to it? Is that the idea or have I got it wrong?

Suppose you have a random variable whose distribution is not normal. Imagine one whose density is shaped like an isoceles triangle. If you take samples of size 1 and plot them as a histogram, the it is probable that your plot will begin to look like an isoceles triangle as you take more and more samples.

Suppose you take samples of size 10 and histogram the mean of those samples (not the value of each of the 10 individually, only the mean of all 10). You do this for many samples of size 10. Then it is probable that your plot won't look like an isoceles triangle. It will be smoother.

If you take samples of size 1000 and histogram their means, the graph will probably be even smoother and more "normal" looking.
 
Stephen Tashi said:
The mean of an independent random sample of N things from a population has an approximately normal distribution.

I think its the terminology that's confusing me. You say that the mean of a sample has an approximately normal distribution. When I think of a sample I think of 1 sample which can have only 1 mean so the idea of a distribution doesn't make sense. Do you mean that if I was to take multiple samples from a population then get the mean of each of these samples then plot them on a graph that I'd have a distribution for the mean?
 
If you only look at the mean of, say, 100 independently chosen values of a random variable, you increase the probabilities that extreme values in the sample will "cancel out". If you only plot the means of such "batched" samples, they have a smaller probability of taking on extreme values than if you plot the values of single samples. (For example,of X is a 0-or-1 random variable, the mean of a sample of 100 X's might be 100/100 = 1, but that is less likely than observing a single example of X = 1. Furthermore a histogram of single observations, can't look very normal since it has only two bars on it for X = 0 or X =1 , while the histogram of a sample mean has bars for values like 85/100, 10/100 etc.)


If you had 10,000 independent random samples, you could group them into mutually exclusive batches of 10 or 100 or 1000 etc. The Central Limit Theorem doesn't say that you can "cheat the Devil" by doing this and gain greater and greater certainty about the mean value. (The number of samples N is decreasing as you group observations into larger and larger batches.) If you take the mean of the 10,000 observations and plot it as a single point then, yes, the Central Limit Theorem says you have plotted 1 observation from an approximately normal distribution and it tells you about the standard deviation of that distribution.
 
Last edited:

Similar threads

  • · Replies 31 ·
2
Replies
31
Views
4K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 7 ·
Replies
7
Views
8K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 4 ·
Replies
4
Views
6K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 15 ·
Replies
15
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K