Central Limit Theorem and Standardized Sums

Click For Summary

Discussion Overview

The discussion revolves around the Central Limit Theorem (CLT) and its application to the probability of sums of independent random variables. Participants explore the relationship between the probability of a sum falling within a certain range and the standardized form of that sum as described by the CLT.

Discussion Character

  • Exploratory, Technical explanation, Debate/contested

Main Points Raised

  • One participant expresses confusion regarding the relationship between the probability P(b > S_n > a) and the standardized sum as described by the CLT.
  • Another participant argues that the CLT does not necessarily use a standardized sum and explains that it states the sum of independent random variables will be approximately normally distributed with specific mean and standard deviation as n increases.
  • A later reply clarifies that while the two probabilities are not equal, they are approximately equal under certain conditions, and provides a method for converting the sum to a standard normal distribution.
  • Further discussion includes a clarification about the terminology used in describing the CLT, specifically regarding "all possible samples" and the correct terms for standard deviation.

Areas of Agreement / Disagreement

Participants exhibit disagreement regarding the necessity of using a standardized sum in the context of the CLT. There is no consensus on the interpretation of the theorem or the terminology used.

Contextual Notes

Participants note potential typographical errors in the terminology related to standard deviation and distribution, and there is some uncertainty about the phrasing of "all possible samples" in relation to the CLT.

caffeine
I don't think I *really* understand the Central Limit Theorem.

Suppose we have a set of n independent random variables \{X_i\} with the same distribution function, same finite mean, and same finite variance. Suppose we form the sum S_n = \sum_{i=1}^n X_i. Suppose I want to know the probability that S_n is between a and b. In other words, I want to know P(b > S_n > a).

The central limit theorem uses a standardized sum:

<br /> P\left(b &gt; \frac{ \sum_{i=1}^n X_i - n\mu}{\sqrt{n}\sigma} &gt; a\right)<br /> = \frac{1}{\sqrt{2\pi}} \int_a^b e^{-y^2/2} \, dy<br />

What is the relationship between what I want:

P(b &gt; S_n &gt; a)

and what the central limit theorem tells me about:

P\left(b &gt; \frac{ \sum_{i=1}^n X_i - n\mu}{\sqrt{n}\sigma} &gt; a\right)How can they possibly be equal? If they are equal, how is that possible? And if they're not equal, how would I get what I want?
 
Last edited by a moderator:
Physics news on Phys.org
caffeine said:
I don't think I *really* understand the Central Limit Theorem.

Suppose we have a set of n independent random variables \{X_i\} with the same distribution function, same finite mean, and same finite variance. Suppose we form the sum S_n = \sum_{i=1}^n X_i. Suppose I want to know the probability that S_n is between a and b. In other words, I want to know P(b &gt; S_n &gt; a).

The central limit theorem uses a standardized sum:

<br /> P\left(b &gt; \frac{ \sum_{i=1}^n X_i - n\mu}{\sqrt{n}\sigma} &gt; a\right)<br /> = \frac{1}{\sqrt{2\pi}} \int_a^b e^{-y^2/2} \, dy<br />
No, the Central Limit Theorem does not use a standardized sum. (At least it doesn't have to. I can't speak for whatever textbook you are using.)

Essentially, the Central Limit Theorem says that if we consider all possible samples of size n from a distribution having finite mean, \mu, and finite standard deviation, \sigma, then their sum will be approximately normally distributed with mean n\mu and standard distribution \sqrt{n}\sigma- and the larger n is the better that approximation will be.

What is the relationship between what I want:

P(b &gt; S_n &gt; a)

and what the central limit theorem tells me about:

P\left(b &gt; \frac{ \sum_{i=1}^n X_i - n\mu}{\sqrt{n}\sigma} &gt; a\right)


How can they possibly be equal? If they are equal, how is that possible? And if they're not equal, how would I get what I want?
Well, they are not equal- they are approximately equal.

Assuming that your base distribution has mean \mu and standard deviation \sigma, then Sn is approximately normally distributed with mean n\sigma and standard deviation \sqrt{n}\sigma. You then convert from that to the standard normal distribution (with mean 0 and standard deviation 1) as you probably have learned before:
z= \frac{x- \mu}{\sigma}
In particular, take x= Sn, a< S_n< b becomes a- n\mu&lt; S_n-n\mu&lt; b-n\mu and then
\frac{a-n\mu}{\sqrt{n}\sigma}&lt; \frac{S_n- n\mu}{\sqrt{n}\sigma}&lt; \frac{b- n\mu}{\sqrt{n}\sigma}

Look up those values in a standardized normal distribution table.
 
HallsofIvy said:
Essentially, the Central Limit Theorem says that if we consider all possible samples of size n from a distribution having finite mean, \mu, and finite standard deviation, \sigma, then their sum will be approximately normally distributed with mean n\mu and standard distribution \sqrt{n}\sigma- and the larger n is the better that approximation will be.

This is famous as Lindeberg-Levy CLT. I am not sure but I have a doubt about the terms "all possible samples"... essentially, for a sequence of independently and identically distributed random variables with same mean \mu and same finite sd \sigma, the quantity (sum of n variables - n\mu)/\sqrt{n}\sigma should asymptotically follow normal distribution with mean 0 and standard deviation 1(belive "distribution" is a typing mistake).
 
Last edited:
Yes, I meant "all possible samples" for a specific distribution- therefore "independently and identically distributed"

You are right that I mean "standard deviation", not "standard distribution"!
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 31 ·
2
Replies
31
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 42 ·
2
Replies
42
Views
6K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K