How Does the Central Limit Theorem Estimate Error in Area Calculations?

  • Context: Graduate 
  • Thread starter Thread starter nolita_day
  • Start date Start date
  • Tags Tags
    Area Estimation
Click For Summary
SUMMARY

The discussion focuses on using the Central Limit Theorem (CLT) to estimate the error in area calculations for a random point distributed uniformly over a unit square. The user defines a random variable Y based on whether a point lies inside object A, approximating A with the formula A≈(1/n)∑Y_i. To gauge the probable size of the error, the user seeks to determine the necessary sample size n such that P(|\hat{A} - A| < .01) ≈ .99, where A is given as 0.2. The discussion highlights the relationship between the variance of the estimator \hat{A} and the true area A, emphasizing the need for clarity in distinguishing between \hat{A} and A.

PREREQUISITES
  • Understanding of the Central Limit Theorem (CLT)
  • Familiarity with Bernoulli distributions
  • Knowledge of variance and standard deviation calculations
  • Basic concepts of probability theory
NEXT STEPS
  • Study the application of the Central Limit Theorem in estimating population parameters
  • Learn about Bernoulli distribution and its properties
  • Explore variance and standard deviation in the context of sample means
  • Investigate confidence intervals and their relation to error estimation
USEFUL FOR

Statisticians, data scientists, and anyone involved in probabilistic modeling or error estimation in statistical analysis will benefit from this discussion.

nolita_day
Messages
3
Reaction score
0
Your help/input is much appreciated! Thanks in advance! :smile:

Object A is in the unit square 0<x<1, 0<y<1. Consider a random point distributed uniformly over the square; let Y=1 if the point lies inside A and Y=0 otherwise. How could A be estimated from a sequence of n individual points unif. distr. on the square?

I got this part just taking Y as ~Bernoulli(p=A). Let Y_{i} = f(X_{i}), then A≈\frac{1}{n}\sum{Y_{i}}

How could the CLT be used to gauge the probable size of the error of the estimate? Denoting the estimate by \hat{A}, if A=0.2, how large should n be so that P(|\hat{A} - A| &lt; .01) ≈ .99?

I can't figure out this one. Am I supposed to take \hat{A} as the random variable or |\hat{A}-A| as the RV? What is the standard deviation of \hat{A} vs. |\hat{A}-A|? My attempt below is only for the first part of the question in the previous paragraph.

Taking \hat{A} as the RV, I started out like this...

E[\dfrac{Y_{1}+...+Y_{n}}{n}]=\hat{A}
Then Var(\hat{A})=Var( E[\dfrac{Y_{1}+...+Y_{n}}{n}] )=\frac{1}{n^{2}}Var( nE[Y_{i}] ) = Var( E[Y_{i}] ) = Var(A) (since Yi ~ Bernoulli(A) = A)

To estimate A based on \hat{A}, pick a small error and a high probability of getting that small error and you could use CLT as follows:

P(|\hat{A} - A| &lt; .01) = .99
P(-.01 &lt; \hat{A} - A &lt; .01) = .99
P(\dfrac{-.01}{σ_{\hat{A}}} &lt; \dfrac{\hat{A} - A}{σ_{\hat{A}}} &lt; \dfrac{.01}{σ_{\hat{A}}}) = .99
P(\dfrac{-.01}{\sqrt{A}} &lt; Z &lt; \dfrac{.01}{\sqrt{A}}) = .99
\Phi(\dfrac{.01}{\sqrt{A}}) - \Phi(\dfrac{-.01}{\sqrt{A}}) = .99
2\Phi(\dfrac{.01}{\sqrt{A}}) - 1) = .99
\dfrac{.01}{\sqrt{A}}= 2.58
A = .000015023?

I'm definitely sure this is wrong... how the heck did I come up with an actual number for A?
 
Physics news on Phys.org
This is a bit hard to follow because you confuse A with A-hat.
\hat{A} = \sum Y_i/n, A = E(\hat{A}).
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
8K
Replies
1
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
6
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
3K
Replies
6
Views
1K