Handling Random Uncertainties: Best Practices for Niles

Niles · Sep 17, 2010

Hi

In http://sl-proj-bi-specification.web.cern.ch/sl-proj-bi-specification/Activities/Glossary/techglos.pdf it says that: ... if the sources of uncertainties are numerous, the Gaussian distribution is generally a good approximation.

I don't quite understand why. The Central Limit Theorem (CLT) only says that if we have a sum S of N random variables, then S will be Gaussian for very large N. So the CLT does not explain the above. In that case, where does the statement come from?Niles.

statdad · Sep 17, 2010

The general interpretation is that the effects of those uncertainties are additive - that's where the CLT comes in.

Niles · Sep 17, 2010

But that would only explain why the errors are Gaussian, not why the measured variable is Gaussian.

statdad · Sep 17, 2010

If the problem is a location problem, the "model" can be described as

Variable = Mean value + Random error

with "Mean value" a constant. Since the random error is Gaussian, so is the variable.

Niles · Sep 17, 2010

statdad said:

If the problem is a location problem, the "model" can be described as ...

I am not sure I understand what you mean by "location problem". In my case we are talking about measured speeds.

statdad · Sep 17, 2010

A "location problem" simply means you are trying to determine the mean value of a variable. A mean is one type of measure of location, or measure of center.

I don't know exactly what type of problem you're involved in: my posts above were

1) to show how the Gaussian distribution arises from the "many sources of uncertainty"
2) to show one way in which a measured random quantity can be assumed to have a gaussian distribution

Niles · Sep 17, 2010

Ok, I understand. In post #2 and #4 you use "error" and "uncertainty" interchangeably. Does

Variable = Mean value + Uncertainty

also hold for a location problem?Niles.

statdad · Sep 17, 2010

I guess - the main idea is that the variable is a constant value + some unmeasurable random behavior, which is often modeled by a normal (Gaussian) distribution.

An approach slightly more general than the location model is given by

Variable = Model + Error

where ``Model'' is some deterministic (non-random) expression. Consider multiple regression:

[tex] Y = \underbrace{\beta_0 + \beta_1 x_1 + \dots + \beta_p x_p}_{\text{Model}} + \varepsilon[/tex]

with [itex]\varepsilon[/itex] is the random error component

Niles · Sep 17, 2010

Thanks, it was very kind of you.

Best wishes,
Niles.

Handling Random Uncertainties: Best Practices for Niles

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

Undergrad My basic understanding of set theory

Undergrad How do E[X] and E[|X|] relate?

Graduate Expected numbers of cards of a last color remaining

Undergrad The problem of points

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight