The Normal Distribution - Random Errors

In summary, by making a new measurement, the probability of the obtained measured value falling within a 95% confidence interval for the mean is 0.95. However, the concept of a confidence interval can have different interpretations, leading to different ways of calculating and interpreting probabilities.
  • #1
RaduAndrei
114
1
So let's say I do some measurements and obtain a set of measured values. The measurement is characterized by random errors so by making enough measurements, they approach a normal distribution.
In other words, my set of measured values can be approximated by a normal distribution characterized by a mean and standard deviation calculated from the set.
Thus, I have some 95% confidence interval for one standard deviation.

Is it correct to say this: 'By making a new measurement, the probability of the obtained measured value to fall within the 95% confidence interval is 95%.' ?
 
Physics news on Phys.org
  • #2
RaduAndrei said:
Is it correct to say this: 'By making a new measurement, the probability of the obtained measured value to fall within the 95% confidence interval is 95%.' ?

That is an ambiguous statement because "confidence interval" has at least two different definitions.

The definition of "confidence interval" is usually taken to mean an interval of specified span (e.g. plus or minus 23.5) about a fixed but unknown population parameter such as the mean ##\mu##. In that interpretation, it is correct to say (for a 95% confidence interval for the mean) that a new measurement has a probability of 0.95 of falling within that confidence interval. However , this type of confidence interval has no specific numerical endpoints. For example, the confidence interval ##(\mu -23.5, \mu + 23.5)## does not have specific numerical endpoints because the specific value of ##\mu## is unknown.

In common speech, a different meaning of "confidence interval" is that it signifies an interval of a specific span (e.g. plus or minus 23.5) about an estimated population parameter such as an estimated value 120.8 for the mean of a distribution. If the population parameter is assumed to have a "fixed but unknown" value, it is incorrect to say that there is a 95% probability that a new measurement will be within such a confidence interval. (e.g. It is incorrect to say that there is a 95% chance that a new measurement will be in the interval ( 120.8 - 23.5, 120.8 + 23.5). )

( As a tactic of persuasion, it is common to present intervals by drawing "error bars" around estimated population values, so that these figures indicated specific intervals like (120.8 - 23.5, 120.8 + 23.5). This invites the audience to make the misinterpretation that there is some definite probability that new measurements will be in such a specific interval. )

If we use Bayesian statistics and assume the unknown parameter (e.g. the population mean) has various probabilities of having various values instead having a "fixed, but unknown" value then we can compute the probability that new measurements are within specific intervals about the estimated value of the parameter. Such intervals are called "credible intervals" or "Bayesian confidence intervals".
 

1. What is the normal distribution?

The normal distribution is a statistical concept that describes the probability distribution of a set of data. It is a bell-shaped curve that is symmetrical and follows a specific pattern.

2. How is the normal distribution related to random errors?

The normal distribution is commonly used to represent random errors in scientific experiments and measurements. Random errors are unpredictable and can occur due to various factors, such as human error, equipment malfunction, or environmental conditions. The normal distribution helps to understand and measure the frequency and magnitude of these errors.

3. What is the role of standard deviation in the normal distribution?

Standard deviation is a measure of how spread out the data is in a normal distribution. It represents the average distance of data points from the mean. A smaller standard deviation indicates that the data points are closer to the mean, while a larger standard deviation indicates a wider spread of data points.

4. Can the normal distribution be applied to all types of data?

The normal distribution is commonly used for continuous data, meaning data that can take on any value within a range. It is not suitable for discrete data, which can only take on specific values. However, some types of discrete data, such as counts and measurements, can be approximated by the normal distribution if the sample size is large enough.

5. How can the normal distribution be used in hypothesis testing?

In hypothesis testing, the normal distribution is often used to determine the probability of obtaining a specific result. By calculating the z-score, which measures how many standard deviations a data point is from the mean, we can determine the likelihood of obtaining a result due to chance alone. This is useful in determining the significance of experimental results and making conclusions about the underlying population.

Similar threads

  • Other Physics Topics
Replies
5
Views
1K
  • Other Physics Topics
Replies
3
Views
6K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
485
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
21
Views
2K
  • General Math
Replies
2
Views
804
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
9
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
996
Back
Top