What are the chances of a 5 sigma confidence rating being wrong?

  • Context: Graduate 
  • Thread starter Thread starter g.lemaitre
  • Start date Start date
  • Tags Tags
    Rating Sigma
Click For Summary

Discussion Overview

The discussion revolves around the probability of a 5 sigma confidence rating being incorrect, particularly in the context of statistical significance in physics experiments. Participants explore various interpretations and calculations related to this probability, referencing specific cases such as the Higgs boson detection.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants state that a 5 sigma confidence rating corresponds to a probability of being wrong of approximately 0.000028%, equating to 1 in 35,000, while others suggest it could be as low as 1 in 3.5 million.
  • One participant calculates the probability using R, arriving at a figure of about 1 in 1.7 million, emphasizing the importance of the underlying distribution assumptions.
  • Another participant highlights that the interpretation of "odds of being wrong" is complex and that for normally distributed data, about 99.99995% of data lies within 5 standard deviations of the mean.
  • A quote is provided regarding physicists' caution in declaring discoveries, noting that a three-sigma confidence rating indicates a 0.13% chance of results being due to random chance, while five-sigma indicates a much lower probability.
  • One participant critiques the accuracy of a quoted figure for the probability associated with 5 sigma, suggesting it may be off by two decimal places based on calculations using the pnorm function in R.
  • Another participant explains the relationship between the error function and the probability of a 5 sigma result, indicating that news reports often simplify this to a "1 in 3.5 million" chance of no detection.
  • Clarification is provided that the chance of being wrong in rejecting the null hypothesis differs from the chance of the null hypothesis being correct.

Areas of Agreement / Disagreement

Participants express multiple competing views regarding the probability of a 5 sigma confidence rating being wrong, with no consensus reached on a definitive figure. The complexity of the topic and differing interpretations of statistical terms contribute to the ongoing debate.

Contextual Notes

Participants note that the calculations and interpretations depend heavily on the assumptions made about the underlying statistical distributions, particularly whether a Gaussian distribution is applicable.

g.lemaitre
Messages
267
Reaction score
2
I'm trying to figure out what the odds of 5 sigma confidence rating has of being wrong. According to one website it is
0.000028% which is 1 in 35,000 but I've seen so many divergent answers as to what the odds of 5 sigma being wrong are that I want to be sure. I've seen people say it is as high as 1 in 3 million or as low as 1 in 700
 
Physics news on Phys.org
g.lemaitre said:
I'm trying to figure out what the odds of 5 sigma confidence rating has of being wrong. According to one website it is
0.000028% which is 1 in 35,000 but I've seen so many divergent answers as to what the odds of 5 sigma being wrong are that I want to be sure. I've seen people say it is as high as 1 in 3 million or as low as 1 in 700

Hey g.lemaitre and welcome to the forums.

For this problem, I'm assuming you have a standard normal and wish to figure out the probability of being greater than 5 standard deviations outside of the mean.

If you are using different distributions, different assumptions, or you have a specific problem then please inform the rest of the readers here so that we can give you better advice.

Using R, I got the answer to be 2 * 2.866515718791939118515e-07 = 5.733031437583878237117e-07 = 0.000000573303.. which is really small. Taking the inverse of this gives us: 1744277.89 which equates roughly to a 1 in 1744278 chance or say a 1 in 1.7 million chance.

If you only considered one tail it would be just under a 1 in 3.5 million chance.

The thing is though that this is misleading if you don't provide more information, and this assumes that the distribution relating to what you are measuring has a Gaussian distribution. If it doesn't, or if you need to use another model, then this assumption will be wrong.

To get the calculation in R I used pnorm(-5.0,0,1) and multiplied that by 2 to get final probability (because of symmetry).
 
You're not being very precise here (e.g. "odds of being wrong" is more complex than you realize, as is sigma); the best I can tell you is that, for normally distributed data, approximately 99.99995% of the data lie within 5 standard deviations of the mean.
 
let me give an exact quote

Physicists are doing everything they can to rule out the likelihood of random chance leading them down the wrong path.

Analysis of the LHC data available in December last year, which pointed to a possible Higgs boson, was deemed to be inconclusive because chance could not be sufficiently ruled out as a factor affecting the result.

The analysis had a “three-sigma” confidence rating - meaning the result has a 0.13% possibility of being due to chance.

Physicists generally wait for a “five-sigma” confidence rating – 0.000028% possibility of a chance result – before they declare a discovery.
 
g.lemaitre said:
let me give an exact quote

This quote looks like it assumes a normal distribution and refers to the quantities P(Z < z) where z = -3 and -5 respectively, but the figure for -5 is off by two decimal places according to R with the pnorm function, if they are assuming a Gaussian distribution.

What this means is that there is a cutoff value for the probability and they are saying that if goes below some cutoff or above some cutoff, then it is considered more than -3 or -5 standard deviations in the respective direction.

Can you point the readers to the article?
 
It looks like you're trying to make sense of the numbers being thrown around regarding the Higgs boson, so I'll just throw out a summary.

For starters, the integral of a normal distribution from -n*sigma to n*sigma is equal to erf(n/sqrt(2)), where n is any real number and erf is the error function. We can therefore say that a 5-sigma result has a probability of erf(5/sqrt(2)) = 0.9999994 (i.e. 99.99994 %).

Now, a lot of the news reports are saying that this indicates that there is a "1 in 3.5 million" chance that there was no Higgs detection. This number is equal to
0.5 - erf(5/sqrt(2))/2.

Why do they divide by 2? Because they are looking for "bumps" above a "noise" level. In other words, they are only considering one side of the distribution. The 0.5 is just the integral over half of the normal distribution.

But you want to know about "odds." By definition, odds = P(failure)/P(success) where P means "probability of." Therefore, the odds of the Higgs result being a fluke is
[1 - erf(5/sqrt(2))] / erf(5/sqrt(2)) = "1 to 1.75 million."
 
g.lemaitre said:
I'm trying to figure out what the odds of 5 sigma confidence rating has of being wrong. According to one website it is
0.000028% which is 1 in 35,000
No, you've missed the '%'. It's 1 in 3.5 million.
Btw, this is not the chance of being wrong in rejecting the null hypothesis. It is the chance that the observed data was merely by chance, i.e. the chance of the data being thus given the null hypothesis. This is not the same thing as the chance that the null hypothesis is correct.
 

Similar threads

  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 1 ·
Replies
1
Views
9K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 126 ·
5
Replies
126
Views
9K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 9 ·
Replies
9
Views
7K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
5
Views
2K