Discussion Overview
The discussion revolves around the probability of a 5 sigma confidence rating being incorrect, particularly in the context of statistical significance in physics experiments. Participants explore various interpretations and calculations related to this probability, referencing specific cases such as the Higgs boson detection.
Discussion Character
- Exploratory
- Technical explanation
- Debate/contested
Main Points Raised
- Some participants state that a 5 sigma confidence rating corresponds to a probability of being wrong of approximately 0.000028%, equating to 1 in 35,000, while others suggest it could be as low as 1 in 3.5 million.
- One participant calculates the probability using R, arriving at a figure of about 1 in 1.7 million, emphasizing the importance of the underlying distribution assumptions.
- Another participant highlights that the interpretation of "odds of being wrong" is complex and that for normally distributed data, about 99.99995% of data lies within 5 standard deviations of the mean.
- A quote is provided regarding physicists' caution in declaring discoveries, noting that a three-sigma confidence rating indicates a 0.13% chance of results being due to random chance, while five-sigma indicates a much lower probability.
- One participant critiques the accuracy of a quoted figure for the probability associated with 5 sigma, suggesting it may be off by two decimal places based on calculations using the pnorm function in R.
- Another participant explains the relationship between the error function and the probability of a 5 sigma result, indicating that news reports often simplify this to a "1 in 3.5 million" chance of no detection.
- Clarification is provided that the chance of being wrong in rejecting the null hypothesis differs from the chance of the null hypothesis being correct.
Areas of Agreement / Disagreement
Participants express multiple competing views regarding the probability of a 5 sigma confidence rating being wrong, with no consensus reached on a definitive figure. The complexity of the topic and differing interpretations of statistical terms contribute to the ongoing debate.
Contextual Notes
Participants note that the calculations and interpretations depend heavily on the assumptions made about the underlying statistical distributions, particularly whether a Gaussian distribution is applicable.