"Strength" of the mean of the distribution curve

Click For Summary

Discussion Overview

The discussion revolves around the statistical concepts related to the mean of distribution curves, particularly in the context of determining the viability of results and the thresholds for declaring something a "fact" in scientific terms. Participants explore questions about sample sizes, confidence intervals, and the implications of statistical significance in various scientific fields.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • Some participants inquire about the minimum number of samples required to deem a result viable, suggesting that it may depend on the data rather than a fixed number.
  • There is discussion about the significance of confidence intervals, with some suggesting that a 95% confidence level might be a threshold for considering a mean value as a "fact," while others argue that this level may be too low.
  • Participants express uncertainty about whether a 100% confidence interval is necessary for declaring something a fact, with some noting that individual measured values can be considered facts regardless of confidence levels.
  • Some contributions highlight the variability in standards across different scientific fields, particularly in high-energy physics, where higher confidence levels (e.g., 99.7% or 99.99997%) are often required for claims of new particle discoveries.
  • There is mention of the Central Limit Theorem and its relevance to sampling and confidence intervals, with an emphasis on how larger samples can lead to narrower confidence intervals.
  • Participants discuss the importance of repeatability and consistency in measurements before a value is accepted as a scientific fact, indicating that publication alone does not guarantee acceptance by the scientific community.

Areas of Agreement / Disagreement

Participants express a range of views regarding the thresholds for declaring facts based on statistical measures, indicating that there is no consensus on a specific confidence level or sample size that universally applies across all contexts.

Contextual Notes

Limitations include the dependence on definitions of "fact" and "confidence interval," as well as the variability in standards across different scientific disciplines. The discussion does not resolve the complexities surrounding the interpretation of statistical data in scientific claims.

icakeov
Messages
379
Reaction score
27
My understanding of the distribution curves is very basic but I do have a couple of somewhat generic questions. I looked up a number of definitions but have had a hard time finding these specific answers:

- Is there an agreed on minimum number of samples that one needs to take to deem a result to be viable? Or does the result more depend on the data we get?

- Additionally, is there a certain "percentile" value of how far the standard deviation is from the mean (i.e. how "far" it is from it, or how low its value is), that would conclude that the mean value is "very likely" to happen?

What I am really aiming for with these questions is:

- If in science we are to declare something a "fact", how "precise" does the distribution curve have to be in order for us to declare it a fact? Does it have to occur 100% times? Or is there some "window" of probability that we take into account and what is that?

Apologies if the questions is sloppy or if it makes no sense.
Thanks for any feedback!
 
Physics news on Phys.org
You have to decide how accurate you want your estimate of the mean to be and what probability you want that the true mean is within that range. Typically, you use the data to find a "confidence interval" where you can say that the mean is 95%, 97.5% or 99% like to be within that interval. The probabilities are assuming that you know the general form of the distribution of the thing you are measuring (like normal, poisson, etc.)
 
Perfect! Thanks FactChecker!
I am guessing that 95% and over would be the value at which the mean value starts becoming a "fact"?
Not sure if this is the right forum for this question, but is this the step to conclude something is a fact? Or do facts have to have 100% confidence interval?
 
icakeov said:
Perfect! Thanks FactChecker!
I am guessing that 95% and over would be the value at which the mean value starts becoming a "fact"?
Not sure if this is the right forum for this question, but is this the step to conclude something is a fact? Or do facts have to have 100% confidence interval?
For most things, you can never get 100% confidence. 95% is probably low because that will be wrong one out of every 20 times. There is a trade-off between the size of the confidence interval (the accuracy you want from your estimated mean) and the level of confidence that the interval contains the true mean. Different applications require different standards as far as the level of confidence and accuracy. You have to use your own judgment. An extremely conservative example (although they are not talking about a mean) is the level of confidence that a new particle has been found. For high-energy physics, they like 99.7% confidence before they say they have found "evidence of a particle" and 99.99997% confidence before they say that a new particle has been discovered.
 
Last edited:
  • Like
Likes   Reactions: icakeov
icakeov said:
Perfect! Thanks FactChecker!
I am guessing that 95% and over would be the value at which the mean value starts becoming a "fact"?
Not sure if this is the right forum for this question, but is this the step to conclude something is a fact? Or do facts have to have 100% confidence interval?
Each individual measured value is a fact. The mean of any collection of such values is also a fact. There is no lower limit, even the smallest piece of data is a fact.
 
  • Like
Likes   Reactions: icakeov
icakeov said:
What I am really aiming for with these questions is:

- If in science we are to declare something a "fact", how "precise" does the distribution curve have to be in order for us to declare it a fact? Does it have to occur 100% times?

What do you mean by "it"? In order for something to be a fact, it would at least have to be expressed as a complete sentence.

If we consider a population of things, we can can make many different statements about the population. For example:

1) All the objects in the population have weight 102.3 and measurements that show a different value than 102.3 have errors in them.

or

2) The mean value of the objects in the population is 102.3, but there are objects in the population that have a weight different than the mean value.
 
  • Like
Likes   Reactions: icakeov
Thanks for all your responses, super helpful!
Yes, I was referring specifically to the mean value when talking about "it" being a fact.
I am understanding that every value on a graph can become a "fact" and have its "confidence interval" become very high with many measurements.
 
FactChecker said:
For high-energy physics, they like 99.7% confidence before they say they have found "evidence of a particle" and 99.99997% confidence before they say that a new particle has been discovered.

Hi again, I was wondering if there is any published document that outlines the standards above that I can reference?
Many thanks again!
 
I don't know how authoritative this is. The standards of 3-sigma and 5-sigma correspond to "probabilities" of 99.7% and 99.99997%. These "probabilities" are the odds that a set of results that unusual or more unusual would not occur if there was not a new particle.
 
  • #11
This is super helpful, thank you!
 
  • #12
The Central Limit Theorem relates what you obtain through sampling and the actual population parameters. Check out too the sampling distribution of the mean which gives you a confidence interval others mentioned here. Notice the larger the sample the narrower (i.e., "more confident") the estimate.
 
  • Like
Likes   Reactions: icakeov
  • #13
icakeov said:
- If in science we are to declare something a "fact", how "precise" does the distribution curve have to be in order for us to declare it a fact?

Measurements are only part of the facts of science. A paper giving a measurement should give a confidence interval for the measurement. Saying we have 95% interval doesn't tell us much until we also know the length of interval.

A paper publishing a measurement doesn't establish a scientific fact even if it announces 95% confidence. Usually, several other researchers must repeat the measurement and get consistent results before the measured value is accepted as a fact.

The famous and customary 95% confidence is a standard many journals require before they will consider publishing a result. However, the publication of a measurement in a journal does not guarantee the "scientific community" will regard the measurement as factual based on that single publication.
 
  • Like
Likes   Reactions: icakeov
  • #14
That is SUPER helpful Stephen & WWGD, thank you so much!
 

Similar threads

  • · Replies 42 ·
2
Replies
42
Views
6K
Replies
20
Views
2K
  • · Replies 31 ·
2
Replies
31
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
4
Views
1K
  • · Replies 4 ·
Replies
4
Views
3K