How are critical values in statistical tests obtained?

  • Context: High School 
  • Thread starter Thread starter Mayhem
  • Start date Start date
  • Tags Tags
    Statistical
Click For Summary
SUMMARY

Critical values in statistical tests are derived from the cumulative density function (CDF) of the distribution in question, which is the integral of the probability density function (PDF). These values are essential for interpreting test statistics against chosen confidence levels, typically 95% or 97.2%, with fields like particle physics requiring a five sigma level (99.9999% confidence). The critical values are often looked up in tables that are generated using numerical methods, reflecting the degrees of freedom associated with the data set. Understanding these concepts is crucial for accurate statistical analysis and interpretation.

PREREQUISITES
  • Understanding of cumulative density functions (CDF)
  • Familiarity with probability density functions (PDF)
  • Knowledge of statistical confidence levels
  • Basic skills in numerical methods for statistical calculations
NEXT STEPS
  • Research the derivation of cumulative density functions for various distributions
  • Learn about numerical methods used to compute critical values
  • Explore the significance of different confidence levels in statistical testing
  • Study the application of inverse CDF in statistical analysis
USEFUL FOR

Statisticians, data analysts, researchers in scientific fields, and anyone involved in statistical testing and interpretation of data sets.

Mayhem
Messages
425
Reaction score
317
In science, statistics are constantly used to give 'rigorous' interpretations of data sets. In this process, tests are often employed to verify a property that is being investigated. For example, normal distribution or randomness. Usually an algorithm is employed on the data set and a test statistic is obtained and compared to a critical value. This critical value is dependent on the degrees of freedom of the dat set and confidence level used, but are almost always "strange" value that must be looked up in a table. How our test statistic compares to this value can lead to very different conclusions.

Where do these critical values come from and how can we know that they can be trusted?
 
Physics news on Phys.org
There is not a hard and fast statistical proof. There is only "confidence" at a certain level. You need to start with a confidence level. Many subjects traditionally use 95% or 97.2% confidence. But some subjects, like particle physics insist on "five sigma" level, which is about 99.9999% confidence! From that point on, the translation of that, using the statistically correct degrees of freedom, leads to a direct way to interpret your parameter of interest. The final result, as a limit on that parameter, often looks like a "strange" value. But it came from the initial confidence level.
 
  • Like
Likes   Reactions: Agent Smith
FactChecker said:
There is not a hard and fast statistical proof. There is only "confidence" at a certain level. You need to start with a confidence level. Many subjects traditionally use 95% or 97.2% confidence. But some subjects, like particle physics insist on "five sigma" level, which is about 99.9999% confidence! From that point on, the translation of that, using the statistically correct degrees of freedom, leads to a direct way to interpret your parameter of interest. The final result, as a limit on that parameter, often looks like a "strange" value. But it came from the initial confidence level.
I understand that it comes from a confidence level, but how are these critical values actually produced?
 
Mayhem said:
I understand that it comes from a confidence level, but how are these critical values actually produced?
They are the inverse of the cumulative density function for the distribution in question.
 
  • Like
Likes   Reactions: FactChecker
Dale said:
They are the inverse of the cumulative density function for the distribution in question.
Furthermore, each CDF may require different numerical methods and some use special functions that were motivated by that CDF. Often, the user relies on tables are generated and published.
 
  • Like
Likes   Reactions: Agent Smith and Dale
Mayhem said:
I understand that it comes from a confidence level, but how are these critical values actually produced?
You've got some data. You assume that these data are random and from a certain distribution. This is applied mathematics so close is good enough (or better than nothing, depending). Again assuming that distribution, one may look up in a table the critical value for the confidence level you have chosen. The values in the table for that distribution were usually computed numerically i.e. with a computer.
 
Dale said:
They are the inverse of the cumulative density function for the distribution in question.
This sounds interesting. Is it something that can be explained to a chemist who is more on the practical side of statistics?
 
Mayhem said:
This sounds interesting. Is it something that can be explained to a chemist who is more on the practical side of statistics?
Are you familiar with what a probability density function is?

The cumulative density function is just the integral of the probability density function. Once you calculate that integral, then you just use it to fill a table. Inverting the cumulative density function is then just a matter of looking up a value in the table backwards.
 
  • Like
Likes   Reactions: Agent Smith and Mayhem
@Dale I know what a cumulative frequency is:
Test scoreFrequencyCumulative frequency
0 to 2022
20 to 401012
40 - 604052
60 - 801062
80 - 100567

With the frequencies and the range (max - min) we can build a probability density function (my data looks like is a normal distribution). Is this also called a probability density function (pdf)? We then use it to construct a cumulative density function (cdf). With the cdf and z scores we can construct a table, which we can use to find the % of data points below a certain z score. Correct? 🤔
 
  • #10
I suspect your strategy of asking PF to write you a statistics textbook one paragraph at a time will be inefficient and you will be better off working through such a text.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 20 ·
Replies
20
Views
4K
  • · Replies 6 ·
Replies
6
Views
4K