How are critical values in statistical tests obtained?

  • Context: High School 
  • Thread starter Thread starter Mayhem
  • Start date Start date
  • Tags Tags
    Statistical
Click For Summary

Discussion Overview

The discussion revolves around the origins and trustworthiness of critical values in statistical tests, focusing on their dependence on confidence levels and degrees of freedom. Participants explore the mathematical foundations and practical applications of these values in various fields, including physics and chemistry.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants note that critical values are derived from a chosen confidence level, with common levels being 95% or 99.9999% in particle physics.
  • It is mentioned that critical values are the inverse of the cumulative density function (CDF) for the relevant distribution.
  • Participants discuss the numerical methods used to compute CDFs and the reliance on tables for critical values.
  • One participant raises a question about explaining these concepts to practitioners in chemistry, indicating a need for practical applications of the theory.
  • A participant provides an example of constructing a probability density function (PDF) and cumulative density function (CDF) from data, questioning the terminology and process involved.
  • Another participant suggests that seeking detailed explanations from the forum may not be the most efficient way to learn statistics.

Areas of Agreement / Disagreement

There is no consensus on the complete understanding of how critical values are produced, as participants express varying levels of familiarity with the underlying concepts and methods.

Contextual Notes

Participants express uncertainty regarding the mathematical processes involved in deriving critical values and the specific numerical methods used for different distributions. There is also a lack of clarity on the terminology related to probability density functions and cumulative density functions.

Who May Find This Useful

This discussion may be of interest to students and professionals in statistics, physics, and chemistry, particularly those looking to understand the foundations of statistical testing and critical value determination.

Mayhem
Messages
425
Reaction score
317
In science, statistics are constantly used to give 'rigorous' interpretations of data sets. In this process, tests are often employed to verify a property that is being investigated. For example, normal distribution or randomness. Usually an algorithm is employed on the data set and a test statistic is obtained and compared to a critical value. This critical value is dependent on the degrees of freedom of the dat set and confidence level used, but are almost always "strange" value that must be looked up in a table. How our test statistic compares to this value can lead to very different conclusions.

Where do these critical values come from and how can we know that they can be trusted?
 
Physics news on Phys.org
There is not a hard and fast statistical proof. There is only "confidence" at a certain level. You need to start with a confidence level. Many subjects traditionally use 95% or 97.2% confidence. But some subjects, like particle physics insist on "five sigma" level, which is about 99.9999% confidence! From that point on, the translation of that, using the statistically correct degrees of freedom, leads to a direct way to interpret your parameter of interest. The final result, as a limit on that parameter, often looks like a "strange" value. But it came from the initial confidence level.
 
  • Like
Likes   Reactions: Agent Smith
FactChecker said:
There is not a hard and fast statistical proof. There is only "confidence" at a certain level. You need to start with a confidence level. Many subjects traditionally use 95% or 97.2% confidence. But some subjects, like particle physics insist on "five sigma" level, which is about 99.9999% confidence! From that point on, the translation of that, using the statistically correct degrees of freedom, leads to a direct way to interpret your parameter of interest. The final result, as a limit on that parameter, often looks like a "strange" value. But it came from the initial confidence level.
I understand that it comes from a confidence level, but how are these critical values actually produced?
 
Mayhem said:
I understand that it comes from a confidence level, but how are these critical values actually produced?
They are the inverse of the cumulative density function for the distribution in question.
 
  • Like
Likes   Reactions: FactChecker
Dale said:
They are the inverse of the cumulative density function for the distribution in question.
Furthermore, each CDF may require different numerical methods and some use special functions that were motivated by that CDF. Often, the user relies on tables are generated and published.
 
  • Like
Likes   Reactions: Agent Smith and Dale
Mayhem said:
I understand that it comes from a confidence level, but how are these critical values actually produced?
You've got some data. You assume that these data are random and from a certain distribution. This is applied mathematics so close is good enough (or better than nothing, depending). Again assuming that distribution, one may look up in a table the critical value for the confidence level you have chosen. The values in the table for that distribution were usually computed numerically i.e. with a computer.
 
Dale said:
They are the inverse of the cumulative density function for the distribution in question.
This sounds interesting. Is it something that can be explained to a chemist who is more on the practical side of statistics?
 
Mayhem said:
This sounds interesting. Is it something that can be explained to a chemist who is more on the practical side of statistics?
Are you familiar with what a probability density function is?

The cumulative density function is just the integral of the probability density function. Once you calculate that integral, then you just use it to fill a table. Inverting the cumulative density function is then just a matter of looking up a value in the table backwards.
 
  • Like
Likes   Reactions: Agent Smith and Mayhem
@Dale I know what a cumulative frequency is:
Test scoreFrequencyCumulative frequency
0 to 2022
20 to 401012
40 - 604052
60 - 801062
80 - 100567

With the frequencies and the range (max - min) we can build a probability density function (my data looks like is a normal distribution). Is this also called a probability density function (pdf)? We then use it to construct a cumulative density function (cdf). With the cdf and z scores we can construct a table, which we can use to find the % of data points below a certain z score. Correct? 🤔
 
  • #10
I suspect your strategy of asking PF to write you a statistics textbook one paragraph at a time will be inefficient and you will be better off working through such a text.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 20 ·
Replies
20
Views
4K
  • · Replies 6 ·
Replies
6
Views
4K