How to get the typical value or typical data from the dataset

  • Context: High School 
  • Thread starter Thread starter Pengtaofu
  • Start date Start date
  • Tags Tags
    Data Value
Click For Summary

Discussion Overview

The discussion revolves around the concept of "typical value" in datasets, particularly in the context of statistical measures such as median, mean, and mode. Participants explore how to determine a typical value from experimental data and the implications of using different statistical measures. The conversation includes theoretical calculations, comparisons with measured datasets, and the subjective nature of statistical interpretation.

Discussion Character

  • Exploratory
  • Debate/contested
  • Technical explanation

Main Points Raised

  • Some participants question whether "typical value" is a scientifically accepted term, suggesting it may not have a clear definition in statistics.
  • There is a discussion about the differences between mean, median, and mode, with some participants emphasizing that these measures can yield different insights about a dataset.
  • One participant mentions the χ^2 value as a potential measure to assess the fit between theoretical models and measured values, highlighting the importance of considering statistical error.
  • Concerns are raised about the subjective nature of applying statistics, with references to traditions in specific fields influencing the choice of methods.
  • Participants note that the problem of determining the "most typical" value can be framed mathematically, but this requires specific assumptions that may vary among practitioners.
  • There is acknowledgment of the influence of external factors on measured values, suggesting that further explanation of the underlying physics may be necessary.

Areas of Agreement / Disagreement

Participants generally do not agree on a single definition of "typical value," and multiple competing views remain regarding the best approach to determine it from a dataset. The discussion reflects uncertainty and differing opinions on the application of statistical measures.

Contextual Notes

Limitations include the lack of consensus on the definition of "typical value," the dependence on subjective choices in statistical methods, and the unresolved nature of how to best account for external factors affecting measured data.

Pengtaofu
Messages
23
Reaction score
2
TL;DR
get the typical data from database
I make a theoretical calculation and then compare the calculation result and the median of the corresonding measured dateset. The difference between them is very slight, so I state that the theoretical model is right and good. However one expert has suspended whether the median is typical value. Could anyone help to explain how to get the typical value in the experiment dataset.
Measures of central tendancy, e.g. median, mean and mode, have been introduced in a lot of statistical books, however these books seldom include the defination of the typical value or introduce the method to get the typical value in dataset. It could be found that median, mean and mode of the dataset is the typical value in some web(http://www.physics.csbsju.edu/stats/descriptive2.html). I don't know whether the typical value is one common expression or scietifical term. Could anyone help how to get the typical value or choose the typical data from the known dateset. Thank you very much.
 
Physics news on Phys.org
I don't think there is an agreed upon definition of the "typical value". In your dataset, how different are the mean, mode, and median? Another thing you could look at is the χ^2 value. This will depend on the difference between the theoretical model and the measured value, but also on the error in the measured value. How does the difference between the theoretical and measured values compare to the statistical error in the measured value?
 
  • Like
Likes   Reactions: Pengtaofu and Stephen Tashi
Pengtaofu said:
I don't know whether the typical value is one common expression or scietifical term.
"Typical value" is not a scientific term (in English).

Could anyone help how to get the typical value or choose the typical data from the known dateset. Thank you very much.
Applying statistics is a subjective matter, often influenced by traditions. Look at published articles in your field of study and see what methods have been accepted by the editors of journals.

It is possible to formulate the problem of choosing the "most typical" value as a specific mathematical problem. This can be done in various ways. The procedure requires a sophisticated understanding of mathematical statistics. It also requires many specific assumptions. Which assumptions should be made? - that is also a subjective decision.
 
  • Like
Likes   Reactions: Pengtaofu
"Measures of central tendency, e.g. median, mean and mode"

No, that is not what a "measure of central tendency" means. Central tendency is detected by e.g. the standard deviation σ: the larger σ is, the less central tendency the distribution has.

"one expert has suspended whether the median is typical value."

Huh? This is not the normal use of the word "suspended". It's not clear what this means.

Mean, and median are both measures of "middle" of a distribution. But there are a number of possible means that could be used. Usually it's the arithmetic mean. If all data is positive, it is sometimes more useful to use the geometric mean (the nth root of the product of the n data values).

The mode is the data value that is the most common. Or in the case of a distribution, it's the value with the largest probability. (Or for a continuous distribution, the value with the largest probability density.) But depending on the distribution, it need not be at all "typical".
 
zinq said:
"Measures of central tendency, e.g. median, mean and mode"

No, that is not what a "measure of central tendency" means. Central tendency is detected by e.g. the standard deviation σ: the larger σ is, the less central tendency the distribution has.

"one expert has suspended whether the median is typical value."

Huh? This is not the normal use of the word "suspended". It's not clear what this means.

Mean, and median are both measures of "middle" of a distribution. But there are a number of possible means that could be used. Usually it's the arithmetic mean. If all data is positive, it is sometimes more useful to use the geometric mean (the nth root of the product of the n data values).

The mode is the data value that is the most common. Or in the case of a distribution, it's the value with the largest probability. (Or for a continuous distribution, the value with the largest probability density.) But depending on the distribution, it need not be at all "typical".
Thank you for explanation and remind of the word "suspended". I acknownledge that use of "suspended" is not right. It should be replaced by ask, i.e. "one expert has asked whether the median is typical value."
 
Stephen Tashi said:
"Typical value" is not a scientific term (in English).Applying statistics is a subjective matter, often influenced by traditions. Look at published articles in your field of study and see what methods have been accepted by the editors of journals.

It is possible to formulate the problem of choosing the "most typical" value as a specific mathematical problem. This can be done in various ways. The procedure requires a sophisticated understanding of mathematical statistics. It also requires many specific assumptions. Which assumptions should be made? - that is also a subjective decision.
Thank you for explanation. When I studied the statistics course at uinivesity many years ago, I thought statistics is one subject of scietific. However, I have recently realized the there are a lot of subjective factors to affect the results in statistics.
 
phyzguy said:
I don't think there is an agreed upon definition of the "typical value".
Thank you very much for your explanation. Do you known whether the "typical data" isn't also agreed upon definition in statistics.
phyzguy said:
In your dataset, how different are the mean, mode, and median? Another thing you could look at is the χ^2 value. This will depend on the difference between the theoretical model and the measured value, but also on the error in the measured value. How does the difference between the theoretical and measured values compare to the statistical error in the measured value?
I have checked that the mean and median approximate in the dataset. But it is found that the some theoretical results has been out of the range [median - std, median + std]. I think the mesured values have been affected by other factors and I need to make more explanation in physics mechanism.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 25 ·
Replies
25
Views
3K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 37 ·
2
Replies
37
Views
5K
  • · Replies 4 ·
Replies
4
Views
22K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K