Interpreting Normal Distribution Logic for Data Analysis

Click For Summary

Discussion Overview

The discussion revolves around the hypothesis of normal distribution in data analysis, specifically focusing on the justification and mathematical arguments that can support this hypothesis before conducting formal hypothesis testing. The context includes practical applications related to data from bus arrival times and the interpretation of histograms.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant seeks arguments to support the hypothesis of normal distribution beyond visual inspection of histograms.
  • Another participant suggests that a common explanation for normal distribution is the sum of many independent random variables, noting that this reasoning may be too vague for mathematical testing.
  • A participant describes their specific case of bus arrival times, indicating that while the histogram resembles a normal distribution, it also shows similarities to a log-normal distribution, raising questions about the validity of the normality hypothesis.
  • There are suggestions to evaluate skewness and kurtosis as measures of normality, with a recommendation to explore various statistical tests for normality, including Kullbach-Leiber distance, Kolmogorov-Smirnov test, Agostino's K squared test, Anderson-Darling test, and Shapiro-Wilk test.

Areas of Agreement / Disagreement

Participants express differing views on how to justify the normality hypothesis, with some proposing mathematical tests while others highlight the philosophical aspects of normal distribution. The discussion remains unresolved regarding the best approach to validate the hypothesis.

Contextual Notes

Participants mention various statistical tests and measures, but there is no consensus on which specific methods are most appropriate for the given data or situation. The discussion also reflects uncertainty regarding the nature of the data and its distribution.

Mark J.
Messages
81
Reaction score
0
Hi.
I need some arguments to move forward the hypothesis that my data are normally distributed.
Except fact that I build histogram and it is similar of shape as normal distribution how can I find some other math arguments (I mean interpreting the logic of normal distribution) to argue about this hypothesis before beginning hypothesis testing with different criteria and tests?

Regards
 
Physics news on Phys.org
One common explanation for a phenomena producing a normal distribution is that it the sum (meaning literally an arithmetic sum) of many independent random random variables.

A hazier philosophical version of this is reasoning is that a phenomena that results from the combined effect of many small and independent random causes will have a normal distribution. (This argument is too vague to be tested mathematically, but you haven't made it clear what kind of "justification" for a normal distribution you want.)
 
I have a process of bus arrivals.
While taking inter-arrival times between 2 following buses as random variables I build histogram and it has shape of normal distribution but meanwhile it is similar to log-normal ,etc.
Now I am in search of some math arguments (theory)to check in order to follow the hypothesis of normal distribution.
If you can advice me on that pls
 
Mark J. said:
I have a process of bus arrivals.
While taking inter-arrival times between 2 following buses as random variables I build histogram and it has shape of normal distribution but meanwhile it is similar to log-normal ,etc.
Now I am in search of some math arguments (theory)to check in order to follow the hypothesis of normal distribution.
If you can advice me on that pls

There are a number of tests of normality. You can directly evaluate the third and fourth moments (skewness and kurtosis) which both should be close to 0. In addition there are specific tests which you can look up: Kullbach-Leiber distance, Kolmogorov-Smirnoff (adaption), Agostino's K squared test, Anderson-Darling, Shapiro-Wilks and tests in SPSS and other statistical software.

http://webspace.ship.edu/pgmarr/Geo441/Examples/Normality Tests.pdf
 
Last edited:

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K