Confusion about normal probability plots

Click For Summary

Discussion Overview

The discussion revolves around the construction and interpretation of normal probability plots (normal quantile plots), focusing on the axes used, the nature of the data plotted, and the calculations involved in determining theoretical values. Participants express confusion regarding standard practices and methodologies in creating these plots.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant questions which values should be plotted on the vertical axis, observing that there seems to be no standard practice.
  • Another participant suggests that the axes can represent either observed or theoretical values, depending on what simplifies the calculations.
  • There is a discussion about whether probabilities or actual values should be plotted, with some indicating that observed probabilities are typically derived from frequency data.
  • Participants inquire about the nature of the scales used in the plots, with a suggestion that they are usually linear.
  • There is a question regarding how to calculate the theoretical value corresponding to an observed value, with a reference to the theory of probabilities.
  • One participant proposes a method for plotting observed values against their corresponding probabilities, seeking confirmation on the approach.
  • Another participant mentions the calculation of z-scores as part of determining probabilities, but raises a concern that plotting (v, z-score of v) would yield a straight line, regardless of data distribution.

Areas of Agreement / Disagreement

Participants express differing views on the standard practices for plotting normal probability plots, with no consensus reached on the best approach or methodology. The discussion remains unresolved regarding the specifics of plotting and calculations.

Contextual Notes

Participants highlight potential ambiguities in definitions and practices related to normal probability plots, including the calculation of theoretical values and the interpretation of axes. There are also unresolved questions about the implications of using z-scores in the context of these plots.

nomadreid
Gold Member
Messages
1,773
Reaction score
256
The expositions for a normal probability plot (aka normal quantile plot) (in which observed probabilities are plotted against theoretical probabilities, or sometimes the other way around, to get a rough check as to whether a set of data is normally distributed by checking linearity) are not too clear (to me).
To make this easy to answer, I will put my doubts into four succinct questions:
First, which one standardly goes on the vertical axis: the observed or theoretical values?
Secondly, does one put probabilities, or the values, on the axes? If probabilities, are the observed probabilities just calculated as per a frequency table?
Thirdly (and this depends on the answer to the previous question), are the scales linear, a concatenation of logarithmic scales, or what?
Fourthly, how does one calculate the theoretical value that goes to a given observed value?
Thanks for answering any (or all) of these.
 
Physics news on Phys.org
http://analyse-it.com/blog/2008/11/normal-quantile-probability-plots
First, which one standardly goes on the vertical axis: the observed or theoretical values?
Doesn't matter. I haven't heard of a standard. Use the one that makes the math easiest.

Secondly, does one put probabilities, or the values, on the axes? If probabilities, are the observed probabilities just calculated as per a frequency table?
All the ones I've seen are from frequency data.
The idea is to use the theoretical distribution to generate a theoretical data set which you compare with the actual data set. So you use whatever the data says it is.

Thirdly (and this depends on the answer to the previous question), are the scales linear, a concatenation of logarithmic scales, or what?
They are usually linear.

Fourthly, how does one calculate the theoretical value that goes to a given observed value?
You use the theory of probabilities. You know how a normal distribution works right?
 
Thanks, Stephen Bridge. So, following your answers and the link you sent, I would proceed as follows:
put the observed values on one axis (say, the horizontal one)
Then for each observed value v, I find Prob(X< v), and plot that on the other axis.
Right?
 
Thanks, Stephen Bridge. So, following your answers and the link you sent, I would proceed as follows:
put the observed values on one axis :say, the horizontal one.
Then for each observed value v, I find Prob(X< v) from the normal curve, and plot (v, Prob(X< v)) .
Right?
 
You would compute Z-score.
http://www.measuringusability.com/zcalc.htm

... once you have the key words, you can look them up ;)
 
Thanks, of course I would calculate the z-score in order to find the probability, but I don't think you mean that the points are (v, z-score of v), because that will always give you a straight line
y=(1/σ)x - (μ/σ),
regardless of whether your data is normally distributed or not.
 
Last edited:

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 24 ·
Replies
24
Views
4K
Replies
5
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K