What is the meaning of 'order of observations' in quantile plots?

  • Thread starter Saladsamurai
  • Start date
In summary, the quantile plot simply plots the data values on the vertical axis against the fraction of observations exceeded by the data value. The quantile is a value for which a specified fraction of the data values is less than or equal to the value. The quantile plot is used to determine the distribution of a data set.
  • #1
Saladsamurai
3,020
7

Homework Statement


I am reading a definition a quantile plot and a quantile and I am getting a little confused. It says:

A quantile, q(f), is a value for which a specified fraction f of the data values is less than or equal to q(f).

Ok, I get that. Then it says:

A quantile plot simply plots the data values on the vertical axis against the fraction of observations exceeded by the data value.

Get that too; but then it says

For theoretical purposes this fraction is computed by

[tex]f_i=\frac{i - \frac{3}{8}}{n+\frac{1}{4}}[/tex]​

where i is the order of the observations when they are ranked low to high.
I don't get that part. What exactly is this 'order of observations?'

I am interpreting it as the following:
If i = 4, we are finding the fraction f4, which is the fraction of data values that are less than or equal to the value of the 4th observation.

Is that correct?
 
Last edited:
Physics news on Phys.org
  • #2
The 4th smallest observation. The text is talking about order statistics.

When you collect data, your measurements will be rather random. Suppose you perform some experiment in which you expect to be measuring the same random process over and over. You might well think something fishy if your first measurement is -0.43, the next one -0.21, the next one 0.16, 0.28, 0.45, and so on. You wouldn't think anything is fishy if you took your randomly-collected data and sorted it from smallest to largest and saw -0.43, -0.21, 0.16, ...

That sorted data tells a whole lot about the underlying distribution, even when you haven't the foggiest idea about the nature of the random process. When you do have some idea about that process it also tells quite a bit. Here you are assuming a normal distribution. The quantiles generated by that statistic, (i-3/8)/(N+1/4), are approximately unbiased if the underlying random process is normal. This is the type 9 quantile formula. There are, not surprisingly, 8 others.
 
  • #3
OK, I follow you, I think. This is the first week of class and he started us on this random section in the 8th chapter in the text. So I am assuming that the derivation of the formula above might be outside of my scope of understanding for the time being?
 

1. What is a quantile?

A quantile is a statistical concept that divides a dataset into equal-sized subsets. For example, the median is a quantile that divides a dataset into two equal-sized subsets, with 50% of the data falling below the median and 50% above.

2. How are quantiles calculated?

Quantiles are calculated by ranking the data from smallest to largest and then dividing the data into equal-sized subsets based on the desired quantile. For example, the median would be the value in the middle of the ranked dataset.

3. What is the purpose of using quantiles?

Quantiles are used to summarize and understand the distribution of a dataset. They can provide information about the spread and central tendency of the data, and can help identify outliers or extreme values.

4. What are subscripts in statistics?

In statistics, subscripts are used to differentiate between variables or groups within a dataset. They are typically used to represent different levels of a categorical variable or to distinguish between multiple variables with the same name.

5. How are subscripts used in calculations?

Subscripts are commonly used in statistical formulas to indicate which variables or groups are being referenced. They can also be used to represent different conditions or scenarios in a calculation, such as in multiple regression analysis.

Similar threads

  • Calculus and Beyond Homework Help
Replies
4
Views
1K
  • Calculus and Beyond Homework Help
Replies
8
Views
943
  • Calculus and Beyond Homework Help
Replies
9
Views
1K
  • Calculus and Beyond Homework Help
Replies
4
Views
1K
  • Calculus and Beyond Homework Help
Replies
2
Views
1K
  • Calculus and Beyond Homework Help
Replies
11
Views
1K
  • Calculus and Beyond Homework Help
Replies
24
Views
2K
  • Calculus and Beyond Homework Help
Replies
3
Views
272
  • Calculus and Beyond Homework Help
Replies
6
Views
228
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
2K
Back
Top