Why are there only 3 quartiles when dividing data into 4 equal parts?

  • Thread starter Thread starter 15123
  • Start date Start date
  • Tags Tags
    Confused
Click For Summary
SUMMARY

In statistical analysis, quantiles are used to divide data into equal parts, with quartiles specifically dividing data into four segments. There are always n-1 quantiles for n equal parts; thus, when dividing data into four quartiles, three quartiles are identified. The confusion arises from equating the number of intervals with the number of cuts, as a quartile represents a specific value rather than an interval. The fourth quartile exists as the range encompassing the highest values, but it is not counted as a quartile in the same way as the others.

PREREQUISITES
  • Understanding of basic statistical concepts, including quantiles and quartiles.
  • Familiarity with the definitions of median, tercile, quartile, quintile, and percentile.
  • Knowledge of data distribution and probability theory.
  • Ability to interpret boxplots and their representation of data segments.
NEXT STEPS
  • Study the definitions and applications of quantiles in statistics.
  • Learn how to interpret boxplots and the significance of quartiles in data visualization.
  • Explore the differences between quartiles and other quantiles, such as deciles and percentiles.
  • Investigate the mathematical principles behind data segmentation and interval analysis.
USEFUL FOR

Students, educators, and data analysts seeking clarity on statistical concepts related to data segmentation and quantile definitions.

15123
Messages
12
Reaction score
0
My professor stated the following:

"Dividing in pieces is called 'quantiles'. In almost all cases quartiles are used, where n=4. We divide it into four equal pieces and we are going to check where the margin values are.
Pay close attention: if I split my data into 4 equal pieces, I will have 3 quartiles. If I have 10 equal pieces, I have 9 deciles. You always have one less quantile than the n parts."

I don't understand why there are always n-1 quantiles. Why is the 4th quartile never mentioned, not even in boxplots? Some people say this is because the 4th quartile is the supremum (<=100).
So why does he say there are only 3 quartiles, when you have a 4th quartile as well?
e.g.: if a student scores 95%, he will belong in the 4th quartile because I cannot classify him in the 3rd quartile (75%), because 95% is between 75% and 100% (75<95<100). If a quartile is 25%, then why do we only have 3 quartiles (3*25%=75%. What about the remaining 25%)?
 
Physics news on Phys.org
That has nothing to do with probability or statistics but simple arithmetic. If you divide a rope or a stick into "n" pieces you will have to make "n-1" cuts.
 
There is still a fourth quartile. If a score on a test is 22%, I put it in the first quartile, if the score is 33%, I put it under the Median and if the score is 70%, I put it under the third quartile. What do you do with a value that is 95%? You should put it in the fourth quartile, so to me there is a fourth quartile. My professor says there are only three quartiles.

I am still lost on this.
 
There seems to be a confusion between the number of intervals (4) and the number of cuts (3). It sounds to me that the professor is confused or you didn't understand him.
 
mathman said:
There seems to be a confusion between the number of intervals (4) and the number of cuts (3). It sounds to me that the professor is confused or you didn't understand him.
I probably didn't understand him. I think he simply meant cuts=intervals-1.
 
A quartile, or any quantile for that matter, is a number, not an interval. The 25th quartile is the single value which 25% of the data falls below. It is not the interval, it is the cut. There certainly is an interval containing the largest 25% of data but that is not, by definition, a quartile. You are confusing intervals with their endpoints.
 
alan2 said:
A quartile, or any quantile for that matter, is a number, not an interval. The 25th quartile is the single value which 25% of the data falls below. It is not the interval, it is the cut. There certainly is an interval containing the largest 25% of data but that is not, by definition, a quartile. You are confusing intervals with their endpoints.
25th quartile? Don't you mean quantile? However, thanks for the explanation. Someone mentioned this to me, but I think he is incorrect:
someperson said:
Q1 - the lower 25% of the total probability

Q2 - the area encompassing 25%-75% of the total probability - Q2 is actually an area of 50% probability centered at the median.

Q3 - the area from 75%-100% of the total probability.
 
No, I meant quartile because you referred to 25%. It's just terminology but median, tercile, quartile, quintile, and percentile are special names for the quantiles that divide your interval into 2, 3, 4, 5, and 100 sub-intervals respectively. Whoever mentioned that quote to you was incorrect but I think you've got it now.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 47 ·
2
Replies
47
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 3 ·
Replies
3
Views
508
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 21 ·
Replies
21
Views
5K