Help understanding the Empirical Rule & Chebyshev Theory

  • Context: Undergrad 
  • Thread starter Thread starter iPhysicz
  • Start date Start date
  • Tags Tags
    Theory
Click For Summary

Discussion Overview

The discussion focuses on understanding the Empirical Rule and Chebyshev's Theorem, exploring their similarities and differences, and applying these concepts to specific data scenarios. It includes theoretical explanations and practical applications related to statistics.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning
  • Homework-related

Main Points Raised

  • One participant expresses confusion about the distinctions between the Empirical Rule and Chebyshev's Theorem, noting that the former applies to normal distributions while the latter applies to any distribution with finite variance.
  • Another participant clarifies that the Empirical Rule is a set of guidelines for percentages within standard deviations of the mean for normal distributions, while Chebyshev's Theorem is a proven theorem applicable to all distributions.
  • A participant presents a specific data set with a mean of 75 and a standard deviation of 5, seeking assistance in calculating proportions of measurements within certain ranges using Chebyshev's Theorem.
  • Further clarification is provided that the proportions calculated should be stated as lower bounds, emphasizing that Chebyshev's Theorem provides at least certain percentages for the specified ranges.
  • Participants discuss the implications of the calculations, particularly regarding the maximum percentage of data that could be above a certain threshold, indicating uncertainty in the distribution shape.

Areas of Agreement / Disagreement

Participants generally agree on the basic definitions and applications of the Empirical Rule and Chebyshev's Theorem, but there is ongoing uncertainty regarding the specific calculations and implications for the data set presented.

Contextual Notes

Participants note the limitations of Chebyshev's Theorem in providing only lower bounds for the proportions, and the lack of information about the data set's size and distribution shape complicates the calculations.

iPhysicz
Messages
1
Reaction score
0
I'm having trouble distinguishing the similarities and the differences between the Empirical Rule and Chebyshev’s Theory. I'm a long time lurker here and figured this would be the place to ask. I understand that Chebyshev's Theory deals with real world distributions and Empirical Rule deals with normal distributions but I can't really distinguish what else to say about it... Please help thanks!
 
Physics news on Phys.org
You essentially nailed it. The Empirical Rule is simple a condensed set of 'rules' (guidelines would be a better term') about the approximate percentages that are found with 1, 2, and 3 standard deviations of the mean for a normal distribution. It is not a mathematical theorem.

Chebyshev's Theorem, on the other hand, IS a theorem - there is a proof of the result: the only requirement is that the distribution have a finite variance. Chebyshev's theorem holds for any distribution, symmetric or skewed. It's most important use (IMO) is not in data description but in more theoretical settings.
 
I just can't grasp how to figure out proportions of measurements below a certain number. For instance: Set data has mean of 75 and standard deviation of 5. No info about size of data set or shape of distribution (therefore use chebyshev's).
1. What can you say about proportions of measurements between 60 and 90. (I got 89%).
2. Between 65 and 85. ( I got 75%)
3. Above 90? This is where I get stuck! Can someone please help me?
 
Last edited:
lwerlinger said:
I just can't grasp how to figure out proportions of measurements below a certain number. For instance: Set data has mean of 75 and standard deviation of 5. No info about size of data set or shape of distribution (therefore use chebyshev's).
1. What can you say about proportions of measurements between 60 and 90. (I got 89%).
2. Between 65 and 85. ( I got 75%)
The only improvement I would make on these numbers is to say at least [itex]89\%[/itex] and at least [itex]75\%[/itex] - Chebyschev's Theorem gives a lower bound on the trapped percentages.
3. Above 90? This is where I get stuck! Can someone please help me?

Remember from part (1) of your question that at least [itex]89\%[/itex] of the scores are between 65 and 90. Since you can't assume anything about the shape of the distribution, the best you can say is this: we're still missing a maximum of [itex]11\%[/itex]
of the data. It's possible that all of it missing data is above 90, so the only conclusion to make is at most [itex]11\%[/itex] of the scores are above 90
 
ah ha. Well that seems almost to easy. Thanks statdad, I really appreciate the help!
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 23 ·
Replies
23
Views
5K
  • · Replies 11 ·
Replies
11
Views
4K
Replies
0
Views
2K
  • · Replies 105 ·
4
Replies
105
Views
16K
  • · Replies 15 ·
Replies
15
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K