Histogram of unequal class width

  • Context: Undergrad 
  • Thread starter Thread starter kelvin macks
  • Start date Start date
  • Tags Tags
    Class Histogram Width
Click For Summary

Discussion Overview

The discussion revolves around the representation of frequency data in a histogram with unequal class widths, specifically addressing how to accurately depict frequencies when the total frequency for a class is given but divided among sub-intervals. Participants explore the implications of class width on histogram representation and the concept of frequency density.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant questions how the frequency of 20 for the class 100-199 can be represented as 10 in the histogram, expressing confusion over the apparent discrepancy.
  • Another participant suggests that if the frequency were split into two sub-classes of 10 each, the histogram should reflect that the total area represents the frequency, not just the height of the bars.
  • A participant reiterates the importance of area in the histogram, emphasizing that the total area must remain consistent regardless of how the data is binned.
  • There is a suggestion to label the y-axis as frequency density, though some participants express uncertainty about the appropriateness of this label.
  • One participant argues that normalization is necessary to account for class width, questioning whether a bar should extend to 20 if the class width is larger.
  • Another participant agrees with the normalization point but seeks clarification on the labeling of the y-axis.

Areas of Agreement / Disagreement

Participants express differing views on how to represent the frequency in the histogram, particularly regarding the normalization process and the labeling of the y-axis. There is no consensus on the best approach to take in this situation.

Contextual Notes

Participants highlight the need to consider class width when representing data in histograms, indicating that the discussion may be limited by assumptions about the data's original binning and the implications of frequency density.

kelvin macks
Messages
60
Reaction score
0
for the class 100-199, the f (number of household) is 20, but in the histogram, the frequency is divided by 2 which 20/2 = 10 , but how can it show that in the class 100-199 , the total frequency of 100-199 is 10 ? when i look at the histogram, i would directly think that there are f=10 in the class of 100-199... how can it relate to the f for 100-199 is 20? i totally can't undrestand.
 

Attachments

  • IMG_20140616_064107[1].jpg
    IMG_20140616_064107[1].jpg
    34.2 KB · Views: 1,572
  • IMG_20140616_064133[1].jpg
    IMG_20140616_064133[1].jpg
    34 KB · Views: 759
Physics news on Phys.org
Imagine that you had ##f=10## for class 100-149 and ##f=10## for class 150-199, what would the bar graph look like? Now imagine that the some of the information is lost, and you only know that ##f=20## for class 100-199. Why should the graph then be different? (Note that I split it into two times 10 for illustration purposes. In reality, this is only true on average. Increasing the class size means that the average is taken over a greater interval.)

What is important is the area of the bar in the graph.
 
  • Like
Likes   Reactions: 1 person
DrClaude said:
Imagine that you had ##f=10## for class 100-149 and ##f=10## for class 150-199, what would the bar graph look like? Now imagine that the some of the information is lost, and you only know that ##f=20## for class 100-199. Why should the graph then be different? (Note that I split it into two times 10 for illustration purposes. In reality, this is only true on average. Increasing the class size means that the average is taken over a greater interval.)

What is important is the area of the bar in the graph.

i can understand your explanation. why can't i just draw f=20 for 100-199? since the question only give f=20 for 100-199
 
shall i label the y-axis of the graph above as frequency density?
 
kelvin macks said:
i can understand your explanation. why can't i just draw f=20 for 100-199? since the question only give f=20 for 100-199

Because you have to normalize the data to take into account the class width. The total area must be the same whatever you choose as the size of the bins.

Say there is no household that consumed less than 100 kWh. You could then make the class go from 0 to 199, and you would still have ##f = 20##. Do you still think that the bar should go up to 20 from 0 to 199?
 
kelvin macks said:
shall i label the y-axis of the graph above as frequency density?

I'm not sure if it's appropriate to call it that. I'll leave it to others to answer that.

It is true that the y scaling would change if the original data had been binned in smaller bins. Here, the data is normalized to what they call the "standard width."
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
4K
Replies
6
Views
3K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 3 ·
Replies
3
Views
888
  • · Replies 1 ·
Replies
1
Views
11K
  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 2 ·
Replies
2
Views
6K
  • · Replies 2 ·
Replies
2
Views
3K