How do I determine the class size for creating a histogram in statistics?

  • Thread starter Thread starter Saladsamurai
  • Start date Start date
  • Tags Tags
    Histogram Stats
Click For Summary
SUMMARY

Determining the class size for creating a histogram involves calculating the range by subtracting the lowest value from the highest value of the dataset, which in this case is the number of Medical Doctors per 100,000 people across various states. The class size can be considered arbitrary and should be chosen based on the desired appearance of the histogram, such as whether it should resemble a normal distribution or allow for gaps. While using actual endpoints for the range is one method, it is not mandatory.

PREREQUISITES
  • Understanding of basic statistical concepts, including histograms and class intervals.
  • Familiarity with calculating range in a dataset.
  • Knowledge of how to interpret histograms and their significance in data visualization.
  • Basic proficiency in using statistical software or tools for data analysis.
NEXT STEPS
  • Research methods for determining optimal class size for histograms in statistics.
  • Learn about the impact of class size on histogram shape and data interpretation.
  • Explore statistical software options for creating histograms, such as R or Python's Matplotlib.
  • Study the concept of normal distribution and how it relates to histogram construction.
USEFUL FOR

Statisticians, data analysts, students learning statistics, and anyone involved in data visualization who needs to create effective histograms.

Saladsamurai
Messages
3,009
Reaction score
7
I have a quick question regarding a stats problem in my girlfriend's text. It is pretty easy I suppose, but I am not quite sure; the text does not appear to give a "general form" of how to obtain the info.

Feel free to correct me if I misuse words here, I am not familiar with the stats lingo yet.

It is asking to make a histogram. There is a table that gives us each state (the individual) and the number of Medical Doctors per 100,000 people is in each state (<--the variable, I presume).

Now I know that I find the Range by subtracting the Lowest Actual variable from the Highest actual Variable.

Here is where I am getting a little lost: I think now I am supposed to divide the actual range by the "class size" in order to find the number of classes so I can start to draw my histogram.

But how do I choose the size of each class? Is it arbitrary?

And why do I use the actuals to compute the range instead of the reasonable beginning/end?Thanks,
Casey
 
Physics news on Phys.org
I'd first plot the data by state to get a "feel."

Class size is arbitrary. It partly depends on whether you'd like the histogram to look like a "normal curve" or can live with "gaps in the middle."

You do not have to use the actual endpoints, but that's one way.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 15 ·
Replies
15
Views
2K
Replies
2
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
Replies
6
Views
606
  • · Replies 3 ·
Replies
3
Views
3K