Addressing Odd Behaviour in Histogram Bin Counting for Experimental Data

  • Context: Undergrad 
  • Thread starter Thread starter Hypercubes
  • Start date Start date
  • Tags Tags
    Bin Histogram
Click For Summary
SUMMARY

The discussion addresses an issue with histogram bin counting in OpenOffice Calc, where the frequency function incorrectly categorizes the value '1' into the '50000' bin instead of the '0' bin. The user discovered that the frequency function evaluates bins from the lowest to the highest, which can lead to unexpected results if the bin ranges are not set correctly. Adjusting the bin ranges or starting the bins at '1' can resolve this issue and ensure accurate frequency counts.

PREREQUISITES
  • Understanding of histogram binning concepts
  • Familiarity with OpenOffice Calc functions
  • Knowledge of frequency distribution
  • Basic statistical principles related to data categorization
NEXT STEPS
  • Research how to set bin ranges correctly in OpenOffice Calc
  • Learn about statistical functions in OpenOffice Calc
  • Explore best practices for histogram creation in spreadsheet software
  • Investigate alternative tools for data visualization, such as R or Python's Matplotlib
USEFUL FOR

This discussion is beneficial for data analysts, researchers, and anyone using OpenOffice Calc for statistical analysis and data visualization, particularly those working with histograms and frequency distributions.

Hypercubes
Messages
38
Reaction score
0
I am making a histogram for some experimental data in OpenOffice Calc (Excel equivalent). However, when I have it count frequency for a certain number, it displays odd behaviour.

For example:

Data:
1
499500
1000
375250
1000

Bins:
0
50000
100000
150000
200000

The problem is that it counts the 1 in the data in the 50000 bin instead of the 0 bin, which seems pretty ridiculous. If I set the 0 bin to 5, for example, it is grouped under that. My question is, is this statistically correct behaviour? If not, how should I fix this? Start the bin at 1, perhaps?

Thanks in advance.
 
Physics news on Phys.org
Never mind, I found what the problem was.

Apparently the frequency function searches for frequency from a bin down to the last one. For example:

Bins
0
5000
10000

The corresponding ranges are:
0-0
0-5000
5001-10000
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
8K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 8 ·
Replies
8
Views
5K
  • · Replies 5 ·
Replies
5
Views
6K
  • · Replies 1 ·
Replies
1
Views
13K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
460