How is the harmonic mean affected by additional data points?

Click For Summary

Discussion Overview

The discussion revolves around the effects of adding additional data points on the harmonic mean (HM) of a given dataset. Participants explore how the HM changes with the introduction of new points, particularly focusing on whether the new HM will be higher or lower based on the values of the added points. The conversation includes both theoretical considerations and specific calculations.

Discussion Character

  • Exploratory
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant presents a dataset and calculates the HM, noting that adding two new points resulted in a higher HM than expected, prompting questions about the influence of new data points.
  • Some participants propose that if a new point is lower than the current HM, it will lower the overall HM, while if it is higher, it will increase the HM.
  • Another participant suggests that the HM of multiple new points can be predicted based on their individual HMs, indicating that the combined HM will be influenced by the HMs of the new points.
  • One participant corrects a formula presented earlier, asserting that the correct expression for combining HMs involves a weighted harmonic mean, which will yield a result between the two HMs being combined.

Areas of Agreement / Disagreement

Participants generally agree on the basic principles of how the HM is affected by new data points, but there is some contention regarding the specifics of the formulas and the implications of adding multiple points. The discussion remains unresolved regarding the predictability of the HM when multiple new points are added.

Contextual Notes

Some participants note that the calculations and predictions depend on the specific values of the new data points and their relationship to the existing HM, highlighting the complexity of the situation when multiple points are involved.

Who May Find This Useful

This discussion may be useful for those interested in statistical methods, particularly in understanding the properties and behavior of the harmonic mean in relation to data analysis and aggregation of datasets.

Feynstein100
Messages
181
Reaction score
17
We have a collection of 8 discrete data points. They are:
10, 20, 30, 20, 30, 40, 30, 40
In increasing order:
10, 20*2, 30*3, 40*2
The harmonic mean of this data series is 22.86
I read on Wikipedia that the harmonic mean is skewed towards the smaller values i.e. smaller values will affect the HM more than larger values. So I thought that if we add 2 additional data points 20 and 30, our HM would be even smaller. And yet, when I calculated the HM of this new data series with 10 points:
10, 20*3, 30*4, 40*2
it turned out to be 23.08 i.e. higher than the previous case. Why did that happen?
One of our new points was lower than the HM whereas the other was higher. I thought the HM would be more skewed toward the lower value and thus would bring the overall mean down. Ah is it because the second datapoint was much higher than the HM?
In general, I'm interested in the question of how adding new datapoints will affect the HM of the existing data series.
We're not changing the endpoints, they remain constant. So any new point added will lie somewhere inside the bounds of the data series. In our example, that's 10 and 40.
So I think the answer is quite simple. If New point < HM, it lowers the HM. If New point > HM, it increases the HM.
It seems quite straightforward for adding one datapoint but what if we add multiple? In essence appending another data series to the existing one. Can we predict in advance if the new HM will be higher or lower?
 
Physics news on Phys.org
It looks like a simple quantitative question. One point at a time will give a predictable result. More than one - results?
 
Feynstein100 said:
It seems quite straightforward for adding one datapoint but what if we add multiple? In essence appending another data series to the existing one. Can we predict in advance if the new HM will be higher or lower?
Yes. If the harmonic mean of the new points is lower, the HM of all the points will be lower, If it is higher the new HM will be higher. Easy to see from HM of \frac {1}{\sum {\frac{1}{a_i}}} and \frac {1}{\sum {\frac{1}{b_i}}} is \frac {1}{ \sum {\frac{1}{a_i}} + \sum {\frac{1}{b_i}} }
 
  • Like
Likes   Reactions: Feynstein100
mathman said:
It looks like a simple quantitative question. One point at a time will give a predictable result. More than one - results?
That's............kind of what I'm asking 😂
 
willem2 said:
Yes. If the harmonic mean of the new points is lower, the HM of all the points will be lower, If it is higher the new HM will be higher. Easy to see from HM of \frac {1}{\sum {\frac{1}{a_i}}} and \frac {1}{\sum {\frac{1}{b_i}}} is \frac {1}{ \sum {\frac{1}{a_i}} + \sum {\frac{1}{b_i}} }
Thanks for the reply. I worked it out myself and turns out, the combined harmonic mean Hc of two harmonic means H1 with m items and H2 with n items will be
Hc =(m + n)/(m/H1 + n/H2)
i.e. the weighted harmonic mean of H1 and H2. And by the general property of all means, Hc will be somewhere between H1 and H2. Thus, if the new HM is lower, the overall HM will be lower as well. And if the new HM is higher, the overall HM will be higher as well.

Btw your third formula has a mistake. It should be 2/(sum of inverses), not 1/(sum of inverses) and that's a special case of when both series have the same number of items. The general formula is the weighted harmonic mean.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 26 ·
Replies
26
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 18 ·
Replies
18
Views
5K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 19 ·
Replies
19
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
8K