Bi-geometrical mean using logs -- don't get the same result

Click For Summary

Homework Help Overview

The discussion revolves around the computation of the bi-geometrical mean, particularly in the context of data that includes negative values. The original poster is attempting to understand discrepancies between results obtained using different formulas for the geometric mean, specifically when applying logarithmic methods versus traditional geometric mean calculations.

Discussion Character

  • Conceptual clarification, Assumption checking, Problem interpretation

Approaches and Questions Raised

  • Participants explore the differences between three distinct formulas for calculating averages and growth rates, questioning the expectations of equivalence between them. The original poster seeks clarity on why the logarithmic approach yields a different result compared to the traditional geometric mean formula.

Discussion Status

There is ongoing exploration of the definitions and implications of the different formulas presented. Some participants provide insights into the nature of growth rates versus averages, while others express confusion regarding the clarity of the explanations given. No consensus has been reached, and the discussion remains open with various interpretations being considered.

Contextual Notes

Participants note the potential for misunderstanding due to the mixing of concepts related to growth rates and averages. There is also mention of specific conditions under which the formulas may yield different results, particularly regarding the treatment of values less than one and the implications of adding one in certain calculations.

ducmod
Messages
86
Reaction score
0

Homework Statement


Hello!
I am trying to compute the bi-geometrical mean on data that contains negatives.
But before that I wanted the test the formula that accounts only for positive values using the sum of their logarithms. By doing so I don't get the result I compute by using the "usual" geo mean formula:
for values between 0 and 1: ((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1
for other positive values: (last value / initial value) ^ (1/number of periods) - 1

Homework Equations


Here is the example:
1,062 1,252 1,587 1,934 2,519
Geometric mean = (2519 / 1062) ^ (1/5) - 1 = 18.86%

The Attempt at a Solution


Using the formula log g = sum of logs of each value (see picture attached)
log g = (LOG(1062)+LOG(1252)+LOG(1587)+LOG(1934)+LOG(2519))/number of periods
I get geo mean = 0.505, which is nowhere close to 18.86%
What am I doing wrong?

Thank you!
 

Attachments

  • Screen Shot 2017-01-27 at 2.58.33 PM.png
    Screen Shot 2017-01-27 at 2.58.33 PM.png
    4.7 KB · Views: 582
Physics news on Phys.org
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.
 
mfb said:
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.
Thank you very much for your reply.
As to the first formula: (last value / initial value) ^ (1/(number of periods)) - 1 it is a compound average growth rate, which should give the same result as the geometric average, and I wouldn't say that it is for a sorted set of data, because "sorted" implies that values grow sequentially (second is bigger than the first one, etc), which is not the case usually and which is why this formula is used to compute growth rate; the last formula is a geometric average computed for values between 0 and 1 (this is why 1 is added, to smooth those values).
 
An average growth rate and an average of the values are different things.

Your growth rate doesn't change if you multiply all values by 1000, but the average of the values does change.
 
mfb said:
An average growth rate and an average of the values are different things.

Your growth rate doesn't change if you multiply all values by 1000, but the average of the values does change.
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.
 
mfb said:
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.

It ought to yield identical results to the log form, except maybe for roundoff errors, because
$$\log \left[ \prod_{i=1}^n A_i \right]^{1/n} = \frac{1}{n}\sum_{i=1}^n \log A_i . $$
 
ducmod said:
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.

You really are confusing yourself by mixing up several different concepts. The two series s1 = {1, 1.2, 1.44} and s2 = {1000,1200, 1440} both have the same (geometric) growth rate r = 0.2 (20%), but their numerical values are orders of magnitude different. Furthermore, the calculation ##(1440/1000)^{1/3} -1 \doteq 1.1292 -1 = 0.1292## does not give a value anywhere near the true growth rate 0.20.

I can think of no significance whatsoever for the quantity ##(a_n/a_1)^{1/n}-1## that you have called the "geometric mean". In mathematics the concept of geometric mean is well-defined (and has been for hundreds of years); it is not what you are claiming it to be.
 
Don't forget the 1 that gets added and subtracted.
Example: a0=1, a1=2. The formula with +1 gives ##\sqrt{6}-1 \approx 1.45##, the log formula gives 1.16. Let a0 go to zero and the log average diverges while the other one stays finite.

ducmod said:
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.
I don't understand what is unclear.
You have three different formulas calculating different things.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
12K
  • · Replies 0 ·
Replies
0
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 10 ·
Replies
10
Views
4K
  • · Replies 13 ·
Replies
13
Views
3K