Bi-geometrical mean using logs -- don't get the same result

In summary, the first equation calculates the geometric mean of a sorted series of numbers, while the second equation calculates the exponential average of a series of numbers.
  • #1
ducmod
86
0

Homework Statement


Hello!
I am trying to compute the bi-geometrical mean on data that contains negatives.
But before that I wanted the test the formula that accounts only for positive values using the sum of their logarithms. By doing so I don't get the result I compute by using the "usual" geo mean formula:
for values between 0 and 1: ((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1
for other positive values: (last value / initial value) ^ (1/number of periods) - 1

Homework Equations


Here is the example:
1,062 1,252 1,587 1,934 2,519
Geometric mean = (2519 / 1062) ^ (1/5) - 1 = 18.86%

The Attempt at a Solution


Using the formula log g = sum of logs of each value (see picture attached)
log g = (LOG(1062)+LOG(1252)+LOG(1587)+LOG(1934)+LOG(2519))/number of periods
I get geo mean = 0.505, which is nowhere close to 18.86%
What am I doing wrong?

Thank you!
 

Attachments

  • Screen Shot 2017-01-27 at 2.58.33 PM.png
    Screen Shot 2017-01-27 at 2.58.33 PM.png
    4.7 KB · Views: 523
Physics news on Phys.org
  • #2
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.
 
  • #3
mfb said:
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.
Thank you very much for your reply.
As to the first formula: (last value / initial value) ^ (1/(number of periods)) - 1 it is a compound average growth rate, which should give the same result as the geometric average, and I wouldn't say that it is for a sorted set of data, because "sorted" implies that values grow sequentially (second is bigger than the first one, etc), which is not the case usually and which is why this formula is used to compute growth rate; the last formula is a geometric average computed for values between 0 and 1 (this is why 1 is added, to smooth those values).
 
  • #4
An average growth rate and an average of the values are different things.

Your growth rate doesn't change if you multiply all values by 1000, but the average of the values does change.
 
  • #5
mfb said:
An average growth rate and an average of the values are different things.

Your growth rate doesn't change if you multiply all values by 1000, but the average of the values does change.
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.
 
  • #6
mfb said:
You have three completely different formulas computing different things here. I don't see why you expect two of them to give the same result.

(last value / initial value) ^ (1/(number of periods)) - 1 for sorted values gives you some information how much the numbers grow per step. Where "number of periods" should probably be the number of data points minus one.

Averaging the logs (the result is not 0.505) gives you some information about the average value, but not about the differences between the values. Calculate its exponential to get back to the original scale.

((1+a1) * (1+a2) * ...(1+ aN)) ^(1/n) - 1 is yet another parameter, telling you something about the average. It is similar to the logarithmic average but not identical.

It ought to yield identical results to the log form, except maybe for roundoff errors, because
$$\log \left[ \prod_{i=1}^n A_i \right]^{1/n} = \frac{1}{n}\sum_{i=1}^n \log A_i . $$
 
  • #7
ducmod said:
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.

You really are confusing yourself by mixing up several different concepts. The two series s1 = {1, 1.2, 1.44} and s2 = {1000,1200, 1440} both have the same (geometric) growth rate r = 0.2 (20%), but their numerical values are orders of magnitude different. Furthermore, the calculation ##(1440/1000)^{1/3} -1 \doteq 1.1292 -1 = 0.1292## does not give a value anywhere near the true growth rate 0.20.

I can think of no significance whatsoever for the quantity ##(a_n/a_1)^{1/n}-1## that you have called the "geometric mean". In mathematics the concept of geometric mean is well-defined (and has been for hundreds of years); it is not what you are claiming it to be.
 
  • #8
Don't forget the 1 that gets added and subtracted.
Example: a0=1, a1=2. The formula with +1 gives ##\sqrt{6}-1 \approx 1.45##, the log formula gives 1.16. Let a0 go to zero and the log average diverges while the other one stays finite.

ducmod said:
Sorry but all this doesn't help me to understand and solve the whole issue. Your answers are very vague, and they leave me very confused.
I don't understand what is unclear.
You have three different formulas calculating different things.
 

FAQ: Bi-geometrical mean using logs -- don't get the same result

What is the bi-geometrical mean?

The bi-geometrical mean is a mathematical concept that takes into account the geometric mean and the arithmetic mean of a set of numbers. It is calculated by taking the square root of the product of the numbers and then taking the logarithm of that result.

How is the bi-geometrical mean calculated?

The bi-geometrical mean is calculated by taking the square root of the product of the numbers and then taking the logarithm of that result. This means that the numbers are first multiplied together, then the square root is taken, and finally the logarithm is applied to the result.

Why don't we get the same result when using logs?

The reason for this is because the logarithm function is non-linear, meaning that it does not follow a straight line. This can result in different values when comparing the arithmetic mean and the bi-geometrical mean, as the logarithm function can amplify small differences between the numbers.

What is the purpose of using logs in the bi-geometrical mean calculation?

Using logs in the bi-geometrical mean calculation helps to reduce the effects of extreme values in the set of numbers. It also allows for the comparison of numbers on a logarithmic scale, which can be useful in certain applications.

How is the bi-geometrical mean used in scientific research?

The bi-geometrical mean is often used in scientific research to take into account both the geometric and arithmetic means of a set of numbers. This can provide a more accurate representation of the data and can help to reduce the impact of outliers. It is commonly used in fields such as biology, economics, and finance.

Similar threads

Back
Top