What is the correct way of describing this change - mean or median?

musicgold · Oct 9, 2013

Hi,

Please see the attached Excel file.

The list shows the old and new values of a set of variables. I am trying to understand which is the better way, average (mean) or median, to describe the change in the values of the set. I want to describe the true central tendency of the change.

1. I think there are four ways I can describe the change in the values of the set. Which one is the most accurate?

a) the average value changed by 0.15% (the mean of the change values shown in Cell E63)
b) the average value changed by 0.10% (the median of the change values shown in Cell E64)
c) the average value changed by 0.14% (the difference of Cell D63 and C63)
d) the average value changed by 0.25% (the difference of Cell D64 and C64)

2. I am ignoring the change values of variables X12 and X39, as these variables did not change. Is that correct?

Thanks.

mfb · Oct 9, 2013

Certainly not the median of the differences (b).

Both distributions look nice, not too asymmetric and without outliers. I think I would compare the average values. a and c should give the same result here.
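
To illustrate why (a) and (c) are expected to agree: the mean is linear, so the mean of the paired differences equals the difference of the column means, provided both are computed over the same rows and the changes are plain differences rather than per-row percentages. The median has no such property. A minimal Python sketch, assuming made-up numbers (the lists `old` and `new` are hypothetical, not the values from the attached file):

```python
import math
from statistics import mean, median

# Hypothetical old/new values, NOT the data from the attached spreadsheet.
old = [1.0, 2.0, 4.0, 7.0, 10.0]
new = [1.5, 2.0, 4.1, 9.0, 10.2]

# Paired change for each variable.
diffs = [n - o for o, n in zip(old, new)]

mean_of_diffs = mean(diffs)                  # analogue of option (a)
median_of_diffs = median(diffs)              # analogue of option (b)
diff_of_means = mean(new) - mean(old)        # analogue of option (c)
diff_of_medians = median(new) - median(old)  # analogue of option (d)

# (a) and (c) agree up to floating-point rounding.
print(math.isclose(mean_of_diffs, diff_of_means))
# (b) and (d) are different quantities and need not agree.
print(median_of_diffs, diff_of_medians)
```

If (a) and (c) differ in a real spreadsheet, it is worth checking whether some rows were left out of one calculation or whether the change column holds relative rather than absolute changes.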

2. I am ignoring the change values of variables X12 and X39, as these variables did not change. Is that correct?

Don't ignore them; they are measured values! The lack of change could be just chance, or a result of your measurement resolution.
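
A small sketch of what dropping the unchanged variables does to the mean. The `changes` list is hypothetical (it is not the spreadsheet data); the point is only that removing zeros shrinks the denominator while leaving the sum untouched, so the reported mean change moves away from zero:

```python
from statistics import mean

# Hypothetical change values in percentage points, NOT the spreadsheet data.
# Two of the five variables did not change.
changes = [0.3, 0.0, 0.1, 0.0, 0.2]

mean_all = mean(changes)                             # keeps the unchanged variables
mean_nonzero = mean([c for c in changes if c != 0])  # drops them

print(mean_all)      # about 0.12
print(mean_nonzero)  # about 0.20, overstating the typical change
```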

musicgold · Oct 10, 2013

mfb said:

Certainly not the median of the differences (b).

Both distributions look nice, not too asymmetric and without outliers. I think I would compare the average values. a and c should give the same result here.

Can you explain why you think (b) should not be used? Is it because the distribution of change values is not symmetrical?

mfb · Oct 10, 2013

I cannot imagine a scenario where the median of the differences would have an advantage over anything else, unless the correlation between your measurement series is much stronger than the correlation within the series (so you have something like 0.10 -> 0.11, 45343.44 -> 45343.45 and similar things, together with some outliers so the mean cannot be used). But then you should not compare the series like that anyway.

musicgold · Oct 11, 2013

I am not sure what you are saying here.

I prefer the median over the mean when there are outliers in the data.

mfb said:

I cannot imagine a scenario where the median of the differences would have an advantage over anything else, unless the correlation between your measurement series is much stronger than the correlation within the series (so you have something like 0.10 -> 0.11, 45343.44 -> 45343.45

Are you talking about auto-correlation here?
Also, I don't know what you mean by 'measurement series'.

mfb · Oct 11, 2013

musicgold said:

I prefer the median over the mean when there are outliers in the data.

Right.

Are you talking about auto-correlation here?

It is related to that.

Also, I don't know what you mean by 'measurement series'.

The columns in your Excel file.
