What is the correct way of describing this change - mean or median?

musicgold
Messages
303
Reaction score
19
Hi,

Please see the attached Excel file.

The list shows the old and new values of a set of variables. I am trying to understand what is the best way – average or median - to describe the change the values of the set. I want to describe the true central tendency of the change.

1. I think there are the following four ways I can describe the change in the values of the set . Which one is the most accurate?

a) the average value changed by 0.15% (the mean of the change values shown in Cell E63)
b) the average value changed by 0.10% (the median of the change values shown in Cell E64)
c) the average value changed by 0.14% (the difference of Cell D63 and C63)
d) the average value changed by 0.25% (the difference of Cell D64 and C64)
2. I am ignoring the change values of variables X12 and X39, as these variables did not change. Is that correct?

Thanks.
 

Attachments

Physics news on Phys.org
Certainly not the median of the differences (b).

Both distributions look nice, not too asymmetric and without outliers. I think I would compare the average values. a and c should give the same result here.

2. I am ignoring the change values of variables X12 and X39, as these variables did not change. Is that correct?
Don't ignore them, they are measured values! This could just be by chance, and also a result of your measurement resolution.
 
  • Like
Likes 1 person
mfb said:
Certainly not the median of the differences (b).

Both distributions look nice, not too asymmetric and without outliers. I think I would compare the average values. a and c should give the same result here.

Can you explain why you think B should not be used? Is it because the distribution of change values is not symmetrical?
 
I cannot imagine a scenario where the median of the differences would have an advantage over anything else, unless the correlation between your measurement series is much stronger than the correlation within the series (so you have something like 0.10 -> 0.11, 45343.44 -> 45343.45 and similar things, together with some outliers so the mean cannot be used). But then you should not compare the series like that anyway.
 
:confused: I am not sure what you are saying here.

I prefer the median over the mean when there are outliers in the data.

mfb said:
I cannot imagine a scenario where the median of the differences would have an advantage over anything else, unless the correlation between your measurement series is much stronger than the correlation within the series (so you have something like 0.10 -> 0.11, 45343.44 -> 45343.45
Are you talking about auto-correlation here?
Also, I don't know what you mean by 'measurement series'.
 
musicgold said:
I prefer the median over the mean when there are outliers in the data.
Right.

Are you talking about auto-correlation here?
It is related to that.
Also, I don't know what you mean by 'measurement series'.
Columns in your excel file.
 
Hi all, I've been a roulette player for more than 10 years (although I took time off here and there) and it's only now that I'm trying to understand the physics of the game. Basically my strategy in roulette is to divide the wheel roughly into two halves (let's call them A and B). My theory is that in roulette there will invariably be variance. In other words, if A comes up 5 times in a row, B will be due to come up soon. However I have been proven wrong many times, and I have seen some...
Thread 'Detail of Diagonalization Lemma'
The following is more or less taken from page 6 of C. Smorynski's "Self-Reference and Modal Logic". (Springer, 1985) (I couldn't get raised brackets to indicate codification (Gödel numbering), so I use a box. The overline is assigning a name. The detail I would like clarification on is in the second step in the last line, where we have an m-overlined, and we substitute the expression for m. Are we saying that the name of a coded term is the same as the coded term? Thanks in advance.
Back
Top