Statistics - Standard Deviation, Standard Error and Mean

Learn
Messages
1
Reaction score
0
Hello,

Just had a question regarding statistical analysis.

I'm trying to calculate the average of 4 numbers from a data set of 6 numbers in excel without manually choosing to average only the 4 numbers.

e.g.


85 20 32 45 27 3 (total mean = 35.3 desired mean = 31)
100 30 27 40 21 1
...etc

The middle 4 numbers represent a more realistic result whilst the two end numbers are irrelevant.

I've tried looking into weighted averages however I'm unsure how to apply this to an excel sheet.

I'd prefer to find a formula which would discount numbers which vary wildly from the average amount and focus only on the few that don't.

Any help would be greatly appreciated.
 
Physics news on Phys.org
You have six numbers - if you picked four of them at random, what would be their average?
That about it?

Clearly that depends on the four numbers - but you can work out each possible combination of 4, and work out the average for each one. The expectation value of the averages will be your answer.
 
A better strategy than excluding observations arbitrarily would be to compute "robust" estimates of the distribution, e.g. the median instead of the mean and the median absolute deviation (MAD) as an estimator for the variability.
See
http://en.wikipedia.org/wiki/Robust_statistics
 
As a general comment, to reinfoce what was already said, in distributions that are not symmetrical or not unimodal, and or distributions with outliers, the variance is not a good measure of spread. You may also want to represent your data with a dot plot or stemplot:

http://en.wikipedia.org/wiki/Stem-and-leaf_plot
 
Hi all, I've been a roulette player for more than 10 years (although I took time off here and there) and it's only now that I'm trying to understand the physics of the game. Basically my strategy in roulette is to divide the wheel roughly into two halves (let's call them A and B). My theory is that in roulette there will invariably be variance. In other words, if A comes up 5 times in a row, B will be due to come up soon. However I have been proven wrong many times, and I have seen some...
Thread 'Detail of Diagonalization Lemma'
The following is more or less taken from page 6 of C. Smorynski's "Self-Reference and Modal Logic". (Springer, 1985) (I couldn't get raised brackets to indicate codification (Gödel numbering), so I use a box. The overline is assigning a name. The detail I would like clarification on is in the second step in the last line, where we have an m-overlined, and we substitute the expression for m. Are we saying that the name of a coded term is the same as the coded term? Thanks in advance.
Back
Top