Standard deviation revised by removing a sample

cdux · Apr 25, 2008

but only knowing the previous standard deviation, the previous mean (and the sample to be removed).

does anyone know how to do it?

alphysicist · Apr 25, 2008

Hi cdux,

If I'm understanding the question, I think you would also need to know the original number of samples.

weatherhead · Apr 25, 2008

this is correct, you need also to know the sample size

cdux · May 9, 2008

I had eventually found equations that would do that (what the OP suggests).

alphysicist · May 9, 2008

cdux,

Do you mean you can find the revised standard deviation without knowing how many samples there are to begin with? If so, perhaps I'm misunderstanding your original question. Would you post the equations?

cdux · May 9, 2008

It should be similar to the next to last here:

en.wikipedia.org/wiki/Standard_deviation
(after "Similarly for sample standard deviation:")

after working out a new mean by simply "((num_of_samples X old_mean) -removed_value)/(num_of_samples - 1)" it should be possible to work out a new 's' by solving first to find the "old" summation of the squares and then using it as "result of the summation of the squares minus square of the removed value". (because the main problem is that we don't know the individual squares since we don't know the values but we may be able to find their summation)

alphysicist · May 10, 2008

If you know the old standard deviation, old mean, original number of sample, and the sample to remove, it's possible to find the new standard deviation.

There was just some confusion because you did not mention knowing the original number of samples in the problem statement. If you do not know the original number then you cannot determine the new standard deviation.

cdux · May 10, 2008

oh I'm sorry, you're right about that.

Standard deviation revised by removing a sample

1. What is the purpose of removing a sample when calculating standard deviation?

2. How is standard deviation revised by removing a sample?

3. Can removing a sample affect the overall interpretation of the data?

4. Are there any limitations to removing a sample when calculating standard deviation?

5. When is it appropriate to remove a sample when calculating standard deviation?

Similar threads

Hot Threads

Recent Insights