Average of standard deviation makes sense?

Click For Summary
SUMMARY

The discussion centers on the validity of calculating the mean of standard deviations from a set of test results, specifically in the context of performance testing. The user presents a dataset from Test1, where averages and standard deviations are computed for different sizes. It is concluded that while the mean of standard deviations (SD(mean)) may not be particularly useful, understanding the total standard deviation (SD(total)) is crucial for process improvement. The discussion emphasizes that combining datasets should focus on variances rather than standard deviations.

PREREQUISITES
  • Understanding of standard deviation and variance concepts
  • Familiarity with statistical analysis in performance testing
  • Knowledge of data aggregation techniques
  • Basic proficiency in interpreting statistical distributions
NEXT STEPS
  • Research the implications of using variance versus standard deviation in data analysis
  • Learn about the Central Limit Theorem and its impact on statistical distributions
  • Explore methods for combining datasets effectively in statistical analysis
  • Investigate process improvement techniques that utilize standard deviation analysis
USEFUL FOR

Statisticians, data analysts, performance testers, and process improvement professionals seeking to enhance their understanding of statistical measures in test result analysis.

xeon123
Messages
90
Reaction score
0
I've a set of tests, and for each test, they have different size.

I'll show an example.

Code:
Test1
size - Time(seconds)
100 - 10
100 - 23
100 - 17
200 - 37
200 - 42
200 - 47
300 - 53
300 - 53
300 - 53

For each test, I took the average.
Code:
Average1
size - average
100 - 16
200 - 42
300 - 53

And for each entry of the Test1, I took the standard deviation. Does't in make sense calculate the mean of the standard deviation?

Doing an average of standard deviation can proof that all examples ran at similar time? For example, having low average in the standard deviation means that all the 3 results of 100 were similar among themselves, the 3 results of 200 were similar results among themselves, and the 3 results of 300 were similar among themselves?
 
Physics news on Phys.org
There are a few procedures used to test the hypothesis that the variances of two groups are equal, but they tend to be extremely unreliable (especially for your small N).

The standard deviation has a distribution, of course, and the distribution tends to be pretty normal for large N (which you don't have, so your distribution is going to be quite skewed). You're not going to be able to say much with confidence; the best thing to do may be to just point to the standard deviations of the three groups and note that they're pretty close to each other.
 
I just showed here a small example. I've a set with N=100.
 
When combining data sets you should work with the variances and not the standard deviations.
 
mathman said:
When combining data sets you should work with the variances and not the standard deviations.

Where variance is SD2.

So I presume that SD(total) = √(SD12 + SD22...SDn2)

And SD(mean) = SD(total)/n

While I can't see a good use for SD(mean), being able to understand SD(total) is a powerful tool in process improvement. Often you can't reduce all variation in a process but you can sure tackle some of them individually and reduce the total that way.
 
Bandit127 said:
Where variance is SD2.

So I presume that SD(total) = √(SD12 + SD22...SDn2)

And SD(mean) = SD(total)/n

While I can't see a good use for SD(mean), being able to understand SD(total) is a powerful tool in process improvement. Often you can't reduce all variation in a process but you can sure tackle some of them individually and reduce the total that way.
Do you have a further question?
 
Since you have not taken issue with it, I presume the workings are correct.

Should I take for granted that all the readers of this thread know what variance is and how to sum and take the average of standard deviations? If so my post is redundant.

If not you could have prevented me posting by explaining it yourself.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 9 ·
Replies
9
Views
6K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K