Error bars in Excel (standard deviation, standard error, percentage)

Click For Summary

Discussion Overview

The discussion revolves around the interpretation of error bars in Excel scatterplots, specifically focusing on standard deviation (SD), standard error (SE), and confidence intervals (CI). Participants explore how these statistical measures are represented visually and conceptually in Excel, as well as their implications for data analysis.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant describes the difference between standard deviation (SD) and standard error (SE), noting that SD indicates variation within a sample while SE pertains to the spread of sample means around a population mean.
  • Another participant expresses confusion about the utility of Excel's error bars for SD and SE, suggesting that they may not provide meaningful information and recommending the use of custom error bars instead.
  • A later reply discusses the concept of confidence intervals (CI) and how they relate to predicted average values, questioning whether error bars should be placed on sample data points or predicted values.
  • Participants discuss the relationship between margin of error (ME) and SE, with one questioning which measure better describes uncertainty for predicted values.
  • One participant introduces the idea that error bars might represent residuals, indicating the difference between predicted and actual values.

Areas of Agreement / Disagreement

Participants express varying opinions on the usefulness of Excel's error bars, with some agreeing that they may not be informative, while others explore different interpretations and applications of error bars, indicating that the discussion remains unresolved.

Contextual Notes

There are unresolved questions regarding the appropriate placement of error bars and the relationship between different statistical measures (SD, SE, ME, CI) in the context of data visualization.

Who May Find This Useful

This discussion may be useful for individuals interested in data analysis, particularly those using Excel for statistical representation and seeking clarity on the interpretation of error bars.

fog37
Messages
1,566
Reaction score
108
TL;DR
understand error bars in excel
Hello,

I was working in excel on data reported in two columns, X data and Y data. I measured both X and Y, i.e. Y was not calculated using a function and the X data. The Y an X data was plotted as Y vs X using a scatterplot. The Excel graph offers the option to add error bars over each point in the scatterplot.

I know what the standard deviation SD is: the X and Y columns/sets have each a standard deviation which indicates the spread of values in X and the spread of values in Y relative to the mean of X and the mean of Y respectively.

The standard error SE is different from the SD: the set X is to be viewed a sample of a larger and unknown population. Given the sample mean of X, the SE indicates the spread of all the potential sample means around the actual population mean. We have calculated only one sample mean since we only a have a single sample. But the smaller the SE the better because a small SE indicates that our estimated sample mean is more likely to be close to the population mean....The same reasoning applies to the set Y. I hope I am correct.

So the SD is about the variation within a sample of data while the SE is about the means of a group of samples.

That said, the error bars in Excel can be vertical and/or horizontal and represent standard errors, standard deviations or percentage. What do the error bars (for example SD error bars) represent over each point? How should we interpret those error bars? Since all Y data has the same SD, are the vertical errors bars supposed to be the same length? Same goes for the horizontal error bars...

What would SE error bars mean over the points in the scatterplot? I am confused on how to interpret them...

Thank you!
 
Physics news on Phys.org
fog37 said:
That said, the error bars in Excel can be vertical and/or horizontal and represent standard errors, standard deviations or percentage. What do the error bars (for example SD error bars) represent over each point?
...
What would SE error bars mean over the points in the scatterplot? I am confused on how to interpret them...
I can understand your confusion: the "Standard Deviation" and "Standard Error" options don't really do anything useful. Having said that, it is fairly easy to work out what they are showing. Why don't you try selecting the different options and viewing the results: if any are unclear you could post them here and someone might confirm your interpretations, although as I say they don't show anything useful so I wouldn't waste your time on it.

If you want to show meaningful error bars then you need to calculate them yourself and add them with the Custom option: there are many tutorials on the interweb for this ("excel custom error bars").
 
  • Like
Likes   Reactions: fog37 and BvU
pbuk said:
I can understand your confusion: the "Standard Deviation" and "Standard Error" options don't really do anything useful. Having said that, it is fairly easy to work out what they are showing. Why don't you try selecting the different options and viewing the results: if any are unclear you could post them here and someone might confirm your interpretations, although as I say they don't show anything useful so I wouldn't waste your time on it.

If you want to show meaningful error bars then you need to calculate them yourself and add them with the Custom option: there are many tutorials on the interweb for this ("excel custom error bars").
Will do thank you for the input. I can now create custom vertical error bars that represent the confidence interval CI.
My confusion is more conceptual: the confidence interval CI applies to the predicted "average" value Y. We get the predicted mean value after plotting the best fit regression curve for the X,Y data. The CI is created this way: regression predicted Y values ##\pm## margin of error ME.

The error bars should not go on the sample data points themselves, correct? But that is what Excel does: it places the error bars on the data points themselves...

Also, the CI is calculated using the margin of error ME which, in turn, is calculated using the standard error SE. That said, the SE represents the spread/standard deviation of the average value IF we collected a multitude of samples. The smaller SE, the more likely our estimated average is close to the population average. The ME is related to SE. Which one, the ME or the SE, makes more sense to describe the uncertainty of the predicted value Y for a certain X input? I am confused about that. The CI also describes how close the estimated average value is to the population value, I guess..
 
fog37 said:
The error bars should not go on the sample data points themselves, correct?
...
Which one, the ME or the SE, makes more sense to describe the uncertainty of the predicted value Y for a certain X input?
Depends, there are many ways of presenting uncertain data. What experience do you have in data science? There are lots of resources on the interweb, some good some bad. MIT's Open Courseware is often a good starting point: https://ocw.mit.edu/courses/res-6-009-how-to-process-analyze-and-visualize-data-january-iap-2012/
 
Last edited:
  • Like
Likes   Reactions: WWGD
Maybe this refers to residuals, i.e., in ##(x_i, y_i)##, the error bars may refer to the difference between the prediction y_i^ of ##y_i ## by the model, minus the actual value ##y_i##.
 

Similar threads

  • · Replies 42 ·
2
Replies
42
Views
6K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 6 ·
Replies
6
Views
8K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K