Margin of error for standard deviation

In summary: There is some exact formula for the standard deviation of a sample of three, but it is complicated. It would be better to generate a few hundred triples (x1, x2, x3) and compute the sample standard deviation of each, and then look at the distribution of those sample standard deviations. That would tell you about the distribution of your sigma.In summary, the conversation discusses the calculation of the margin of error for the standard deviation when each data point has a margin of error E. The standard deviation for 3 data points is defined as the square root of the sum of the squared differences between each data point and the mean, divided by 3. The conversation also mentions the possibility of finding the maximum margin of
  • #1
CarlosMarti12
8
0
Hello, I am trying to calculate what the margin of error for the standard deviation would be if each data point has a margin of error E. The standard deviation (for 3 data points) is defined as

[itex]σ = \sqrt{\frac{(x_{1}-μ)^{2}+(x_{2}-μ)^{2}+(x_{3}-μ)^{2}}{3}}[/itex]

Where

[itex]μ = \frac{x_{1}+x_{2}+x_{3}}{3}[/itex]

In other words, [itex]x_{n}[/itex] for n = 1, 2, or 3 has a margin of error of ±E. Is there a way to find what the maximum margin of error would be for the standard deviation, given the margin of error for each individual data point? (Notice that the margin of error would affect the mean value of μ as well.)

Any help would be greatly appreciated. Thanks!
 
Physics news on Phys.org
  • #2
CarlosMarti12 said:
The standard deviation (for 3 data points) is defined as

[itex]σ = \sqrt{\frac{(x_{1}-μ)^{2}+(x_{2}-μ)^{2}+(x_{3}-μ)^{2}}{3}}[/itex]

Where

[itex]μ = \frac{x_{1}+x_{2}+x_{3}}{3}[/itex]

This is apparently a "sample standard deviation" rather than the standard deviation of a probability distribution. You can define the sample standard deviation of a sample of 3 things that way, but some texts would say divide by 2 instead of by 3 in your first equation. Let's say we define it your way.

In other words, [itex]x_{n}[/itex] for n = 1, 2, or 3 has a margin of error of ±E. Is there a way to find what the maximum margin of error would be for the standard deviation, given the margin of error for each individual data point? (Notice that the margin of error would affect the mean value of μ as well.)

If you are talking about "margin of error" in the sense of the rules of thumb used in lab measurements, then I don't know the answer. These rules of thumb seem to vary from textbook to textbook.

If you are talking about a real math problem --- well, now your talking!

Let the errors in [itex] x_1, x_2,x_3 [/itex] be respectively [itex] e_1, e_2, e_3 [/itex].


Define

[tex] \mu_e = \frac{ (x_1 + e_1) + (x_2 + e_2) + (x_3 + e_3)}{3} [/tex]

[tex] \sigma_e = \sqrt{ \frac{(x_1 + e_1 - \mu_e)^2 + (x_2 + e_2 - \mu_e)^2 + (x3 + e_3 - \mu_e)^2 }{3}} [/tex]

[tex] F(e_1,e_2,e_3) = \sigma_e - \sigma [/tex]

Regarding [itex] x_1, x_2, x_3 [/itex] as constants the problem is:

Given

[tex] -E \le e_1 \le E [/tex]
[tex] - E \le e_2 \le E [/tex]
[tex] -E \le e_3 \le E [/tex]

Find the tightest bounds [itex] L, U [/itex] such that

[tex] L \le F(e_1,e_2,e_3) \le U [/tex]

If we have to, we can approach this as multi-variable caclulus problem:

Find the max and min values of [itex] F(e_1,e_2,e_3) [/itex] subject to the above constraints on the [itex] e_i [/itex].

That would involve taking partial derivative of [itex] F [/itex] with respect to the [itex] e_i [/itex] and setting them equal to zero. If would also involve checking all the "boundary" cases. Before we worry about that, let's see if the above is a correct statement of you question.

In a probabilistic setting for errors this formulation gives a worst case analysis, which may be an unlikely case.
 
  • #3
CarlosMarti12 said:
Hello, I am trying to calculate what the margin of error for the standard deviation would be if each data point has a margin of error E. The standard deviation (for 3 data points) is defined as

[itex]σ = \sqrt{\frac{(x_{1}-μ)^{2}+(x_{2}-μ)^{2}+(x_{3}-μ)^{2}}{3}}[/itex]

Where

[itex]μ = \frac{x_{1}+x_{2}+x_{3}}{3}[/itex]

In other words, [itex]x_{n}[/itex] for n = 1, 2, or 3 has a margin of error of ±E. Is there a way to find what the maximum margin of error would be for the standard deviation, given the margin of error for each individual data point? (Notice that the margin of error would affect the mean value of μ as well.)

Any help would be greatly appreciated. Thanks!

By "margin of error ±E" you mean a confidence interval? I am not sure what you mean.
 
  • #4
CarlosMarti12, you ask for the "maximum margin of error". Taking that literally, think of it geometrically. The feasible 'true' values for the triple x1, x2, x3 form a cube of side 2E. The point where they all equal the observed mean may be inside or outside the cube. Sigma is (proportional to) the distance from that point to a point in the cube.
So you can just evaluate the distance from the 'mean point' to each of the eight vertices to find the max sigma.
The min sigma is more complicated. It can be 0, or the distance to the nearest face of the cube, or the distance to the nearest edge of the cube, or the distance to the nearest vertex of the cube. This will be according to whether the mean lies inside the +/-E range of all, 2, 1 or none of the measurements.

That said, I would consider the standard deviation of sigma more interesting. For that, it would be best to reinterpret E as some number of standard deviations of the measurements.
 
  • #5


Hello,

Thank you for your question. The margin of error for the standard deviation can be calculated using the formula:

ME = t * \frac{s}{\sqrt{n}}

Where ME is the margin of error, t is the critical value from the t-distribution based on the desired confidence level, s is the sample standard deviation, and n is the sample size.

In your case, since you have 3 data points, n = 3. Also, the sample standard deviation can be calculated using the formula you provided:

s = \sqrt{\frac{(x_{1}-μ)^{2}+(x_{2}-μ)^{2}+(x_{3}-μ)^{2}}{3}}

Therefore, the margin of error for the standard deviation would be:

ME = t * \frac{\sqrt{\frac{(x_{1}-μ)^{2}+(x_{2}-μ)^{2}+(x_{3}-μ)^{2}}{3}}}{\sqrt{3}}

We can see that the margin of error will be affected by the individual data point's margin of error, as well as the sample size and the confidence level. The maximum margin of error for the standard deviation would depend on the maximum margin of error for each individual data point.

I hope this helps. Let me know if you have any further questions. Thank you.
 

What is the margin of error for standard deviation?

The margin of error for standard deviation is a measure of the range within which the true value of the standard deviation is likely to fall. It is calculated by multiplying the standard error of the mean by a critical value based on the desired confidence level.

Why is it important to consider the margin of error for standard deviation?

The margin of error for standard deviation is important because it provides a measure of uncertainty in the estimate of the true value of the standard deviation. It allows us to understand the level of confidence we have in our data and can help us make more informed decisions.

How is the margin of error for standard deviation calculated?

The margin of error for standard deviation is calculated by multiplying the standard error of the mean by a critical value based on the desired confidence level. The standard error of the mean is calculated by dividing the standard deviation by the square root of the sample size.

What factors can affect the margin of error for standard deviation?

The margin of error for standard deviation can be affected by several factors, including the sample size, the level of confidence desired, and the variability of the data. A larger sample size and a lower desired confidence level will result in a smaller margin of error.

How can we use the margin of error for standard deviation in data analysis?

The margin of error for standard deviation can be used in data analysis to determine the precision of our estimates and to compare the variability of different data sets. It can also be used to identify outliers and to understand the level of confidence in our data and results.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
919
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
993
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
822
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
2
Replies
37
Views
4K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
848
  • Set Theory, Logic, Probability, Statistics
Replies
28
Views
3K
Back
Top