Propagating Uncertainty for sets of data?

In summary, the conversation discusses using linear regression to extract a value (J) from a set of data for two variables (x,y). The issue at hand is reporting the value of J with an uncertainty, given that some measurements (a, b, c) were only taken once while others (x,y) were measured multiple times. The question also arises of whether the uncertainty for J can be calculated from the confidence interval of the slope obtained from the linear regression. Suggestions for addressing this issue include making repeated measurements and considering worst-case estimates of measurement errors.
  • #1
NewtonsHead
26
0
I'm writing the paper on this experiment I just did. Basically I took sets of data for two variables (x,y) and I fit the points to a line in Origin to extract the value that I was trying to measure (J).
*Using generic variables here*
I found a value for J where

J = [8*∏*x*a(b+c)] / y

I did this by graphing the line

y(x) = [8*∏*a(b+c) / J] * x
and extracting J from the slope

The problem is reporting my value of J with an uncertainty. a, b, and c are distances that I measured only 1 time, so I know the uncertainty on those. However, x and y were both measured 10 times in 3 trials each. I fit the average of those 3 trials (10 data points) to a line to obtain J.

Anyone have experience with fitting a lot of data to a line and reporting uncertainties?
 
Physics news on Phys.org
  • #2
NewtonsHead said:
However, x and y were both measured 10 times in 3 trials each.

What errors are involved in measuring x and y? In some measurement scenarios, one sets the value of x and then measures the value of y. So the error in y depends not only on the error of the instrument that measures y, it also depends on the error made in setting x by the instrument that sets x. In other scenarios, you attempt to measure x and y at a time t and use some instrument that measures t.
 
  • #3
My two cents:
When you do a linear regression of x versus y and get the slope, you can calculate the confidence interval of the slope (see http://stattrek.com/regression/slope-confidence-interval.aspx ). Suppose S is the slope from the linear regression, and [L1, L2] is the 95% confidence interval for S.
So X = S*Y, where L1 < S < L2 with 95% confidence.

Now the question is whether you can say that L1 < [8*∏*a(b+c) / J] < L2 with 95% confidence, where a, b, c are your measurements and J has some physical meaning.

That is, can we say that (assuming L1 & L2 are positive), 8*∏*a(b+c)/L2 < J < 8*∏*a(b+c)/L1 with 95% confidence?

Your measurement of a, b, c were not part of a known statistical process. You either have to make repeated measurements and get statistics on those measurements, or you can make some worst-case estimates of the measurement errors and adjust the confidence interval for J accordingly. Most experiments have some small measurement errors and people adjust their conclusions accordingly. Hopefully, they are small enough to still get useful results.
 
  • #4
mayby you should have a look into "Data Fitting and Uncertainty - A practical introduction to weighted least squares and beyond", ISBN 978-3-8348-1022-9, This textbook explains the determination of uncertainties of model parameters quite well and also tells you something about error propagation.
 
  • #5


As a scientist, it is important to acknowledge and address uncertainties in data analysis. In this case, it seems like there are two main sources of uncertainty: the measurements of a, b, and c, and the measurements of x and y. To accurately report the value of J with its associated uncertainty, it would be helpful to first determine the uncertainty in each of these variables separately. This could involve calculating the standard deviation or error for each set of measurements.

Next, it would be important to consider how these uncertainties propagate through the equation used to calculate J. This can be done using techniques such as error propagation or Monte Carlo simulations. By propagating the uncertainties through the equation, a range of possible values for J can be determined, along with an associated uncertainty.

Another approach could be to use a regression analysis method, such as linear regression, to fit the data and obtain a more precise value for J. This method takes into account the uncertainties in both the dependent and independent variables and provides a statistical measure of the fit, which can be used to determine the uncertainty in the calculated value of J.

In conclusion, when reporting values with uncertainties, it is important to consider all sources of uncertainty and use appropriate methods to determine and propagate them through the calculations. This will provide a more accurate and comprehensive representation of the data and its associated uncertainties.
 

1. What is propagating uncertainty for sets of data?

Propagating uncertainty is a method used in science to estimate the uncertainty or error associated with a calculated value based on the uncertainties of the input data. It takes into account all possible sources of error and combines them to give a more accurate and reliable estimate of the overall uncertainty of the calculated value.

2. Why is propagating uncertainty important in scientific research?

Propagating uncertainty is important because it allows scientists to understand the limitations of their data and the potential errors in their calculations. It also helps in making informed decisions based on the reliability of the data and its implications.

3. How is propagating uncertainty calculated?

Propagating uncertainty is calculated using a mathematical formula that takes into account the uncertainties of each input value and their respective units. This formula is based on the principles of error propagation and is often represented as a formula or a diagram.

4. Can propagating uncertainty be applied to any type of data?

Yes, propagating uncertainty can be applied to any type of data, whether it is experimental, observational, or theoretical. It is a fundamental aspect of scientific research that helps in evaluating the reliability and accuracy of data.

5. How can propagating uncertainty be used to improve the accuracy of data?

Propagating uncertainty can be used to improve the accuracy of data by identifying and accounting for all potential sources of error. By taking into consideration the uncertainties associated with each input value, the overall uncertainty of the calculated value can be reduced, leading to a more accurate representation of the data.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
11
Views
753
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
888
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
489
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
16
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
18
Views
2K
Back
Top