Linear combination of data with uncertainty

Click For Summary
SUMMARY

The discussion focuses on calculating the uncertainty of a linear combination of two independent measurements, specifically ##y = 0.23x_1 + 0.55x_2##, where ##x_1 = 100 \pm 1## and ##x_2 = 94 \pm 10##. The correct method involves using the variance formula for independent random variables: $$Var(aX+bY) = a^2\ Var(X) + b^2\ Var(Y)$$, leading to the error calculation $$error(aX+bY) = \sqrt{a^2\ (error(X) )^2 + b^2\ (error(Y))^2}$$. The discussion also touches on the complexities of fitting a line to the data points and extracting uncertainties for the slope and intercept.

PREREQUISITES
  • Understanding of linear combinations of random variables
  • Familiarity with variance and standard deviation concepts
  • Knowledge of error propagation techniques
  • Basic statistics, particularly linear regression analysis
NEXT STEPS
  • Study the concept of "Propagation of Errors" in statistical analysis
  • Learn about variance calculations for linear combinations of independent variables
  • Explore linear regression techniques for estimating slope and intercept uncertainties
  • Investigate the application of the linearity of expectation in statistical modeling
USEFUL FOR

Statisticians, data analysts, researchers in experimental sciences, and anyone involved in quantitative data analysis requiring uncertainty calculations.

Malamala
Messages
348
Reaction score
28
Hello! I have 2 measured data points (they are measurements of different observable, not 2 measurement of the same observable), with quite different errors, say ##x_1 = 100 \pm 1## and ##x_2 = 94 \pm 10##. I want to compute the value (and associated uncertainty) of a linear combination of them, say ##y = 0.23x_1 + 0.55x_2##. What is the right way to do it, accounting for their different uncertainties? (This is basically equivalent to fitting the 2 points with a straight line, and extracting the uncertainty on the slope and intercept).
 
Physics news on Phys.org
Malamala said:
This is basically equivalent to fitting the 2 points with a straight line, and extracting the uncertainty on the slope and intercept)
That's not right. Measuring the uncertainty of the slope and intercept of the line is different to, and more complex than, measuring the uncertainty of the linear combination ##y##.
A typical approach would be to assume that the errors of the two measurements are independent. Since the most common error measurement is a standard deviation, which is the square root of a variance, we can then use the rule for variances of linear combinations of independent random variables, which is that:
$$Var(aX+bY) = a^2\ Var(X) + b^2\ Var(Y)$$
whence
$$error(aX+bY) = \sqrt{a^2\ (error(X) )^2+ b^2\ (error(Y))^2}$$
Substitute your errors from above, with ##a=0.23,b=0.55## and you'll get the answer.
 
  • Like
Likes   Reactions: Twigg
andrewkirk said:
That's not right. Measuring the uncertainty of the slope and intercept of the line is different to, and more complex than, measuring the uncertainty of the linear combination ##y##.
A typical approach would be to assume that the errors of the two measurements are independent. Since the most common error measurement is a standard deviation, which is the square root of a variance, we can then use the rule for variances of linear combinations of independent random variables, which is that:
$$Var(aX+bY) = a^2\ Var(X) + b^2\ Var(Y)$$
whence
$$error(aX+bY) = \sqrt{a^2\ (error(X) )^2+ b^2\ (error(Y))^2}$$
Substitute your errors from above, with ##a=0.23,b=0.55## and you'll get the answer.
Thank you for this! But what would be the actual value for ##aX+bY##? Do I just plug in the values for ##X## and ##Y##? Do I account for the uncertainties on X and Y when computing aX+bY?

About the linear fit, if I wanted to fit a straight line to these 2 points, how can I get the uncertainty on the slope and intercept of the fit?
 
@Malala: Look up 'Propagation of Erros'. It deals with the error/variance in functions of Random variables.
 
Malamala said:
Thank you for this! But what would be the actual value for ##aX+bY##? Do I just plug in the values for ##X## and ##Y##? Do I account for the uncertainties on X and Y when computing aX+bY?

About the linear fit, if I wanted to fit a straight line to these 2 points, how can I get the uncertainty on the slope and intercept of the fit?
You can't know the actual value, because X,Y are Random Variables. This means/implies you can know their long-term distribution but cannot tell the actual values. Best you can do is use linearity of expectation (Edit: Assuming independence of X,Y):

$$E(aX+bY) =aE(X)+bE(Y) $$
 
Last edited:
  • Like
Likes   Reactions: Twigg
Malamala said:
About the linear fit, if I wanted to fit a straight line to these 2 points.
What two points are you talking about?
 
  • Like
Likes   Reactions: BvU

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
4K
  • · Replies 28 ·
Replies
28
Views
3K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 21 ·
Replies
21
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 26 ·
Replies
26
Views
3K