Finding the Uncertainty of the Slope Parameter of a Liner Regression

AI Thread Summary
Finding the uncertainty of the slope parameter in linear regression involves understanding how measurement uncertainties in both x and y affect the estimate of b. The common approach is to use a linearized asymptotic estimate, where the slope b is derived from a function of the observed data points. When measurement errors exist in both variables, the total least squares regression method is often recommended, as it accounts for errors in x_i as well as y_i. The formula for b provided is specific to ordinary least squares regression, which assumes no error in x_i measurements. The discussion highlights the complexity of estimating the standard deviation of b when measurement errors are present, indicating that further exploration of specific formulas may be necessary.
richardc
Messages
7
Reaction score
1
Finding the Uncertainty of the Slope Parameter of a Linear Regression

Suppose I have measurements x_i \pm \sigma_{xi} and y_i \pm \sigma_{yi} where \sigma is the uncertainty in the measurement. If I use a linear regression to estimate the value of b in y=a+bx, I'm struggling to find a straightforward way to compute the uncertainty of b that arises from the measurement uncertainties. This seems like it should be a very common problem, so I'm not sure why I can't find a simple algorithm or formula.

Thank you for any advice.
 
Last edited:
Physics news on Phys.org
Are you using "uncertainty" to mean "standard deviation"?

It's a common problem, but it's not simple. After all, your data gives only one value for b, so how can you estimate the standard deviation of b from a sample of size 1 ?

The common way to get an answer is to oversimplify matters and compute a "linearized asymptotic" estimate. The value of b is some function F of the (x_i,y_i). Let L be the linear approximation for the function F. Assume that near the observed values in the sample that this well approximates the random variable b as a linear combination of the x_i and y_i. When you have a random variable expressed as linear combination of other random variables, you can work on expressing its standard deviation in terms of the standard deviations of the other random variables.

That's the general picture. If it's what you want to do then we can try to look up the specifics. I don't know them from memory.
 
Thank you for clarifying the problem.

With N observation pairs I believe I can write b=\frac{N \sum x_i y_i - \sum x_i \sum y_i}{N \sum x_i^2 - (\sum x_i)^2}.

I suppose the propagation of error formula \sigma_f^2=\sum (\frac{\partial f}{\partial x_i} \sigma_{x_i} )^2 is then applied to a linear approximation of b?
 
You state a problem where there is an error in measurement for x_i as well as for y_i. In such a problem, people often use "total least squares" regression. I think the computation of the slope in "total least squares" regression is different than in ordinary least square regression, which assumes no error in the measurement of the x_i. I think the formula you gave for b is for ordinary least squares regression.

Of course, one may ask the question: If I fit a straight line to data using the estimator for slope used in ordinary least squares regression and my data also has errors in the x_i then what is the standard deviation of this estimator. If that's the question, you need terms involving \frac{\partial f}{\partial y_i} \sigma^2_{y_i} and \frac{\partial f}{\partial x_i} \sigma^2 x_i

I don't know if the estimator for slope in ordinary least squares regression is an unbiased estimator if there are errors in the x_i.
 
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...
Back
Top