Errors in fitting to data, relationship to residue

mikelee8a · Feb 6, 2012

Hi,

I'd like to fit a straight line to some data that is noisy, with Gaussian noise of some standard deviation.

Using least squares, I can estimate the slope and intercept. I'd like to know the uncertainty in these numbers. I can find the residuals, which I believe give a measure of the variance of the noise.

Using a simulation with N points, each with noise standard deviation σ, I find that the variation in the estimated slope is proportional to σ/√(N^3), which I can't explain. I'd have expected σ/√N, as for the standard error.

Any help would be fantastic. I just want to know what error to quote with my fitted gradient.

Mike

mmwave · Feb 6, 2012

Dear Mike,

Thank you for your post. It seems like you have already made good progress in fitting a straight line to your noisy data using least squares. To get the uncertainty in the slope and intercept, you need the standard error of each estimate, which is the square root of its variance. Both follow from the residual sum of squares (RSS), the degrees of freedom (N-2), and the spread of the x values, Sxx = ∑(x - x̄)^2, where x̄ is the mean of x:

Standard error of slope = √(RSS/((N-2)*Sxx))
Standard error of intercept = √(RSS*(∑x^2)/(N*(N-2)*Sxx))

Here RSS/(N-2) is the estimate of the noise variance σ^2 that you were referring to when you mentioned the residuals.
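In case it is useful, here is a minimal sketch of that calculation in Python with NumPy. It assumes ordinary least squares with equal, independent Gaussian noise on every y value and no error in x. The function name `fit_line_with_errors` is just an illustrative choice, and the code uses the standard textbook expressions: slope variance s^2/Sxx and intercept variance s^2*∑x^2/(N*Sxx), with s^2 = RSS/(N-2) and Sxx = ∑(x - x̄)^2.

```python
import numpy as np


def fit_line_with_errors(x, y):
    """Least-squares line y = intercept + slope*x with standard errors.

    Assumes equal, independent noise on y and no error in x.
    Returns (slope, intercept, se_slope, se_intercept).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    if n < 3:
        raise ValueError("need at least 3 points to estimate the errors")

    x_mean = x.mean()
    y_mean = y.mean()

    # Spread of the x values about their mean.
    sxx = np.sum((x - x_mean) ** 2)

    # Least-squares estimates of slope and intercept.
    slope = np.sum((x - x_mean) * (y - y_mean)) / sxx
    intercept = y_mean - slope * x_mean

    # Residual sum of squares and the noise-variance estimate s^2.
    residuals = y - (intercept + slope * x)
    rss = np.sum(residuals ** 2)
    s2 = rss / (n - 2)

    # Standard errors of the two fitted parameters.
    se_slope = np.sqrt(s2 / sxx)
    se_intercept = np.sqrt(s2 * np.sum(x ** 2) / (n * sxx))
    return slope, intercept, se_slope, se_intercept


# Small made-up data set, purely for illustration.
slope, intercept, se_slope, se_intercept = fit_line_with_errors(
    [0.0, 1.0, 2.0, 3.0], [0.0, 1.0, 2.0, 4.0]
)
print(f"slope     = {slope:.3f} +/- {se_slope:.3f}")
print(f"intercept = {intercept:.3f} +/- {se_intercept:.3f}")
```

The value to quote with your fitted gradient would then be `se_slope`.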

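The σ/√(N^3) scaling that you observed in your simulation most likely comes from the x values rather than from the noise. The variance of the fitted slope is σ^2 divided by the sum of squared deviations of x about its mean, ∑(x - x̄)^2 (often written Sxx). If your simulated points are equally spaced with a fixed step Δ (for example x = 1, 2, ..., N), then ∑(x - x̄)^2 = Δ^2*N*(N^2-1)/12, which grows like N^3 for large N, so the standard error of the slope falls off as σ*√12/(Δ*N^(3/2)). Adding points at fixed spacing does two things at once: it averages down the noise, and it lengthens the range of x over which the slope is measured. That is why the slope improves faster than 1/√N. If you instead keep the x range fixed and pack more points into it, ∑(x - x̄)^2 grows only like N, and you recover the σ/√N behaviour you expected.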
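A quick way to check the scaling is to repeat the fit many times on simulated data and compare the scatter of the fitted slopes with σ/√(∑(x - x̄)^2). The sketch below does this with NumPy. The helper names `slope_se_theory` and `slope_se_simulated`, the true line y = 2x + 1, and the number of trials are arbitrary choices for illustration, not anything taken from your simulation.

```python
import numpy as np


def slope_se_theory(x, sigma):
    """Predicted standard error of the slope: sigma / sqrt(sum((x - mean)^2))."""
    x = np.asarray(x, dtype=float)
    return sigma / np.sqrt(np.sum((x - x.mean()) ** 2))


def slope_se_simulated(x, sigma, trials=2000, seed=0):
    """Scatter of the least-squares slope over many simulated noisy data sets."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    dx = x - x.mean()
    sxx = np.sum(dx ** 2)

    # One row per trial: an arbitrary true line plus Gaussian noise.
    y = 2.0 * x + 1.0 + rng.normal(0.0, sigma, size=(trials, x.size))

    # Least-squares slope for every row at once.
    slopes = (y - y.mean(axis=1, keepdims=True)) @ dx / sxx
    return slopes.std(ddof=1)


sigma = 1.0
print("N, unit spacing (theory, simulated), fixed range (theory)")
for n in (10, 20, 40, 80):
    unit_step = np.arange(n)                # x = 0, 1, ..., N-1
    fixed_range = np.linspace(0.0, 1.0, n)  # N points packed into [0, 1]
    print(
        n,
        slope_se_theory(unit_step, sigma),
        slope_se_simulated(unit_step, sigma),
        slope_se_theory(fixed_range, sigma),
    )
```

With unit spacing, each doubling of N should shrink the slope error by a factor of about 2^(3/2), while with a fixed x range the factor should approach √2 as N grows.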

I hope this helps to explain the unexpected behavior of the standard error in your simulation. Remember to always report the standard error along with your estimated slope and intercept to accurately convey the uncertainty in your fitted line. Good luck with your research!

