Weighted least squares residuals

  • Context: Graduate 
  • Thread starter Thread starter vibe3
  • Start date Start date
  • Tags Tags
    Least squares Squares
Click For Summary
SUMMARY

The discussion focuses on the calculation of weighted least squares residuals in the context of robust regression, specifically using the Huber M-estimator. The normalized residuals can be defined as r_i = (y_i - Y(x_i)) / (σ√(1 - h_i)), where h_i represents statistical leverages and σ is the residual standard deviation. The weights w_i, determined through an iterative least squares procedure, are not included in the residual calculation as they are not parameters of the predictive model. The adjusted residual is defined as r_{adj} = r_i / √(1 - h_i), which is essential for accurate analysis.

PREREQUISITES
  • Understanding of weighted least squares regression
  • Familiarity with M-estimators, particularly the Huber method
  • Knowledge of statistical leverages in regression analysis
  • Experience with software tools such as MATLAB or SAS for statistical modeling
NEXT STEPS
  • Study the implementation of Huber M-estimator in statistical software
  • Learn about statistical leverage and its impact on regression analysis
  • Explore the bisquare weight method in robust regression
  • Investigate the use of adjusted residuals in model diagnostics
USEFUL FOR

Statisticians, data analysts, and researchers involved in regression modeling and outlier detection will benefit from this discussion, particularly those utilizing robust regression techniques.

vibe3
Messages
39
Reaction score
1
Hello,

When doing a weighted least squares fit of a model to data, I want to examine the residuals to see if their histogram matches the expected probability distribution. Since I am minimizing
<br /> \chi^2 = \sum_i w_i \left[ y_i - Y(x_i) \right]^2<br />
would I define my (normalized/studentized) residuals as
<br /> r_i = \frac{\sqrt{w_i} ( y_i - Y(x_i) )}{\sigma \sqrt{1 - h_i}}<br />
or in the usual way as
<br /> r_i = \frac{y_i - Y(x_i)}{\sigma \sqrt{1 - h_i}}<br />
where h_i are the statistical leverages and \sigma is the residual standard deviation. I am using a robust (iterative) least squares procedure to determine the weights w_i in order to detect and eliminate outliers, so the residual histogram should match the expected distribution of the M-estimator function I'm using (Huber in my case).
 
Physics news on Phys.org
I don't know what a "statistical leverage" is.

If the weights are just something used in the fitting process and are not parameters of the predictive model then the weights are not involved in the calculation of an individual "residual" of the model, by the usual definition of "residual". You'll get better advice if you state the problem precisely. If you are using an example of M-estimation as a guide, can you give a link to it?
 
It looks like to me that you're using a bisquare weight method. You will want to use the normal definition of the residual and with adjusted residual defined as r_{adj} = \frac{r_{i}}{\sqrt{1-h_i}}. Where r_{i} is your residual as defined normally and h_{i} is your leverage. Matlab or SAS will produce this as your weighted least squares as the default.
 

Similar threads

Replies
1
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
24
Views
3K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 3 ·
Replies
3
Views
7K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 7 ·
Replies
7
Views
11K