Optimizing H for Accurate Outlier Detection in Weighted Linear Least Squares

In summary, the conversation discusses a problem in which the estimated values for a set of measurements, denoted $\mathbf{\phi}$, depend linearly on another set of variables, denoted $\mathbf{x}$. The weighted linear least squares solution is $\mathbf{x} = (H^TWH)^{-1}H^TW\mathbf{\phi}$, where $W$ is a diagonal matrix. With perfect measurements there is no deviation; however, if one measurement $\phi_i$ deviates strongly, it shifts the estimates of both $\mathbf{x}$ and $\tilde{\mathbf{\phi}}$. To reject stray measurements, it is desirable for the diagonal elements of $H(H^TWH)^{-1}H^TW - I$ to dominate its off-diagonal elements in absolute value, and the thread asks whether this condition can be expressed more directly on $H$ itself.
  • #1
ilia1987
I have the following problem:
I have a set of [tex]m[/tex] measurements [tex]\mathbf{\phi}[/tex]
and I estimate a set of 3 variables [tex]\mathbf{x}[/tex]

The estimated value of [tex]\mathbf{\phi}[/tex] depends linearly on [tex]\mathbf{x}[/tex]: [tex]H\mathbf{x}=\tilde{\mathbf{\phi}}[/tex]
The solution through weighted linear least squares is:
[tex]\mathbf{x} = (H^TWH)^{-1}H^TW\mathbf{\phi}[/tex]
where [tex]W[/tex] is a diagonal matrix.

Suppose first that there is absolutely no deviation between the estimated values of [tex]\mathbf{\phi}[/tex] and the measured values (perfect measurements).
Now, suppose that one measurement [tex]\phi_i[/tex] deviates strongly from its nominal value. In that case the estimate of [tex]\mathbf{x}[/tex] changes, and so does the estimate [tex]\tilde{\mathbf{\phi}}[/tex]. If the actual measurements deviate by [tex]\delta\mathbf{\phi}[/tex], the estimated values deviate by:
[tex]\Delta\tilde{\mathbf{\phi}} = H(H^TWH)^{-1}H^TW\,\delta\mathbf{\phi}[/tex]
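For concreteness, here is a small NumPy sketch (not from the original posts; the matrices H and W and the perturbed index are made up for illustration) that builds the sensitivity matrix [tex]S = H(H^TWH)^{-1}H^TW[/tex] and shows how a single stray error spreads into all estimated values:

[code]
import numpy as np

# Toy problem: m = 6 measurements, 3 unknowns.  H and W are made up for illustration.
rng = np.random.default_rng(0)
m, n = 6, 3
H = rng.normal(size=(m, n))
W = np.diag(rng.uniform(0.5, 2.0, size=m))   # diagonal weight matrix

# Sensitivity matrix S = H (H^T W H)^{-1} H^T W, so that phi_tilde = S @ phi.
S = H @ np.linalg.solve(H.T @ W @ H, H.T @ W)

# A stray error in a single measurement (index 2 chosen arbitrarily)...
dphi = np.zeros(m)
dphi[2] = 1.0
# ...deviates every estimated value by column 2 of S:
print(S @ dphi)
[/code]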

I later use the difference [tex]\tilde{\mathbf{\phi}}-\mathbf{\phi}[/tex] to reject stray measurements.
So a situation where one stray measurement [tex]\delta\phi_i[/tex] causes a large deviation [tex]\Delta\tilde{\phi}_j[/tex] in some other component is undesirable.
I reached the conclusion that the off-diagonal elements of
[tex]H(H^TWH)^{-1}H^TW[/tex] must be smaller than the diagonal elements in order for the stray measurement to be correctly identified.

My hope is that someone on this forum can help me find a nicer, more analytical way to express that condition on [tex]H[/tex] itself rather than on this monstrous expression, or at least point me in the right direction.

Thank you.
 
  • #2
A small correction:

For the matrix
[tex]H (H^TWH)^{-1}H^TW - I_{m \times m}[/tex]
I want the absolute value of the diagonal elements to be larger than the absolute values of the off-diagonal elements.

That's because I want:
[tex]\forall j\neq i:\ |\delta\phi_i| > |\delta\phi_j| \Rightarrow |r_i| > |r_j|[/tex]
where [tex]r_k = \tilde{\phi}_k - \phi_k[/tex] are the residuals.
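To make this condition concrete: if only measurement [tex]i[/tex] is in error, the residual vector is [tex]\delta\phi_i[/tex] times column [tex]i[/tex] of [tex]M = H(H^TWH)^{-1}H^TW - I[/tex], so the requirement is that the diagonal entry dominates each column of [tex]M[/tex] in absolute value. A numerical check might look like the sketch below (my own illustration, assuming that single-error model; the helper name residual_dominance_ok is made up):

[code]
import numpy as np

def residual_dominance_ok(H, W):
    """Return True if, in every column of M = H (H^T W H)^{-1} H^T W - I, the diagonal
    entry is larger in absolute value than all off-diagonal entries.  Under a single
    stray error in measurement i, column i of M is the resulting residual pattern, so
    this is exactly the condition that the largest residual lands on the stray index."""
    m = H.shape[0]
    S = H @ np.linalg.solve(H.T @ W @ H, H.T @ W)
    M = S - np.eye(m)
    diag = np.abs(np.diag(M))
    off = np.abs(M - np.diag(np.diag(M)))   # off-diagonal magnitudes, diagonal zeroed
    return bool(np.all(diag > off.max(axis=0)))
[/code]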
 

1. What are least squares outliers?

Least squares outliers are data points that do not fit the trend or pattern of a data set when using the least squares regression method. They are values that deviate significantly from the expected values and can have a large impact on the overall regression line.

2. How are least squares outliers identified?

Least squares outliers are typically identified by calculating the residuals, which are the differences between the actual values and the predicted values from the regression line. Outliers can be identified by looking at the magnitude of the residuals and comparing them to a threshold value.
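For illustration, here is a minimal NumPy sketch of such a residual threshold; the function name and the MAD-based cutoff are example choices, not the only possibility:

[code]
import numpy as np

def flag_outliers(phi, phi_tilde, k=3.0):
    """Flag measurements whose residual exceeds k times a robust spread estimate."""
    r = phi_tilde - phi                                   # residuals
    sigma = 1.4826 * np.median(np.abs(r - np.median(r)))  # MAD-based scale estimate
    return np.abs(r) > k * sigma
[/code]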

3. What causes least squares outliers?

There are several factors that can cause least squares outliers, including measurement errors, data entry errors, incorrect assumptions about the underlying relationship between variables, or the presence of influential data points that have a disproportionate impact on the regression line.

4. How do least squares outliers affect the regression line?

Least squares outliers can have a significant impact on the regression line: they pull the fitted line toward themselves, resulting in a less accurate fit. This can also lead to an underestimation or overestimation of the strength of the relationship between the variables.

5. How can least squares outliers be dealt with?

There are several ways to deal with least squares outliers, such as removing the outliers from the data set, transforming the data to reduce the impact of the outliers, or using robust regression methods that are less influenced by outliers. It is important to carefully consider the cause of the outliers and the impact of removing them before making any adjustments to the data.
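As a sketch of the last option, here is a minimal iteratively reweighted least squares loop with Huber weights (an illustration only, not code from the thread; c = 1.345 is the usual Huber tuning constant):

[code]
import numpy as np

def irls_huber(H, phi, c=1.345, n_iter=20):
    """Robust fit of phi ~ H x by iteratively reweighted least squares with Huber
    weights: measurements with large residuals are progressively down-weighted, so
    a stray measurement pulls far less on the fitted x than in ordinary least squares."""
    m = H.shape[0]
    w = np.ones(m)
    for _ in range(n_iter):
        W = np.diag(w)
        x = np.linalg.solve(H.T @ W @ H, H.T @ W @ phi)   # weighted LS step
        r = phi - H @ x
        s = 1.4826 * np.median(np.abs(r)) + 1e-12         # robust residual scale
        u = np.abs(r) / s
        w = np.where(u <= c, 1.0, c / u)                  # Huber weight function
    return x
[/code]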
