Linear least squares method for singular matrices

In summary, the thread discusses how to find the least squares solution of a system of linear equations [tex]Ax=b[/tex] that is inconsistent and whose matrix is singular, so that the usual normal-equations formula fails. Several methods are covered: the generalized inverse, the SVD (e.g. LAPACK's DGELSS), and a column-elimination trick that embeds the underdetermined part of the system in the x vector and solves a reduced full-rank least squares problem, whose solution is unique. MATLAB's linsolve function can be used to solve such systems.
  • #1
alyflex
I have stumbled upon a problem which I have so far been unable to solve.

If we consider a general set of linear equations:
[tex]Ax=b[/tex],

I know that the system is inconsistent, which makes the least squares method the logical choice.

So the mission is to minimize [tex]||Ax-b||[/tex]

And the usual way I do this is by setting Ax = p, where p is the projection of b onto the column space of A.
Solving for x:
[tex]x=(A^TA)^{-1}A^Tp
=(A^TA)^{-1}A^Tb[/tex]

However, in my case the product [tex]A^TA[/tex] is itself singular, and thus I am unable to do this.

I'm pretty sure there is a very simple way to do this, but when I look in my old algebra book I see no solution to the problem.

Anyone know the way?
 
  • #2
Yes, that is the standard "least squares" method for solving such a problem with [itex](A^TA)^{-1}A^T[/itex] being the "generalized inverse". For most such problems [itex]A^TA[/itex] is invertible. If your A is such that [itex]A^TA[/itex] is not invertible then you have a very pathological problem for which there probably is no "simple" way to solve it. What is A?
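For concreteness, a minimal numpy sketch of this generalized-inverse recipe, with a rank check so it fails loudly when [itex]A^TA[/itex] is singular:

[code]
import numpy as np

def normal_equations_lstsq(A, b):
    # Least squares via the generalized inverse: x = (A^T A)^{-1} A^T b.
    # Only valid when A has full column rank, hence the guard.
    AtA = A.T @ A
    if np.linalg.matrix_rank(AtA) < AtA.shape[0]:
        raise np.linalg.LinAlgError("A^T A is singular; use an SVD-based method")
    return np.linalg.solve(AtA, A.T @ b)
[/code]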
 
  • #3
Isn't it true that [tex]A^TA[/tex] is only not invertible if there are actually multiple solutions to [tex]Ax=b[/tex]?
 
  • #4
Wizlem said:
Isn't it true that [tex]A^TA[/tex] is only not invertible if there are actually multiple solutions to [tex]Ax=b[/tex]?
Correct, if "solutions" is read as least squares solutions: every least squares solution satisfies the normal equations [tex]A^TAx=A^Tb[/tex], and the solution is unique if and only if [tex]A^TA[/tex] is invertible. See here for a similar discussion.
 
  • #5
I can think of a couple of possibilities, both involving minimum norm solutions. If A has more columns than rows, it is an underdetermined system that has an easy 'least norm' solution. If A has more rows than columns and is rank deficient (my guess in your case), there is a way to minimize the norm of Ax-b and the norm of x at the same time using the SVD, but I'm not familiar with the details. Look up LAPACK DGELSS.

* DGELSS computes the minimum norm solution to a real linear least
* squares problem:
*
* Minimize 2-norm(| b - A*x |).
*
* using the singular value decomposition (SVD) of A. A is an M-by-N
* matrix which may be rank-deficient.
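
In Python, a sketch of the same minimum-norm SVD solve is available through scipy.linalg.lstsq, whose lapack_driver='gelss' option routes to DGELSS (illustrated here on the rank-deficient system that comes up later in the thread):

[code]
import numpy as np
from scipy.linalg import lstsq

# Rank-deficient 3x3 system (the third column equals 2*col1 + col2).
A = np.array([[ 1., 1., 3.],
              [-1., 3., 1.],
              [ 1., 2., 4.]])
b = np.array([-2., 0., 8.])

x, res, rank, sv = lstsq(A, b, lapack_driver='gelss')
print(rank)  # 2: the numerical rank detected from the singular values
print(x)     # the residual minimizer of minimum 2-norm
[/code]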
 
  • #6
Landau said:
Correct, if "solutions" is read as least squares solutions: every least squares solution satisfies the normal equations [tex]A^TAx=A^Tb[/tex], and the solution is unique if and only if [tex]A^TA[/tex] is invertible. See here for a similar discussion.
This was what I suspected, but thanks for making it clear. Now I just need a method to compute it for non-invertible matrices.

hotvette said:
I can think of a couple of possibilities, both involving minimum norm solutions. If A has more columns than rows, it is an underdetermined system that has an easy 'least norm' solution. If A has more rows than columns and is rank deficient (my guess in your case), there is a way to minimize the norm of Ax-b and the norm of x at the same time using the SVD, but I'm not familiar with the details. Look up LAPACK DGELSS.
The more I look at this, the more afraid I am that a numerical solution might be the best way. I was really hoping for a clear and illuminating analytical method.
HallsofIvy said:
What is A?
The initial system I stumbled upon was this:

[tex]\left[ {\begin{array}{ccc}
1 & 1 & 3 \\
-1 & 3 & 1 \\
1 & 2 & 4 \\
\end{array} } \right]
\left( {\begin{array}{c}
x \\
y \\
z \\
\end{array} } \right)
= \left( {\begin{array}{c}
-2 \\
0 \\
8 \\
\end{array} } \right)[/tex]

but then I considered a simpler system with the same problem.
One example could be:

y=x
y=x+1

It is clear what the least squares solution to this system should be, but I have no idea how to find it methodically.
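
In matrix form this toy system makes the obstruction explicit:

[tex]
A = \left[ {\begin{array}{cc}
-1 & 1 \\
-1 & 1 \\
\end{array} } \right], \qquad
A^TA = \left[ {\begin{array}{cc}
2 & -2 \\
-2 & 2 \\
\end{array} } \right],
[/tex]

so [tex]A^TA[/tex] is singular here as well, and the inverse formula from my first post breaks down in exactly the same way.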
 
  • #7
Consider performing column elimination on A with an invertible matrix R, so that

[tex]
A R R^{-1}x = b
[/tex]

Then,
[tex]
\left[ {\begin{array}{ccc}
1 & 0 & 0 \\
-1 & 4 & 0 \\
1 & 1 & 0 \\
\end{array} } \right] \left( {\begin{array}{c}
x + y +3z \\
y +z \\
z \\
\end{array} } \right) =\left( {\begin{array}{c}
-2 \\
0 \\
8 \\
\end{array} } \right)
[/tex]
Now we solve a different set of linear equations: since the last new variable multiplies a zero column, it plays no role, and we can drop that column.
[tex]
\underbrace{\left[ {\begin{array}{cc}
1 & 0 \\
-1 & 4 \\
1 & 1 \\
\end{array} } \right]}_{A_r} \left( {\begin{array}{c}
x + y +3z \\
y +z
\end{array} } \right) = \left( {\begin{array}{c}
x + y +3z \\
-x +3y + z \\
x+2y+4z
\end{array} } \right) =\underbrace{\left( {\begin{array}{c}
-2 \\
0 \\
8 \\
\end{array} } \right)}_{b}
[/tex]

You can check that this reduced system is still inconsistent, so back to your least squares; but now the reduced matrix has full column rank. Call the two new variables a and b, i.e.

[tex]
\left( {\begin{array}{c}
x + y +3z \\
y +z
\end{array} } \right) = \begin{pmatrix}a\\b\end{pmatrix}
[/tex]
And MATLAB gave (from linsolve(Ar, b)):

[tex]
\begin{pmatrix}a\\b\end{pmatrix} = \begin{pmatrix}3\\1\end{pmatrix}
[/tex]

And from this solution you can construct the admissible least squares solutions: every least squares solution (x, y, z) of the original system satisfies x + y + 3z = 3 and y + z = 1. I might have made mistakes, so please check the steps, but conceptually it should be OK if I didn't mess it up. For your small example the same trick gives a = y - x, and the least squares solution is a = 0.5.

So the whole trick is to embed the underdetermined part inside the x vector and solve the least squares problem for the reduced, full-rank system. You then get infinitely many solutions of the original system, all achieving the least squares minimum.

EDIT: By the way, I forgot to write that any full rank decomposition [itex]A = MN^T[/itex] leads to a similar solution. Similarly with the SVD mentioned above.
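
A quick numerical sanity check of the steps above (a numpy sketch; the reduced matrix and right-hand side are copied from the posts):

[code]
import numpy as np

# Reduced full-rank system in the variables a = x+y+3z, b = y+z.
Ar = np.array([[ 1., 0.],
               [-1., 4.],
               [ 1., 1.]])
rhs = np.array([-2., 0., 8.])

# Ordinary least squares via the normal equations (Ar has full column rank).
ab = np.linalg.solve(Ar.T @ Ar, Ar.T @ rhs)
print(ab)  # [3. 1.]

# Cross-check: numpy's lstsq on the original singular system returns one
# particular least squares solution, and it satisfies the same invariants.
A = np.array([[1., 1., 3.], [-1., 3., 1.], [1., 2., 4.]])
x, *_ = np.linalg.lstsq(A, rhs, rcond=None)
print(x[0] + x[1] + 3*x[2], x[1] + x[2])  # 3.0 1.0 (up to rounding)
[/code]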
 
  • #8
Thank you very much for the illuminating answer, trambolin.

I'm going to work through it over the weekend using your method.

I'll post a longer and more conclusive reply once I have tried the method.

But to all of you, thank you very much for an enlightening discussion on a subject, which was causing me severe headaches yesterday. =)
 
  • #9
Well, I just went through the calculation myself, and as far as I can see everything you wrote was spot on, so thank you very much.

Btw, I had no idea that the linsolve function in MATLAB solves systems of linear equations by least squares; I thought it could only find consistent solutions.
 
  • #10
alyflex,

Try the truncated SVD of the matrix A (Demmel, Applied Numerical Linear Algebra). Let [tex]A=U\Sigma V^T[/tex] where [tex]A,U\in \Re^{m\times n}, \Sigma,V\in \Re^{n \times n}[/tex]. See en.wikipedia.org/wiki/Singular_value_decomposition for details; any good linear algebra package has an implementation. Let [tex]\hat{U}\in \Re^{m\times k},\ \hat{\Sigma} \in \Re^{k \times k},\ \hat{V}\in \Re^{n \times k}[/tex], where k is the numerical rank of A (i.e., ignore the smallest singular values).

Your least squares solution is [tex]x=A^+b=\hat{V}\hat{\Sigma}^{-1}\hat{U}^Tb[/tex].
This should work if your matrix is decently sized (around 1000 unknowns).
Another plus is that of all the residual minimizers, this x has a minimal 2-norm.
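
A numpy sketch of this truncated-SVD recipe (my illustration; the relative tolerance used for the numerical rank is an assumption, tune it to your noise level):

[code]
import numpy as np

def tsvd_solve(A, b, tol=1e-10):
    # Truncated-SVD pseudoinverse: keep only singular values above a
    # relative tolerance, i.e. the numerical rank k of A.
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    k = int(np.sum(s > tol * s[0]))
    # x = V_k Sigma_k^{-1} U_k^T b
    return Vt[:k].T @ ((U[:, :k].T @ b) / s[:k])

A = np.array([[1., 1., 3.], [-1., 3., 1.], [1., 2., 4.]])
b = np.array([-2., 0., 8.])
print(tsvd_solve(A, b))  # minimum-norm residual minimizer
[/code]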

Matthew
 

Related to Linear least squares method for singular matrices

1. What is the purpose of the linear least squares method for singular matrices?

The linear least squares method for singular matrices is used to approximate a solution to an overdetermined system of linear equations whose coefficient matrix is singular (rank-deficient). It minimizes the sum of squared errors between the actual and predicted values of the variables in the system.

2. How does the linear least squares method handle singular matrices?

It uses the pseudoinverse, a generalization of the inverse to non-invertible matrices. The pseudoinverse yields the closest approximation to a solution, in the sense of minimizing the residual, even when the matrix is singular.
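
For example, in numpy the pseudoinverse is exposed as np.linalg.pinv, which is built on the SVD and therefore tolerates singular matrices; a short sketch using the system from the thread:

[code]
import numpy as np

A = np.array([[1., 1., 3.], [-1., 3., 1.], [1., 2., 4.]])  # singular
b = np.array([-2., 0., 8.])
x = np.linalg.pinv(A) @ b  # minimum-norm least squares solution
print(x)
[/code]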

3. Can the linear least squares method for singular matrices be used for underdetermined systems?

The normal-equations form above targets overdetermined systems, where the number of equations exceeds the number of unknowns. For underdetermined systems the equations can typically be satisfied in infinitely many ways, so one instead picks a particular solution, usually the minimum norm solution, computed for example via the singular value decomposition.
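
For reference, when A has full row rank the minimum norm solution has the closed form [tex]x = A^T(AA^T)^{-1}b[/tex]; a small sketch with a made-up one-equation system:

[code]
import numpy as np

A = np.array([[1., 2., 3.]])  # one equation, three unknowns
b = np.array([6.])

# Shortest x (in the 2-norm) satisfying Ax = b exactly.
x = A.T @ np.linalg.solve(A @ A.T, b)
print(x)      # [0.42857143 0.85714286 1.28571429]
print(A @ x)  # [6.]
[/code]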

4. What are the advantages of using the linear least squares method for singular matrices?

It allows the solution of overdetermined systems with singular coefficient matrices, which the plain normal equations cannot handle. It also provides a measure of the error in the solution and can be extended to handle noisy data.

5. Are there any limitations to the linear least squares method for singular matrices?

One limitation is that it only handles linear systems; nonlinear problems require other methods. Additionally, the pseudoinverse can be sensitive to small changes in the input data (small singular values amplify noise), leading to potential inaccuracies in the solution.
