How Can We Find the Minimum 2-Norm of Ax-y Using an Orthogonal Matrix?

  • Context: MHB 
  • Thread starter Thread starter mathmari
  • Start date Start date
  • Tags Tags
    Minimum
Click For Summary

Discussion Overview

The discussion revolves around finding the minimum 2-norm of the expression \( \|Ax - y\|_2 \) where \( A \) is expressed as a product of an orthogonal matrix \( Q \) and an upper triangular matrix \( R \). Participants explore the mathematical derivation and properties of orthogonal matrices, triangular matrices, and their implications in minimizing the norm. The focus is on theoretical reasoning and mathematical formulation.

Discussion Character

  • Exploratory
  • Mathematical reasoning

Main Points Raised

  • One participant proposes that \( A = QR \) and seeks to show that \( \min_{x\in \mathbb{R}^n}\|Ax-y\|_2=\|(Q^Ty)_{n+1}^m\|_2 \) for all \( y \in \mathbb{R}^m \).
  • Another participant suggests expressing \( Rx \) in a specific form involving an upper triangular matrix \( U \) and a zero matrix, leading to a breakdown of the norm into two components.
  • Further elaboration on the norm leads to a formulation that separates the minimization problem into components involving \( Ux \) and \( (Q^Ty)_{n+1}^m \).
  • One participant concludes that the minimum occurs when \( Ux = (Q^Ty)_1^n \), leading to a proposed result for the minimum norm.
  • Another participant confirms that the rank of \( R \) being \( n \) ensures that \( U \) is invertible, supporting the previous claims.

Areas of Agreement / Disagreement

Participants generally agree on the mathematical steps taken to derive the minimum norm, but the discussion does not reach a consensus on the implications or broader applications of the findings. Some participants express uncertainty about the next steps or the completeness of the argument.

Contextual Notes

The discussion includes assumptions about the properties of orthogonal and triangular matrices, as well as the conditions under which the minimum is derived. There is an implicit dependence on the definitions of the matrices involved and the rank conditions stated.

mathmari
Gold Member
MHB
Messages
4,984
Reaction score
7
Hey! :o

Let $A = QR$, where $Q$ is an orthogonal ($m\times m$)−matrix and $R$ is an upper ($m\times n$)-triangular matrix of rang $n$ ($m>n$).

I want to show that $$\min_{x\in \mathbb{R}^n}\|Ax-y\|_2=\|(Q^Ty)_{n+1}^m\|_2, \ \forall y\in \mathbb{R}^m$$

It is $(a)_k^l=(a_k, \ldots , a_l)^T$ if $a=(a_1, \ldots , a_l)^T\in \mathbb{R}^l$.
I have done the following:

\begin{align*}\min_{x\in \mathbb{R}^n}\|Ax-y\|_2&=\min_{x\in \mathbb{R}^n}\|QRx-y\|_2=\min_{x\in \mathbb{R}^n}\|QRx-QQ^{-1}y\|_2 \\ & =\min_{x\in \mathbb{R}^n}\|Q(Rx-Q^{T}y)\|_2=\min_{x\in \mathbb{R}^n}\|Rx-Q^{T}y\|_2\end{align*}

(We have used here the properties of an orthogonal matrix.)

How could we continue? How can we find that minimum? (Wondering)
 
Physics news on Phys.org
Hey mathmari!

We can write $Rx$ as $\binom U0x$ where $U$ is an up upper triangular nxn matrix of rank n, and 0 is a zero matrix, can't we? (Wondering)
It means that:
$$\|Rx-Q^{T}y\|^2
= \|Ux-(Q^{T}y)_1^n\|^2 + \|0-(Q^{T}y)_{n+1}^m\|^2
$$
Can we solve it now? (Wondering)
 
I like Serena said:
Hey mathmari!

We can write $Rx$ as $\binom U0x$ where $U$ is an up upper triangular nxn matrix of rank n, and 0 is a zero matrix, can't we? (Wondering)
It means that:
$$\|Rx-Q^{T}y\|^2
= \|Ux-(Q^{T}y)_1^n\|^2 + \|0-(Q^{T}y)_{n+1}^m\|^2
$$
Can we solve it now? (Wondering)

Ah ok! We have that \begin{align*}\|Rx-Q^{T}y\|_2^2&=\left \|\begin{pmatrix}U \\ 0\end{pmatrix}x-\begin{pmatrix}(Q^T)_1^n \\ (Q^T)_{n+1}^m\end{pmatrix}y\right \|_2^2 =\left \|\begin{pmatrix}Ux-(Q^Ty)_1^n \\ 0-(Q^Ty)_{n+1}^m\end{pmatrix}\right \|_2^2 \\ & =\|Ux-(Q^{T}y)_1^n\|_2^2 + \|(Q^{T}y)_{n+1}^m\|_2^2\end{align*}

Therefore, we get $$\min_{x\in\mathbb{R}^n}\|Rx-Q^{T}y\|^2=\min_{x\in\mathbb{R}^n}\{\|Ux-(Q^{T}y)_1^n\|_2^2 + \|(Q^{T}y)_{n+1}^m\|_2^2\}$$

This is minimized for that $x$ for which it holds $Ux=(Q^{T}y)_1^n$ and so we get $$\min_{x\in \mathbb{R}^n}\|Ax-y\|_2^2=\|(Q^{T}y)_{n+1}^m\|_2^2\Rightarrow \min_{x\in \mathbb{R}^n}\|Ax-y\|_2=\|(Q^{T}y)_{n+1}^m\|_2$$ right? (Wondering)
 
Yep.
And that is possible because $R$ is of rank $n$, and therefore $U$ is as well, making it an invertible matrix. (Nod)
 
I like Serena said:
And that is possible because $R$ is of rank $n$, and therefore $U$ is as well, making it an invertible matrix. (Nod)

Ok! Thanks a lot! (Yes)
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
Replies
4
Views
2K
  • · Replies 34 ·
2
Replies
34
Views
2K
  • · Replies 12 ·
Replies
12
Views
2K
Replies
12
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 12 ·
Replies
12
Views
6K
  • · Replies 4 ·
Replies
4
Views
2K