Set of solutions to the least-squares problem via QR decomposition
The set of solutions to the least-squares problem
data:image/s3,"s3://crabby-images/58082/58082f2549f0a925d6cf20abc1686a7c7ba9be2b" alt="Rendered by QuickLaTeX.com \min _x\|A x-y\|_2^2 \text {, }"
where , and
are given, can be expressed in terms of the full QR decomposition of
:
data:image/s3,"s3://crabby-images/258b7/258b72daae31e8499007dceb4994e7c6a33eb10a" alt="Rendered by QuickLaTeX.com A P=Q R, \quad R=\left(\begin{array}{cc} R_{11} & R_{12} \\ 0 & 0 \end{array}\right),"
where is upper triangular and invertible,
is a permutation matrix, and
is
and orthogonal.
Precisely we have , with
a matrix whose columns span the nullspace of
:
data:image/s3,"s3://crabby-images/253e3/253e38aae3caccbef3b0dbed9d3634931f7306d4" alt="Rendered by QuickLaTeX.com x_0=P\left(\begin{array}{c} R_{11}^{-1} y_1 \\ 0 \end{array}\right), \quad N=\left(\begin{array}{c} R_{11}^{-1} R_{12} \\ I \end{array}\right) \in \mathbf{R}^{n \times(n-r)} ."
Proof: Since and
are orthogonal, we have, with
:
data:image/s3,"s3://crabby-images/e6cc6/e6cc6a99d062e97740eff5c20c8a5006b3dbf10e" alt="Rendered by QuickLaTeX.com A x-y=A P P^T x-y=Q R \bar{x}-y=Q\left(R \bar{x}-Q^T y\right)=Q(R \bar{x}-\bar{y})"
Exploiting the fact that leaves Euclidean norms invariant, we express the original least-squares problem in the equivalent form:
data:image/s3,"s3://crabby-images/356cf/356cf9b29f6f51c99f7db78ef3a6319aaa605c34" alt="Rendered by QuickLaTeX.com \min _{\bar{x}}\|R \bar{x}-\bar{y}\|_2 \text {. }"
Once the above is solved, and is found, we recover the original variable
with
.
Now let us decompose and
in a manner consistent with the block structure of
, with
two
-vectors. Then
data:image/s3,"s3://crabby-images/92148/921489eb52ebd2be8f55198b179948fb3ae7ade5" alt="Rendered by QuickLaTeX.com R \bar{x}-\bar{y}=\left(\begin{array}{c} R_{11} x_1+R_{12} x_2-y_1 \\ -y_2 \end{array}\right),"
which leads to the following expression for the objective function:
data:image/s3,"s3://crabby-images/cb081/cb081a3b13ca0785a3449daa88433ec786d9b152" alt="Rendered by QuickLaTeX.com \|R \bar{x}-\bar{y}\|_2^2=\left\|R_{11} x_1+R_{12} x_2-y_1\right\|_2^2+\left\|y_1\right\|_2^2 ."
The optimal choice for the variables is to make the first term zero, which is achievable with
data:image/s3,"s3://crabby-images/4ec59/4ec5913015df816d5747e7926fd6237c0483c08a" alt="Rendered by QuickLaTeX.com x_1=R_{11}^{-1}\left(y_1-R_{12} x_2\right),"
where is free and describes the ambiguity in the solution. The optimal residual is
.
We are essentially done: with , we can write
data:image/s3,"s3://crabby-images/ce8a9/ce8a97cad8c0b673a23cfc21d659018f00a26991" alt="Rendered by QuickLaTeX.com P^T x=\bar{x}=\left(\begin{array}{c} x_1 \\ x_2 \end{array}\right),\left(\begin{array}{c} R_{11}^{-1} y_1 \\ 0 \end{array}\right)+\left(\begin{array}{c} R_{11}^{-1} R_{12} \\ I \end{array}\right) z,"
that is: , with
data:image/s3,"s3://crabby-images/ec5dc/ec5dc9426410dab3735b92efb6cce798ad813125" alt="Rendered by QuickLaTeX.com x_0=P\left(\begin{array}{c} R_{11}^{-1} y_1 \\ 0 \end{array}\right), N=P\left(\begin{array}{c} R_{11}^{-1} R_{12} \\ I \end{array}\right) \in \mathbf{R}^{n \times(n-r)}"