Solving Linear Equations via SVD

L. El Ghaoui

Solving Linear Equations via SVD

Solution set of a linear equation
Pseudo-inverse
Sensitivity analysis and condition number

Solution set

where $A in mathbf{R}^{m times n}$ and $y in mathbf{R}^m$ are given. We can completely describe the set of solutions via SVD, as follows. Let us assume that admits an SVD given here. With $A = Utilde{ {S}}V^T$ , pre-multiply the linear equation by the inverse of , U^T ; then we express the equation in terms of the rotated vector $tilde{x} = V^Tx$ . This leads to

where $tilde{y} := U^Ty$ is the ‘‘rotated’’ right-hand side of the equation. Due to the simple form of tilde{ {S}} , the above writes

$sigma_i tilde{x}_i = tilde{y}_i , ;; i = 1,ldots, r, ;; 0 = tilde{y}_i , ;; i=r+1,ldots,m.$

Two cases can occur.

If the last components of are not zero, then the above system is infeasible, and the solution set is empty. This occurs when is not in the range of .
If is in the range of , then the last set of conditions in the above system hold, and we can solve for with the first set of conditions:

$tilde{x}_i = frac{tilde{y}_i}{sigma_i} , ;; i = 1,ldots, r.$

The last n-r components of tilde{x} are free. This corresponds to elements in the nullspace of . If is full column rank (its nullspace is reduced to {0} ), then there is a unique solution.

Pseudo-inverse

Definition

The solution set is conveniently described in terms of the pseudo-inverse of , denoted by A^dagger , and defined via the SVD of :

$A = U tilde{ {S}} V^T, ;; tilde{ {S}} = mbox{bf diag}(sigma_1,ldots,sigma_r,0,ldots,0)$

as one with same SVD, with non-zero singular values inverted, and the matrix tilde{S} transposed:

$A^dagger := V tilde{ {S}}^dagger U^T, ;; tilde{ {S}} = mbox{bf diag}(sigma_1^{-1},ldots,sigma_r^{-1},0,ldots,0).$

The pseudo-inverse of a matrix is always well-defined, and that it has the same size as the transpose A^T . When the matrix is invertible (it is square and full column or row rank: m=n=r ), then it reduces to the inverse.

Example: pseudo-inverse of a 4 times 5 matrix.

Matlab syntax

>> Adagger = pinv(A);

Link with solution set

From the above development, we see that the solution set can be written as

$mathbf{S} = left{ A^dagger y + z ~:~ z in {cal N}(A) right},$

where {cal N}(A) is the nullspace of . Both A^dagger and a basis for the nullspace can be computed via the SVD.

Case when is full rank

If is full column rank, the pseudo-inverse can be written as

$A^dagger = (A^TA)^{-1}A^T.$

In that case, A^dagger is a left-inverse of , since A^dagger A = I_n .

If is full row-rank, then the pseudo-inverse can be written as

$A^dagger = A^T(AA^T)^{-1}.$

In that case, A^dagger is a right-inverse of , since AA^dagger = I_m .

Sensitivity analysis and condition number

Sensitivity analysis refers to the process of quantifying the impact of changes in the linear equations’ coefficients (the matrix and vector ), on the solution. To simplify, let us assume that is square and invertible, and analyze the effects of errors in only. The condition number of the matrix quantifies this.

We start from the linear equation above, which has the unique solution $x = A^{-1} y$ . Now assume that is changed into y+delta y , where delta y is a vector that contains the changes in . Then let us denote by x + delta x the new solution, which is $A^{-1} (y + delta y)$ . From the equations

$delta x = A^{-1} delta y, ;;$

and via the definition of the largest singular value norm, we obtain:

$|delta x|_2 le |A^{-1}| cdot |delta y|_2 , ;; |y|_2 le |A| cdot |x|_2 .$

Combining the two inequalities we obtain:

$frac{|delta x|_2}{|x|_2} le kappa(A) cdot frac{|delta y|_2}{|y|_2} ,$

where kappa(A) is the condition number of , defined as

$kappa(A) := |A| cdot |A^{-1}|.$

We can express the condition number as the ratio between the largest and smallest singular values of :

$kappa(A) = frac{sigma_1(A)}{sigma_n(A)}.$

The condition number gives a bound on the ratio between the relative error in the left-hand side to that of the solution. We can also analyze the effect of errors in the matrix itself on the solution. The condition number turns out to play a crucial role there as well.

Solving Linear Equations via SVD

Solution set

Pseudo-inverse

Definition

Link with solution set

Case when is full rank

Sensitivity analysis and condition number

License

Share This Book