Positive Semi-Definite Matrices
- Definitions
- Special cases and examples
- Square root and Cholesky decomposition
- Ellipsoids
Definitions
For a given symmetric matrix , the associated quadratic form is the function
with values
data:image/s3,"s3://crabby-images/8798c/8798cd768c39f43e874d2d51afe040b61d9c9fb1" alt="Rendered by QuickLaTeX.com q(x) = x^TAx."
- A symmetric matrix
is said to be positive semi-definite (PSD, notation:
) if and only if the associated quadratic form
is non-negative everywhere:
data:image/s3,"s3://crabby-images/d0c95/d0c958e885af7c4561d19c87d5c79d4cb0ec5060" alt="Rendered by QuickLaTeX.com q(x) \ge 0 \quad \forall x\in \mathbb{R}^n."
- It is said to be positive definite (PD, notation:
) if the quadratic form is non-negative and definite, that is,
if and only if
.
It turns out that a matrix is PSD if and only if the eigenvalues of are non-negative. Thus, we can check if a form is PSD by computing the eigenvalue decomposition of the underlying symmetric matrix.
A quadratic form , with
is non-negative (resp. positive-definite) if and only if every eigenvalue of the symmetric matrix
is non-negative (resp. positive).
By definition, the PSD and PD properties are properties of the eigenvalues of the matrix only, not of the eigenvectors. Also, if the matrix
is PSD, then for every matrix
with
columns, the matrix
also is.
Special cases and examples
Symmetric dyads
Special cases of PSD matrices include symmetric dyads. Indeed, if for some vector
, then for every
:
data:image/s3,"s3://crabby-images/09785/097853593a43ee674fbf255c6c9fa8582f9e0de0" alt="Rendered by QuickLaTeX.com q_A(x) = x^Tuu^Tx = (u^Tx)^2 \ge 0."
More generally if , then
is PSD, since
data:image/s3,"s3://crabby-images/536ff/536ff16032b6166b8948ddc3421192fe54ea617f" alt="Rendered by QuickLaTeX.com q_A(x) = x^TB^TBx = ||Bx||_2^2 \ge 0."
Diagonal matrices
A diagonal matrix is PSD (resp. PD) if and only if all of its (diagonal) elements are non-negative (resp. positive).
Examples of PSD matrices
- Covariance matrix.
- Laplacian matrix of a graph.
- Gram matrix of data points.
Square root and Cholesky decomposition
For PD matrices, we can generalize the notion of the ordinary square root of a non-negative number. Indeed, if is PSD, there exists a unique PSD matrix, denoted
, such that
. We can express this matrix square root in terms of the SED of
, as
, where
is obtained from
by taking the square root of its diagonal elements. If
is PD, then so is its square root.
Any PSD matrix can be written as a product for an appropriate matrix
. The decomposition is not unique, and
is only a possible choice (the only PSD one). Another choice, in terms of the SED of
, is
. If
is positive-definite, then we can choose
to be lower triangular, and invertible. The decomposition is then known as the Cholesky decomposition of
.
Ellipsoids
There is a strong correspondence between ellipsoids and PSD matrices.
Definition
We define an ellipsoid to be an affine transformation of the unit ball for the Euclidean norm:
data:image/s3,"s3://crabby-images/6f8c0/6f8c0565120b7958aa266a0ec1ac00851d234b11" alt="Rendered by QuickLaTeX.com {\bf E} = \{\hat{x} +Lz: ||z||_2 \leq 1 \},"
where is an arbitrary non-singular matrix. We can express the ellipsoid as
data:image/s3,"s3://crabby-images/fb6e2/fb6e29109975682559dc412617bfcb712d16cd80" alt="Rendered by QuickLaTeX.com \mathbf{E}=\left\{x:\left\|L^{-1}(x-\hat{x})\right\|_2 \leq 1\right\}=\left\{x:(x-\hat{x})^T A^{-1}(x-\hat{x}) \leq 1\right\},"
where is PD.
Geometric interpretation via SED
We can interpret the eigenvectors and associated eigenvalues of in terms of the geometrical properties of the ellipsoid, as follows. Consider the SED of
:
, with
and
diagonal, with diagonal elements positive. The SED of its inverse is
. Let
. We can express the condition
as
data:image/s3,"s3://crabby-images/492f5/492f58556727fbf515bc433f275bc35aa9c6c373" alt="Rendered by QuickLaTeX.com \tilde{x}^T\Lambda \tilde{x} = \sum\limits_{i=1}^n \lambda_i \tilde{x}_i^2 \leq 1."
Now set ,
. The above writes
: in
-space, the ellipsoid is simply a unit ball. In
-space, the ellipsoid corresponds to scaling each
-axis by the square roots of the eigenvalues. The ellipsoid has principal axes parallel to the coordinate axes in
-space. We then apply a rotation and a translation, to get the ellipsoid in the original
-space. The rotation is determined by the eigenvectors of
, which are contained in the orthogonal matrix
. Thus, the geometry of the ellipsoid can be read from the SED of the PD matrix
: the eigenvectors give the principal directions, and the semi-axis lengths are the square root of the eigenvalues.
![]() |
The graph on the left shows the ellipsoid ![]() ![]() The matrix ![]() We check that the columns of |
The above shows in particular that an equivalent representation of an ellipsoid is
data:image/s3,"s3://crabby-images/8687d/8687d10bfac07511722bcb685ad654fed2a41963" alt="Rendered by QuickLaTeX.com \{x: (x-\hat{x})^TB(x-\hat{x}) \leq 1 \}"
where is PD.
It is possible to define degenerate ellipsoids, which correspond to cases when the matrix in the above, or its inverse
, is degenerate. For example, cylinders or slabs (intersection of two parallel half-spaces) are degenerate ellipsoids.