Image Compression
In image compression applications, we are given the pixel representation of a ‘‘target’’ image, as a vector in . We would like to represent the image as a linear combination of ‘‘basic’’ images , where the matrix is called the dictionary. The picture on the left shows a total of ‘‘basic’’ images. |
To represent the image in terms of the dictionary, we would like to find coefficients , such that
or, more compactly, .
If the representation of the image (that is, the vector ) has many zeros (we say: is sparse), then we can represent the entire image with only a few values (the components of that are not zero). We may then, for example, send the image over a communication network at high speed. Provided the receiver has the dictionary handy, it can reconstruct perfectly the image.
In practice, it may be desirable to trade off the sparsity of the representation (via ) against the accuracy of the representation. Namely, we may prefer a representation that achieves only approximately but has way more zeros. The process of searching for a good sparsity/accuracy trade-off is called image compression.