Image Compression

Singular Value Decomposition

The Singular Value Decomposition (SVD) states that for every matrix A of size m×n there exist two sets of vectors, {v1,v2,,vn} orthogonal in Rn, {u1,u2,,um} orthogonal in Rm and real numbers σ1σ2σr>0 such that


Avi=σiui,i=1,r,

where r is the rank of the matrix A. And


Avj=0j=r+1,,n.

In matricial notation:


A[v1vrvn]=[u1urum][σ10σr00]

That is


AV=UΣ

As the vectors in the set {v1,v2,,vn} are orthonormal then the matrix V with columns vi is orthogonal, that is VVt=In. Using this and multiplying by VT on both sides of the expression we get

A=UΣVT

The SVD of the matrix A.

Image compression and the Eckart-Young-Mirsky Theorem

Given a matrix A of size m×n and rank r, we know that for every i>r Avi=0, so there are parts in AV=UΣ that don't contribute to the multiplication. The important part in the product of those matrices is in the first r sigma vectors and sigma values, this way we get


A[v1vr]=[u1ur][σ1σr]

And we still have VrVrT=Ir so multiplying on both sides by VrT we get to the reduced SVD decomposition of A

Ar=UrΣrVrT

The Eckart-Young-Mirsky theorem states that for every unitarily invariant norm |||| then the best k-rank approximation of a matrix A is Ak, that is

minr(B)=k||AB||=||AAk||.

This is particularly useful in data science and we shall show how good this approximations are by compressing an image. An image is a 2-dimensional array of pixels, each pixel, represented with a square is also an array formed by four numerical values RGBA, this values represent "how much" Red, Green, Blue and Alpha (alpha is the transparency of the image) there is in that particular pixel, this values range from 0 to 255.

An image can be converted into greyscale by setting the three values of RGB to the average of them in the original image, that is setting for each pixel the array [R+G+B3,R+G+B3,R+G+B3], doing so allows us to transform an image into a big matrix in which every value represents the average of colours in the corresponding pixel.


Using what we've just learned about the SVD, we know due to the Eckart-Young-Mirsky theorem that the image matrix can be very well aproximated by one of smaller range, making the image lighter. This is what we do in the Image Compression section.