Diagonalization by a Similarity Transformation
Definition.
A matrix is diagonalizable by a similarity
transformation if there exists an invertible matrix
such that
(1)
where is a diagonal matrix.
We say that is a diagonalizing matrix
or that it diagonalizes the matrix
Lemma.
Consider matrices
Columns
of the matrix are eigenvectors
of the matrix
(2)
if and only if
(3)
where
Indeed, according to the column rule of matrix multiplication:
and thus the conditions (2) and (3) equivalent.
Theorem 7.
A matrix is diagonalizable by a similarity
transformation (1) if and only if the space
has a basis
consisting of eigenvectors of the matrix
Then the matrix
whose columns are vectors from the basis diagonalizes the matrix
Proof.
A matrix is diagonalizable by a similarity
transformation if there exists an invertible matrix
such that
where is a diagonal matrix:
This is equivalent to the conditions
The condition means that
comprises a linearly independent set of vectors of the space
Moreover, by the above Lemma:
In an -dimensional vector space every set of linearly
independent vectors is a basis. Therefore, since so
is a basis of the space
it is the basis consisting of eigenvectors of the matrix
On the other hand, if eigenvectors of the matrix
span the space then the matrix
whose columns are the basis vectors diagonalizes the matrix
Comments and corollaries.
1.) Every matrix has at least
one eigenvalue and the associated eigenvector
Hence, because the equation (2) does not
require that the eigenvalues
and the associated eigenvectors
are distinct, there always exists a matrix
such that the equation (3) holds.
In particular, one may take
Then
but the matrix is not invertible and thus
the relation does not hold (1).
2.) The formula may be interpreted
in terms of transformation of a matrix of a linear operator under a change of basis.
Consider the space with the canonical basis
Let be the matrix of a linear
operator defined by
If eigenvectors
of the operator are linearly independent, then the matrix
is the transition matrix from the canonical basis
to the basis consisting of the eigenvectors.
Hence, is a matrix of the operator in the basis
consisting of its eigenvectors. As one should expect,
this is a diagonal matrix with the eigenvalues of on the diagonal.
3.) We know already that the eigenevectors of a linear operator which
are associated to different eigenvalues are linearly independent.
Corollary. If a matrix has
distinct eigenvalues, then there exists a similarity transformation
which diagonalizes this matrix.
Indeed, if columns of the matrix are eigenvectors
of the matrix which are associated
with distinct eigenvalues, then the matrix
is non-degenerate: and thus invertible.
4.) Eigenvectors of a normal operator which are associated
with distinct eigenvalues comprise an orthogonal system, and after normalization - an orthonormal system.
A matrix whose columns comprise an orthonormal system is unitary.
Corollary. Let be a
normal (e.g. Hermitian or unitary) matrix. If
has distinct eigenvalues, then there exists a unitary
similarity transformation which diagonalizes this matrix (a diagonalizing
matrix is unitary:
Remark. A normal matrix does not have to have
distinct eigenvectors to be diagonalizable. Namely, one can prove a more general
Theorem 8.
A matrix is diagonalizable by a unitary similarity transformation if and only if it is normal.
Application to real matrices.
For a real matrix
we have
Therefore
(a real Hermitian matrix is symmetric), and
(a real unitary matrix is orthogonal).
Theorem 9.
Every real symmetric or orthogonal matrix is
diagonalizable by a unitary similarity transformation.
Eigenvalues, and thus also eigenvectors, of a real symmetric matrix
are real. Hence, a unitary diagonalizing matrix is a real orthogonal matrix.
Corollary. Every real symmetric matrix is
diagonalizable by a real orthogonal similarity transformation.
In comparison to the previous case, eigenvalues of a real orthogonal matrix
(and thus also its eigenvectors) may be complex and not real.
Then the unitary diagonalizing matrix will be also
complex and not real.
Theorem 10.
If a matrix is diagonalizable by a similarity
transformation, then an algebraic multiplicity of every eigenvalue is equal
to the geometric multiplicity.
Proof.
If a transformation
diagonalizes the matrix
then where
are eigenvalues of the
matrix
The number with which occurs on a diagonal of the matrix
is equal to both an algebraic and geometric multiplicity of this eigenvalue.