
8. Diagonalizing a square matrix

For a square matrix $A$, to diagonalize $A$ means to find a square matrix $P$ and a diagonal matrix $D$ with $P^{-1} A P = D$.

In this case we say that $ P$ diagonalizes $ A$. Notice that $ P$ has to be nonsingular or it won't have an inverse.

Some matrices are ``diagonalizable'' and others are not.



Example. For $A = \matp{cc}{4&1\\ 1&4}$, it turns out that $P = \matp{rr}{1&-1\\ 1&1}$ diagonalizes $A$. In fact, $P^{-1} A P = D$ for $D = \matp{cc}{5&0\\ 0&3}$.
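As a quick numerical check (a sketch in Python with NumPy, not part of the original notes), we can verify this directly:

    import numpy as np

    A = np.array([[4.0, 1.0],
                  [1.0, 4.0]])
    P = np.array([[1.0, -1.0],
                  [1.0,  1.0]])

    # Compute P^{-1} A P; it should be the diagonal matrix D above.
    D = np.linalg.inv(P) @ A @ P
    print(D)   # approximately [[5, 0], [0, 3]]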

Notice that $\matp{c}{1\\ 1}$ and $\matp{r}{-1\\ 1}$ are eigenvectors of $A$, with eigenvalues $5$ and $3$ respectively. In fact, $P$ is always made from eigenvectors:



Proposition. For $n \times n$ matrices $P$ and $A$, $P$ diagonalizes $A$ (with $P^{-1} A P = D$) $\Leftrightarrow$ the columns of $P$ are linearly independent eigenvectors of $A$ and the diagonal entries of $D$ are the corresponding eigenvalues.

(The linear independence of columns is needed in order for $ P$ to be nonsingular. It wouldn't work, for instance, to have all columns be the same. For the reasoning behind the Proposition, see below.)

The Proposition tells how to diagonalize $A$: just find the eigenvalues and corresponding eigenvectors. If there are enough linearly independent eigenvectors, use them for the columns of $P$. The matrix $P$ in the Example above could have been found that way.
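In numerical work, this recipe is exactly what an eigenvalue routine automates. As an illustration (a sketch in Python with NumPy, not part of the original notes), np.linalg.eig returns the eigenvalues of $A$ together with a matrix whose columns are corresponding eigenvectors, which can serve as $P$:

    import numpy as np

    A = np.array([[4.0, 1.0],
                  [1.0, 4.0]])

    # eig returns eigenvalues and a matrix whose columns are eigenvectors.
    eigenvalues, P = np.linalg.eig(A)
    D = np.diag(eigenvalues)

    # Check the diagonalization P^{-1} A P = D (up to rounding error).
    print(np.allclose(np.linalg.inv(P) @ A @ P, D))   # True

The columns of this $P$ are unit vectors, so they may differ from eigenvectors found by hand by a nonzero scalar factor in each column; any such rescaling of an eigenvector is still an eigenvector.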

Handy facts:

(1) Eigenvectors that belong to distinct eigenvalues are linearly independent. Therefore, if $ A$ is $ n \times n$ and has $ n$ distinct eigenvalues, then $ A$ is diagonalizable. (``Distinct'' means ``all different from one another''.)

(2) A real symmetric matrix can always be diagonalized, even if the eigenvalues are not all distinct. (For example, if the characteristic polynomial factors as $(\lambda-1)(\lambda-1)(\lambda-4)(\lambda-6)$, we would say the eigenvalues are $1,1,4,6$, not all distinct.)
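Fact (2) can be illustrated numerically (again a sketch in Python with NumPy, not part of the original notes). The symmetric matrix below has characteristic polynomial $(\lambda-1)(\lambda-1)(\lambda-4)$, so its eigenvalues $1, 1, 4$ are not all distinct, yet it can still be diagonalized:

    import numpy as np

    A = np.array([[2.0, 1.0, 1.0],
                  [1.0, 2.0, 1.0],
                  [1.0, 1.0, 2.0]])   # symmetric, with eigenvalues 1, 1, 4

    # eigh is NumPy's eigenvalue routine for symmetric matrices.
    eigenvalues, P = np.linalg.eigh(A)
    print(eigenvalues)                # approximately [1, 1, 4]
    print(np.allclose(np.linalg.inv(P) @ A @ P, np.diag(eigenvalues)))   # True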

These two facts will be proved in class. Let's go back and prove the Proposition.



Proof of the Proposition. For ``$\Rightarrow$'': To say that $P$ diagonalizes $A$ means that $P^{-1} A P = D$. If we take this equation and multiply both sides by $P$ on the left, we get $AP = PD$. Now let's try to interpret this second equation using eigenvalues and eigenvectors. We need two facts:

(a) the $j$-th column of the product $AP$ is $A$ times the $j$-th column of $P$;

(b) the $j$-th column of the product $PD$ is the $j$-th column of $P$ multiplied by the scalar $d_{jj}$, the $j$-th diagonal entry of $D$.

Putting these two facts together, and writing $\mathbf{v}_1$ for the first column of $P$, we see that $A\mathbf{v}_1 = d_{11}\mathbf{v}_1$, so $\mathbf{v}_1$ is an eigenvector of $A$ and $d_{11}$ is the corresponding eigenvalue. (An eigenvector can't be $\mathbf{0}$, but obviously $\mathbf{v}_1 \ne \mathbf{0}$ since it's a column of the invertible matrix $P$.) Now look at the second column $\mathbf{v}_2$ of $P$; by the same facts we now get $A\mathbf{v}_2 = d_{22}\mathbf{v}_2$, so $\mathbf{v}_2$ is an eigenvector of $A$ with corresponding eigenvalue $d_{22}$. The other columns of $P$ work similarly. The Proposition also mentions that the columns of $P$ are linearly independent, which is the same as saying that the column rank equals the number of columns, but this is automatic since $P$ is invertible.

We still need to prove ``$\Leftarrow$'' in the Proposition. We start by assuming we are given $A$ and $P$ such that the columns of $P$ are linearly independent eigenvectors of $A$, and we let $D$ be the diagonal matrix whose diagonal entries are the corresponding eigenvalues. The same two facts as before show that $AP = PD$. Since the columns of $P$ are linearly independent, $P$ has rank $n$ and is invertible. Multiply both sides of $AP = PD$ on the left by $P^{-1}$; we get $P^{-1} A P = D$, as required.
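The column-by-column reasoning in the proof is also easy to check numerically (a sketch in Python with NumPy, not part of the original notes): the equation $AP = PD$ says exactly that $A\mathbf{v}_j = d_{jj}\mathbf{v}_j$ for each column $\mathbf{v}_j$ of $P$:

    import numpy as np

    A = np.array([[4.0, 1.0],
                  [1.0, 4.0]])
    eigenvalues, P = np.linalg.eig(A)

    # Fact (a): column j of AP is A times column j of P.
    # Fact (b): column j of PD is d_jj times column j of P.
    for j in range(P.shape[1]):
        v = P[:, j]
        print(np.allclose(A @ v, eigenvalues[j] * v))   # True for each j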



Problem U-12. Diagonalize $A = \matp{rr}{7&-4\\ 2&1}$. (Use Problem U-[*].)



Problem U-13. Explain how to interpret the equation $P^{-1} A P = D$ in terms of a change of basis. What is the transformation? The old basis? The matrix relative to the old basis? The new basis? The transition matrix? The matrix relative to the new basis?


Kirby A. Baker 2001-11-20