Section ROD  Rank One Decomposition

From A First Course in Linear Algebra
Version 2.30
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/

T h i s S e c t i o n i s a D r a f t, S u b j e c t t o C h a n g e s

Our first decomposition applies only to diagonalizable (Definition DZM) matrices, and yields a decomposition into a sum of very simple matrices.

Theorem  ROD
Rank One Decomposition
Suppose that A is a diagonalizable matrix of size n and rank r . Then there are r square matrices A 1 , A 2 , A 3 , , A r , each of size n and rank 1 such that

A = A 1 + A 2 + A 3 + + A r

Furthermore, if λ 1 , λ 2 , λ 3 , , λ r are the nonzero eigenvalues of A , then there are two sets of r linearly independent vectors from n ,

X = x 1 , x 2 , x 3 , , x r Y = y 1 , y 2 , y 3 , , y r

such that A k = λ k x k y k t , 1 k r .

Proof   The proof is constructive. Generally, we will diagonalize A , creating a nonsingular matrix S and a diagonal matrix D . Then we split up the diagonal matrix into a sum of matrices with a single nonzero entry (on the diagonal). This fundamentally creates the decomposition in the statement of the theorem, the remainder is just bookkeeping. The vectors in X and Y will result from the columns of S and the rows of S 1 .

Let λ 1 , λ 2 , λ 3 , , λ n be the eigenvalues of A (repeated according to their algebraic multiplicity). If A has rank r , then dim N A = n r (Theorem RPNC). The null space of A is the eigenspace of the eigenvalue λ = 0 (Theorem EMNS), so it follows that the algebraic multiplicity of λ = 0 is n r , α A 0 = n r . Presume that the complete list of eigenvalues is ordered so that λ k = 0 for r + 1 k n .

Since A is hypothesized to be diagonalizable, there exists a diagonal matrix D and an invertible matrix S , such that D = S 1 A S . We can rearrange tis equation to read, A = S D S 1 . Also, the proof of Theorem DC says that the diagonal elements of D are the eigenvalues of A and we have the flexibility to assume they lie on the diagonal in the same order as we have specified above. Now, let X = x 1 , x 2 , x 3 , , x n be the columns of S , and let Y = y 1 , y 2 , y 3 , , y n be the rows of S 1 converted to column vectors. With little motivation other than the statement of the theorem, define size n matrices A k , 1 k n by A k = λ k x k y k t . Finally, let D k be the size n matrix that is totally zero, other than having λ k in row k and column k .

With everything in place, we compute entry-by-entry,

A i j = S D S 1 i j   Definition DZM = S k = 1 n D k S 1 i j   Definition MA = S k = 1 n D k S 1 i j   Theorem MMDAA = k = 1 n S D k S 1 i j   Theorem MMDAA = k = 1 n S D k S 1 i j   Definition MA = k = 1 n = 1 n S D k i S 1 j   Theorem EMP = k = 1 n = 1 n p = 1 n S i p D k p S 1 j   Theorem EMP = k = 1 n S i k D k k k S 1 k j   D k p = 0  if  p k , or  k = k = 1 n S i k λ k S 1 k j D k k k = λ k = k = 1 n λ k S i k S 1 k j   Property CMCN = k = 1 n λ k x k i 1 y k t 1 j  Definition of  X Y = k = 1 n λ k q = 1 1 x k i q y k t q j = k = 1 n λ k x k y k t i j   Theorem EMP = k = 1 n λ k x k y k t i j   Definition MSM = k = 1 n A k i j  Definition of  A k = k = 1 n A k i j   Definition MA

So by Definition ME we have the desired equality of matrices. The careful reader will have noted that A k = O , r + 1 k n , since λ k = 0 in these instances. To get the sets X and Y from X and Y , simply discard the last n r vectors. We can safely ignore (or remove) A r + 1 , A r + 2 , , A n from the summation just derived.

One last assertion to check. What is the rank of A k , 1 k r ? Every row of A k is a scalar multiple of y k t , row k of the nonsingular matrix S 1 ( Theorem MIMI). As a row of a nonsingular matrix, y k t cannot be all zeros. In particular, row i of A k is obtained as a scalar multiple of y k t by the scalar α k x k i . We have restricted ourselves to the nonzero eigenvalues of A , and as S is nonsingular, some entry of x k is nonzero. This all implies that some row of A k will be nonzero. Now consider row-reducing A k . Swap the nonzero row up into row 1. Use scalar multiples of this row to zero out every other row. This leaves a single nonzero row in the reduced row-echelon form, so A k has rank one.

We record two observations that was not stated in our theorem above. First, the vectors in X , chosen as columns of S , are eigenvectors of A . Second, the product of two vectors from X and Y in the opposite order, by which we mean y i t x j , is the entry in row i and column j of the matrix product S 1 S = I n ( Theorem EMP). In particular,

y i t x j = 1  if  i = j 0  if  i j

We give two computational examples. One small, one a bit bigger.

Example  ROD2
Rank one decomposition, size 2
Consider the 2 × 2 matrix,

A = 1 6 6 4 5 1 7

By the techniques of Chapter E we find the eigenvalues and eigenspaces,

λ 1 = 2 A 2 = 1 3 λ 2 = 1 A 1 = 2 5

With n = 2 distinct eigenvalues, Theorem DED tells us that A is diagonalizable, and with no zero eigenvalues we see that A has full rank. Theorem DC says we can construct the nonsingular matrix S with eigenvectors of A as columns, so we have

S = 1 2 3 5 S 1 = 5 2 3 1

From these matrices we obtain the sets of vectors

X = 1 3 , 2 5 Y = 5 2 , 3 1

And we have the matrices,

A 1 = 2 1 3 5 2 t = 2 5 2 1 5 6 = 1 0 4 3 0 1 2 A 2 = ( 1 ) 2 5 3 1 t = ( 1 ) 6 2 1 5 5 = 6 2 1 5 5

And you can easily verify that A = A 1 + A 2 .

Here’s a slightly larger example, and the matrix does not have full rank.

Example  ROD4
Rank one decomposition, size 4
Consider the 4 × 4 matrix,

B = 3 4 1 8 1 6 4 4 2 4 1 9 3 6 1 8 3 6 3 6 1 8 6 3

By the techniques of Chapter E we find the eigenvalues and eigenvectors,

λ 1 = 3 B 3 = 1 2 1 1 , 1 1 1 2 λ 2 = 2 B 2 = 1 2 0 0 λ 3 = 0 A 0 = 2 3 2 2

The algebraic and geometric multiplicities of each eigenvalue are equal, so Theorem DMFE tells us that A is diagonalizable. With a single zero eigenvalue we see that A has rank 4 1 = 3 . Theorem DC says we can construct the nonsingular matrix S with eigenvectors of A as columns, so we have

S = 1 1 1 2 2 1 2 3 1 1 0 2 1 2 0 2 S 1 = 4 2 0 1 8 4 1 1 1 0 1 0 6 3 1 1

Since r = 3 , we need only collect three vectors from each of these matrices,

X = 1 2 1 1 , 1 1 1 2 , 1 2 0 0 Y = 4 2 0 1 , 8 4 1 1 , 1 0 1 0

And we obtain the matrices,

B 1 = 3 1 2 1 1 4 2 0 1 t = 3 4 2 0 1 8 4 0 2 4 2 0 1 4 2 0 1 = 1 2 6 0 3 2 4 1 2 0 6 1 2 6 0 3 1 2 6 0 3 B 2 = 3 1 1 1 2 8 4 1 1 t = 3 8 4 1 1 8 4 1 1 8 4 1 1 1 6 8 2 2 = 2 4 1 2 3 3 2 4 1 2 3 3 2 4 1 2 3 3 4 8 2 4 6 6 B 3 = ( 2 ) 1 2 0 0 1 0 1 0 t = ( 2 ) 1 0 1 0 2 0 2 0 0 0 0 0 0 0 0 0 = 2 0 2 0 4 0 4 0 0 0 0 0 0 0 0 0

Then we verify that

B = B 1 + B 2 + B 3 = 1 2 6 0 3 2 4 1 2 0 6 1 2 6 0 3 1 2 6 0 3 + 2 4 1 2 3 3 2 4 1 2 3 3 2 4 1 2 3 3 4 8 2 4 6 6 + 2 0 2 0 4 0 4 0 0 0 0 0 0 0 0 0 = 3 4 1 8 1 6 4 4 2 4 1 9 3 6 1 8 3 6 3 6 1 8 6 3