Section ROD Rank One Decomposition

From A First Course in Linear Algebra
Version 2.30
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/

This Section is a Draft, Subject to Changes

Our first decomposition applies only to diagonalizable (Definition DZM) matrices, and yields a decomposition into a sum of very simple matrices.

Theorem ROD
Rank One Decomposition
Suppose that $A$ is a diagonalizable matrix of size $n$ and rank $r$ . Then there are $r$ square matrices $A_{1}, A_{2}, A_{3}, \dots, A_{r}$ , each of size $n$ and rank $1$ such that

$\begin{array}{l} A & = A_{1} + A_{2} + A_{3} + \dots + A_{r} \end{array}$

Furthermore, if $λ_{1}, λ_{2}, λ_{3}, \dots, λ_{r}$ are the nonzero eigenvalues of $A$ , then there are two sets of $r$ linearly independent vectors from $ℂ^{n}$ ,

$\begin{array}{l} X & = \{x_{1}, x_{2}, x_{3}, \dots, x_{r}\} & Y & = \{y_{1}, y_{2}, y_{3}, \dots, y_{r}\} \end{array}$

such that $A_{k} = λ_{k} x_{k} y_{k}^{t}$ , $1 \leq k \leq r$ . $□$

Proof The proof is constructive. In outline, we diagonalize $A$, obtaining a nonsingular matrix $S$ and a diagonal matrix $D$. Then we split the diagonal matrix into a sum of matrices, each with a single nonzero entry (on the diagonal). This is essentially the decomposition in the statement of the theorem; the remainder is just bookkeeping. The vectors in $X$ and $Y$ will come from the columns of $S$ and the rows of $S^{- 1}$.

Let $λ_{1}, λ_{2}, λ_{3}, \dots, λ_{n}$ be the eigenvalues of $A$ (repeated according to their algebraic multiplicity). If $A$ has rank $r$, then $dim (N (A)) = n - r$ (Theorem RPNC). The null space of $A$ is the eigenspace of the eigenvalue $λ = 0$ (Theorem EMNS), so the geometric multiplicity of $λ = 0$ is $n - r$. Since $A$ is diagonalizable, the algebraic and geometric multiplicities of each eigenvalue are equal (Theorem DMFE), so the algebraic multiplicity of $λ = 0$ is also $n - r$, that is, $α_{A} (0) = n - r$. Presume that the complete list of eigenvalues is ordered so that $λ_{k} = 0$ for $r + 1 \leq k \leq n$.

Since $A$ is hypothesized to be diagonalizable, there exist a diagonal matrix $D$ and an invertible matrix $S$ such that $D = S^{- 1} A S$. We can rearrange this equation to read $A = S D S^{- 1}$. Also, the proof of Theorem DC says that the diagonal elements of $D$ are the eigenvalues of $A$, and we have the flexibility to assume they lie on the diagonal in the same order as we have specified above. Now, let $X^{*} = \{x_{1}, x_{2}, x_{3}, \dots, x_{n}\}$ be the columns of $S$, and let $Y^{*} = \{y_{1}, y_{2}, y_{3}, \dots, y_{n}\}$ be the rows of $S^{- 1}$ converted to column vectors. With little motivation other than the statement of the theorem, define size $n$ matrices $A_{k}$, $1 \leq k \leq n$, by $A_{k} = λ_{k} x_{k} y_{k}^{t}$. Finally, let $D_{k}$ be the size $n$ matrix that is totally zero, other than having $λ_{k}$ in row $k$ and column $k$.

With everything in place, we compute entry-by-entry,

$\begin{array}{l} {[A]}_{i j} & = {[S D S^{- 1}]}_{i j} & Definition DZM \\ = {[S (\sum_{k = 1}^{n} D_{k}) S^{- 1}]}_{i j} & Definition MA \\ = {[S (\sum_{k = 1}^{n} D_{k} S^{- 1})]}_{i j} & Theorem MMDAA \\ = {[\sum_{k = 1}^{n} S D_{k} S^{- 1}]}_{i j} & Theorem MMDAA \\ = \sum_{k = 1}^{n} {[S D_{k} S^{- 1}]}_{i j} & Definition MA \\ = \sum_{k = 1}^{n} \sum_{ℓ = 1}^{n} {[S D_{k}]}_{i ℓ} {[S^{- 1}]}_{ℓ j} & Theorem EMP \\ = \sum_{k = 1}^{n} \sum_{ℓ = 1}^{n} \sum_{p = 1}^{n} {[S]}_{i p} {[D_{k}]}_{p ℓ} {[S^{- 1}]}_{ℓ j} & Theorem EMP \\ = \sum_{k = 1}^{n} {[S]}_{i k} {[D_{k}]}_{k k} {[S^{- 1}]}_{k j} & {[D_{k}]}_{p ℓ} = 0 if p \neq k, or ℓ \neq k \\ = \sum_{k = 1}^{n} {[S]}_{i k} λ_{k} {[S^{- 1}]}_{k j} & {[D_{k}]}_{k k} = λ_{k} \\ = \sum_{k = 1}^{n} λ_{k} {[S]}_{i k} {[S^{- 1}]}_{k j} & Property CMCN \\ = \sum_{k = 1}^{n} λ_{k} {[x_{k}]}_{i 1} {[y_{k}^{t}]}_{1 j} & Definition of X^{*}, Y^{*} \\ = \sum_{k = 1}^{n} λ_{k} \sum_{q = 1}^{1} {[x_{k}]}_{i q} {[y_{k}^{t}]}_{q j} \\ = \sum_{k = 1}^{n} λ_{k} {[x_{k} y_{k}^{t}]}_{i j} & Theorem EMP \\ = \sum_{k = 1}^{n} {[λ_{k} x_{k} y_{k}^{t}]}_{i j} & Definition MSM \\ = \sum_{k = 1}^{n} {[A_{k}]}_{i j} & Definition of A_{k} \\ = {[\sum_{k = 1}^{n} A_{k}]}_{i j} & Definition MA \end{array}$

So by Definition ME we have the desired equality of matrices. The careful reader will have noted that $A_{k} = O$ , $r + 1 \leq k \leq n$ , since $λ_{k} = 0$ in these instances. To get the sets $X$ and $Y$ from $X^{*}$ and $Y^{*}$ , simply discard the last $n - r$ vectors. We can safely ignore (or remove) $A_{r + 1}, A_{r + 2}, \dots, A_{n}$ from the summation just derived.

One last assertion to check. What is the rank of $A_{k}$, $1 \leq k \leq r$? Every row of $A_{k}$ is a scalar multiple of $y_{k}^{t}$, row $k$ of the nonsingular matrix $S^{- 1}$ (Theorem MIMI). As a row of a nonsingular matrix, $y_{k}^{t}$ cannot be all zeros. In particular, row $i$ of $A_{k}$ is obtained as a scalar multiple of $y_{k}^{t}$ by the scalar $λ_{k} {[x_{k}]}_{i}$. We have restricted ourselves to the nonzero eigenvalues of $A$, and as $S$ is nonsingular, some entry of $x_{k}$ is nonzero. This all implies that some row of $A_{k}$ will be nonzero. Now consider row-reducing $A_{k}$. Swap the nonzero row up into row 1. Use scalar multiples of this row to zero out every other row. This leaves a single nonzero row in the reduced row-echelon form, so $A_{k}$ has rank one. $■$
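The construction in the proof can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not part of the original text: the helper names `outer`, `rank_one_terms` and `mat_sum` are hypothetical, matrices are stored as plain lists of rows, and the small matrices `S` and `S_inv` in the usage lines are made up for the sketch (they are not taken from the examples in this section).

```python
def outer(x, y):
    """Return the matrix x y^t for two vectors of equal length."""
    return [[xi * yj for yj in y] for xi in x]

def rank_one_terms(eigenvalues, S, S_inv):
    """Build A_k = lambda_k * x_k * y_k^t for each nonzero eigenvalue.

    x_k is column k of S and y_k is row k of S_inv, read as a vector.
    The eigenvalues must be listed in the same order as the columns of S.
    """
    n = len(S)
    terms = []
    for k, lam in enumerate(eigenvalues):
        if lam == 0:
            continue  # A_k would be the zero matrix, so it is discarded
        x_k = [S[i][k] for i in range(n)]
        y_k = S_inv[k]
        terms.append([[lam * entry for entry in row] for row in outer(x_k, y_k)])
    return terms

def mat_sum(matrices):
    """Entry-by-entry sum of a nonempty list of equal-size matrices."""
    n = len(matrices[0])
    return [[sum(M[i][j] for M in matrices) for j in range(n)] for i in range(n)]

# Made-up illustration: A = S D S^{-1} with D = diag(2, 0), so A has rank 1.
S = [[1, 1], [0, 1]]
S_inv = [[1, -1], [0, 1]]
terms = rank_one_terms([2, 0], S, S_inv)
print(len(terms))      # 1
print(mat_sum(terms))  # [[2, -2], [0, 0]]
```

Skipping the zero eigenvalues inside the loop mirrors the step in the proof where $A_{r + 1}, \dots, A_{n}$ are removed from the summation.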

We record two observations that were not stated in our theorem above. First, the vectors in $X$, chosen as columns of $S$, are eigenvectors of $A$. Second, the product of two vectors from $X$ and $Y$ in the opposite order, by which we mean $y_{i}^{t} x_{j}$, is the entry in row $i$ and column $j$ of the matrix product $S^{- 1} S = I_{n}$ (Theorem EMP). In particular,

$\begin{array}{l} y_{i}^{t} x_{j} & = \begin{cases} 1 & \text{if } i = j \\ 0 & \text{if } i \neq j \end{cases} \end{array}$
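As a brief sketch of how these two observations fit together with the decomposition, take $1 \leq j \leq r$ and apply $A = A_{1} + A_{2} + \dots + A_{r}$ to $x_{j}$. Only the term with $k = j$ survives, which recovers the fact that $x_{j}$ is an eigenvector of $A$ for $λ_{j}$:

$\begin{aligned} A x_{j} & = \sum_{k = 1}^{r} λ_{k} x_{k} y_{k}^{t} x_{j} \\ & = \sum_{k = 1}^{r} λ_{k} (y_{k}^{t} x_{j}) x_{k} \\ & = λ_{j} x_{j} \end{aligned}$

The second step uses only that $y_{k}^{t} x_{j}$ is a scalar, and the third uses the identity just displayed.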

We give two computational examples, one small and one a bit bigger.

Example ROD2
Rank one decomposition, size 2
Consider the $2 \times 2$ matrix,

$\begin{array}{l} A & = [\begin{matrix} - 16 & - 6 \\ 45 & 17 \end{matrix}] \end{array}$

By the techniques of Chapter E we find the eigenvalues and eigenspaces,

$\begin{array}{l} λ_{1} & = 2 & ℰ_{A} (2) & = 〈\{[\begin{matrix} - 1 \\ 3 \end{matrix}]\}〉 & λ_{2} & = - 1 & ℰ_{A} (- 1) & = 〈\{[\begin{matrix} - 2 \\ 5 \end{matrix}]\}〉 \end{array}$

With $n = 2$ distinct eigenvalues, Theorem DED tells us that $A$ is diagonalizable, and with no zero eigenvalues we see that $A$ has full rank. Theorem DC says we can construct the nonsingular matrix $S$ with eigenvectors of $A$ as columns, so we have

$\begin{array}{l} S & = [\begin{matrix} - 1 & - 2 \\ 3 & 5 \end{matrix}] & S^{- 1} & = [\begin{matrix} 5 & 2 \\ - 3 & - 1 \end{matrix}] \end{array}$

From these matrices we obtain the sets of vectors

$\begin{array}{l} X & = \{[\begin{matrix} - 1 \\ 3 \end{matrix}], [\begin{matrix} - 2 \\ 5 \end{matrix}]\} & Y & = \{[\begin{matrix} 5 \\ 2 \end{matrix}], [\begin{matrix} - 3 \\ - 1 \end{matrix}]\} \end{array}$

And we have the matrices,

$\begin{array}{l} A_{1} & = 2 [\begin{matrix} - 1 \\ 3 \end{matrix}] {[\begin{matrix} 5 \\ 2 \end{matrix}]}^{t} = 2 [\begin{matrix} - 5 & - 2 \\ 15 & 6 \end{matrix}] = [\begin{matrix} - 10 & - 4 \\ 30 & 12 \end{matrix}] \\ A_{2} & = (- 1) [\begin{matrix} - 2 \\ 5 \end{matrix}] {[\begin{matrix} - 3 \\ - 1 \end{matrix}]}^{t} = (- 1) [\begin{matrix} 6 & 2 \\ - 15 & - 5 \end{matrix}] = [\begin{matrix} - 6 & - 2 \\ 15 & 5 \end{matrix}] \end{array}$

And you can easily verify that $A = A_{1} + A_{2}$ . $⊠$
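For readers who would rather check this by machine, here is a minimal Python sketch of the verification. The numbers are copied from Example ROD2; the helper names `scaled_outer` and `dot` are hypothetical, and matrices are stored as plain lists of rows.

```python
# Data copied from Example ROD2.
A = [[-16, -6], [45, 17]]
eigenvalues = [2, -1]
X = [[-1, 3], [-2, 5]]   # columns of S
Y = [[5, 2], [-3, -1]]   # rows of S^{-1}, read as vectors

def scaled_outer(lam, x, y):
    """Return the matrix lam * x * y^t."""
    return [[lam * xi * yj for yj in y] for xi in x]

def dot(u, v):
    """Return the scalar u^t v."""
    return sum(ui * vi for ui, vi in zip(u, v))

A1, A2 = (scaled_outer(lam, x, y) for lam, x, y in zip(eigenvalues, X, Y))
total = [[A1[i][j] + A2[i][j] for j in range(2)] for i in range(2)]

print(A1)          # [[-10, -4], [30, 12]]
print(A2)          # [[-6, -2], [15, 5]]
print(total == A)  # True

# The products y_i^t x_j form the 2 x 2 identity matrix.
print([[dot(y, x) for x in X] for y in Y])  # [[1, 0], [0, 1]]
```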

Here’s a slightly larger example, and the matrix does not have full rank.

Example ROD4
Rank one decomposition, size 4
Consider the $4 \times 4$ matrix,

$\begin{array}{l} B & = [\begin{matrix} 34 & 18 & - 1 & - 6 \\ - 44 & - 24 & - 1 & 9 \\ 36 & 18 & - 3 & - 6 \\ 36 & 18 & - 6 & - 3 \end{matrix}] \end{array}$

By the techniques of Chapter E we find the eigenvalues and eigenvectors,

$\begin{array}{l} λ_{1} & = 3 & ℰ_{B} (3) & = 〈\{[\begin{matrix} 1 \\ - 2 \\ 1 \\ - 1 \end{matrix}], [\begin{matrix} 1 \\ - 1 \\ 1 \\ 2 \end{matrix}]\}〉 \\ λ_{2} & = - 2 & ℰ_{B} (- 2) & = 〈\{[\begin{matrix} - 1 \\ 2 \\ 0 \\ 0 \end{matrix}]\}〉 \\ λ_{3} & = 0 & ℰ_{B} (0) & = 〈\{[\begin{matrix} 2 \\ - 3 \\ 2 \\ 2 \end{matrix}]\}〉 \end{array}$

The algebraic and geometric multiplicities of each eigenvalue are equal, so Theorem DMFE tells us that $B$ is diagonalizable. With a single zero eigenvalue we see that $B$ has rank $4 - 1 = 3$. Theorem DC says we can construct the nonsingular matrix $S$ with eigenvectors of $B$ as columns, so we have

$\begin{array}{l} S & = [\begin{matrix} 1 & 1 & - 1 & 2 \\ - 2 & - 1 & 2 & - 3 \\ 1 & 1 & 0 & 2 \\ - 1 & 2 & 0 & 2 \end{matrix}] & S^{- 1} & = [\begin{matrix} 4 & 2 & 0 & - 1 \\ 8 & 4 & - 1 & - 1 \\ - 1 & 0 & 1 & 0 \\ - 6 & - 3 & 1 & 1 \end{matrix}] \end{array}$

Since $r = 3$ , we need only collect three vectors from each of these matrices,

$\begin{array}{l} X & = \{[\begin{matrix} 1 \\ - 2 \\ 1 \\ - 1 \end{matrix}], [\begin{matrix} 1 \\ - 1 \\ 1 \\ 2 \end{matrix}], [\begin{matrix} - 1 \\ 2 \\ 0 \\ 0 \end{matrix}]\} & Y & = \{[\begin{matrix} 4 \\ 2 \\ 0 \\ - 1 \end{matrix}], [\begin{matrix} 8 \\ 4 \\ - 1 \\ - 1 \end{matrix}], [\begin{matrix} - 1 \\ 0 \\ 1 \\ 0 \end{matrix}]\} \end{array}$

And we obtain the matrices,

$\begin{array}{l} B_{1} & = 3 [\begin{matrix} 1 \\ - 2 \\ 1 \\ - 1 \end{matrix}] {[\begin{matrix} 4 \\ 2 \\ 0 \\ - 1 \end{matrix}]}^{t} = 3 [\begin{matrix} 4 & 2 & 0 & - 1 \\ - 8 & - 4 & 0 & 2 \\ 4 & 2 & 0 & - 1 \\ - 4 & - 2 & 0 & 1 \end{matrix}] = [\begin{matrix} 12 & 6 & 0 & - 3 \\ - 24 & - 12 & 0 & 6 \\ 12 & 6 & 0 & - 3 \\ - 12 & - 6 & 0 & 3 \end{matrix}] \\ B_{2} & = 3 [\begin{matrix} 1 \\ - 1 \\ 1 \\ 2 \end{matrix}] {[\begin{matrix} 8 \\ 4 \\ - 1 \\ - 1 \end{matrix}]}^{t} = 3 [\begin{matrix} 8 & 4 & - 1 & - 1 \\ - 8 & - 4 & 1 & 1 \\ 8 & 4 & - 1 & - 1 \\ 16 & 8 & - 2 & - 2 \end{matrix}] = [\begin{matrix} 24 & 12 & - 3 & - 3 \\ - 24 & - 12 & 3 & 3 \\ 24 & 12 & - 3 & - 3 \\ 48 & 24 & - 6 & - 6 \end{matrix}] \\ B_{3} & = (- 2) [\begin{matrix} - 1 \\ 2 \\ 0 \\ 0 \end{matrix}] {[\begin{matrix} - 1 \\ 0 \\ 1 \\ 0 \end{matrix}]}^{t} = (- 2) [\begin{matrix} 1 & 0 & - 1 & 0 \\ - 2 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}] = [\begin{matrix} - 2 & 0 & 2 & 0 \\ 4 & 0 & - 4 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}] \end{array}$

Then we verify that

$\begin{array}{l} B & = B_{1} + B_{2} + B_{3} \\ = [\begin{matrix} 12 & 6 & 0 & - 3 \\ - 24 & - 12 & 0 & 6 \\ 12 & 6 & 0 & - 3 \\ - 12 & - 6 & 0 & 3 \end{matrix}] + [\begin{matrix} 24 & 12 & - 3 & - 3 \\ - 24 & - 12 & 3 & 3 \\ 24 & 12 & - 3 & - 3 \\ 48 & 24 & - 6 & - 6 \end{matrix}] + [\begin{matrix} - 2 & 0 & 2 & 0 \\ 4 & 0 & - 4 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}] \\ = [\begin{matrix} 34 & 18 & - 1 & - 6 \\ - 44 & - 24 & - 1 & 9 \\ 36 & 18 & - 3 & - 6 \\ 36 & 18 & - 6 & - 3 \end{matrix}] \end{array}$

$⊠$
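The claims in Example ROD4 can also be checked mechanically. The following Python sketch uses the numbers from the example and confirms that the three summands add up to $B$, that each summand has rank 1, and that $B$ has rank 3. The `rank` helper is a hypothetical, straightforward Gaussian elimination over exact fractions, written for this illustration only.

```python
from fractions import Fraction

# Data copied from Example ROD4.
B = [[34, 18, -1, -6],
     [-44, -24, -1, 9],
     [36, 18, -3, -6],
     [36, 18, -6, -3]]
eigenvalues = [3, 3, -2]                                   # nonzero eigenvalues
X = [[1, -2, 1, -1], [1, -1, 1, 2], [-1, 2, 0, 0]]         # first 3 columns of S
Y = [[4, 2, 0, -1], [8, 4, -1, -1], [-1, 0, 1, 0]]         # first 3 rows of S^{-1}

def rank(M):
    """Rank of M by row reduction, using exact rational arithmetic."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0  # number of pivot rows found so far
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

# B_k = lambda_k * x_k * y_k^t
terms = [[[lam * xi * yj for yj in y] for xi in x]
         for lam, x, y in zip(eigenvalues, X, Y)]
total = [[sum(T[i][j] for T in terms) for j in range(4)] for i in range(4)]

print(total == B)                # True
print([rank(T) for T in terms])  # [1, 1, 1]
print(rank(B))                   # 3
```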