Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

7.4* The Rational Canonical Form

Until now we have used eigenvalues, eigenvectors, and generalized eigenvectors in our analysis of linear operators with characteristic polynomials that split. In general, characteristic polynomials need not split, and indeed, operators need not have eigenvalues! However, the unique factorization theorem for polynomials (see page 562) guarantees that the characteristic polynomial f(t) of any linear operator T on an n-dimensional vector space factors uniquely as

$f (t) = {(- 1)}^{n} {(ϕ_{1} (t))}^{n_{1}} {(ϕ_{2} (t))}^{n_{2}} \dots {(ϕ_{k} (t))}^{n_{k}},$

where the $ϕ_{i} (t)$ ’s $(1 \leq i \leq k)$ are distinct irreducible monic polynomials and the $n_{i}$ ’s are positive integers. In the case that f(t) splits, each irreducible monic polynomial factor is of the form $ϕ_{i} (t) = t - λ_{i}$ , where $λ_{i}$ is an eigenvalue of T, and there is a one-to-one correspondence between eigenvalues of T and the irreducible monic factors of the characteristic polynomial. In general, eigenvalues need not exist, but the irreducible monic factors always exist. In this section, we establish structure theorems based on the irreducible monic factors of the characteristic polynomial instead of eigenvalues.

In this context, the following definition is the appropriate replacement for eigenspace and generalized eigenspace.

Definition.

Let T be a linear operator on a finite-dimensional vector space V with characteristic polynomial

$f (t) = {(- 1)}^{n} {(ϕ_{1} (t))}^{n_{1}} {(ϕ_{2} (t))}^{n_{2}} \dots {(ϕ_{k} (t))}^{n_{k}},$

where the $ϕ_{i} (t)$ ’s $(1 \leq i \leq k)$ are distinct irreducible monic polynomials and the $n_{i}$ ’s are positive integers. For $1 \leq i \leq k$ , we define the subset $K_{ϕ_{i}}$ of V by

$K_{ϕ_{i}} = {x \in V : {(ϕ_{i} (T))}^{p} (x) = 0 for some positive integer p} .$

We show that each $K_{ϕ_{i}}$ is a nonzero T-invariant subspace of V. Note that if $ϕ_{i} (t) = t - λ$ is of degree one, then $K_{ϕ_{i}}$ is the generalized eigenspace of T corresponding to the eigenvalue $λ$ .

Having obtained suitable generalizations of the related concepts of eigenvalue and eigenspace, our next task is to describe a canonical form of a linear operator suitable to this context. The one that we study is called the rational canonical form. Since a canonical form is a description of a matrix representation of a linear operator, it can be defined by specifying the form of the ordered bases allowed for these representations.

Here the bases of interest naturally arise from the generators of certain cyclic subspaces. For this reason, the reader should recall the definition of a T-cyclic subspace generated by a vector and Theorem 5.21 (p. 314). We briefly review this concept and introduce some new notation and terminology.

Let T be a linear operator on a finite-dimensional vector space V, and let x be a nonzero vector in V. We use the notation $C_{x}$ for the T-cyclic subspace generated by x. Recall (Theorem 5.21) that if $\dim (C_{x}) = k$ , then the set

${x, T (x), T^{2} (x), \dots, T^{k - 1} (x)}$

is an ordered basis for $C_{x}$ . To distinguish this basis from all other ordered bases for $C_{x}$ , we call it the T-cyclic basis generated by x and denote it by $β_{x}$ . Let A be the matrix representation of the restriction of T to $C_{x}$ in the ordered basis $β_{x}$ . Recall from the proof of Theorem 5.21 that

$A = (\begin{array}{c} 0 & 0 & \dots & 0 & - a_{0} \\ 1 & 0 & \dots & 0 & - a_{1} \\ 0 & 1 & \dots & 0 & - a_{2} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & \dots & 1 & - a_{k - 1} \end{array}),$

where

$a_{0} x + a_{1} T (x) + \dots + a_{k - 1} T^{k - 1} (x) + T^{k} (x) = 0.$

Furthermore, the characteristic polynomial of A is given by

$\det (A - t I) = {(- 1)}^{k} (a_{0} + a_{1} t + \dots + a_{k - 1} t^{k - 1} + t^{k}) .$

The matrix A is called the companion matrix of the monic polynomial $h (t) = a_{0} + a_{1} t + \dots + a_{k - 1} t^{k - 1} + t^{k}$ . Every monic polynomial has a companion matrix, and the characteristic polynomial of the companion matrix of a monic polynomial g(t) of degree k is equal to ${(- 1)}^{k} g (t)$ . (See Exercise 19 of Section 5.4.) By Theorem 7.15 (p. 512), the monic polynomial h(t) is also the minimal polynomial of A. Since A is the matrix representation of the restriction of T to $C_{x}, h (t)$ is also the minimal polynomial of this restriction. By Exercise 15 of Section 7.3, h(t) is also the T-annihilator of x.

It is the object of this section to prove that for every linear operator T on a finite-dimensional vector space V, there exists an ordered basis $β$ for V such that the matrix representation ${[T]}_{β}$ is of the form

$(\begin{array}{c} C_{1} & O & \dots & O \\ O & C_{2} & \dots & O \\ ⋮ & ⋮ & ⋮ \\ O & O & \dots & C_{r} \end{array}),$

where each $C_{i}$ is the companion matrix of a polynomial ${(ϕ (t))}^{m}$ such that $ϕ (t)$ is a monic irreducible divisor of the characteristic polynomial of T and m is a positive integer. A matrix representation of this kind is called a rational canonical form of T. We call the accompanying basis a rational canonical basis for T.

The next theorem is a simple consequence of the following lemma, which relies on the concept of T-annihilator, introduced in the Exercises of Section 7.3.

Lemma.

Let T be a linear operator on a finite-dimensional vector space V, let x be a nonzero vector in V, and suppose that the T-annihilator of x is of the form ${(ϕ (t))}^{p}$ for some irreducible monic polynomial $ϕ (t)$ . Then $ϕ (t)$ divides the minimal polynomial of T, and $x \in K_{ϕ}$ .

Proof.

By Exercise 15(b) of Section 7.3, ${(ϕ (t))}^{p}$ divides the minimal polynomial of T. Therefore $ϕ (t)$ divides the minimal polynomial of T. Furthermore, $x \in K_{ϕ}$ by the definition of $K_{ϕ}$ .

Theorem 7.17.

Let T be a linear operator on a finite-dimensional vector space V, and let $β$ be an ordered basis for V. Then $β$ is a rational canonical basis for T if and only if $β$ is the disjoint union of T-cyclic bases $β_{v_{i}}$ , where each $v_{i}$ lies in $K_{ϕ}$ for some irreducible monic divisor $ϕ (t)$ of the characteristic polynomial of T.

Proof.

Exercise.

Example 1

Suppose that T is a linear operator on $R^{8}$ and

$β = {v_{1}, v_{2}, v_{3}, v_{4}, v_{5}, v_{6}, v_{7}, v_{8}}$

is a rational canonical basis for T such that

A diagram of a 8 by 8 matrix that is a canonical basis for T.

7.4-12 Full Alternative Text

is a rational canonical form of T. In this case, the submatrices $C_{1}, C_{2}$ , and $C_{3}$ are the companion matrices of the polynomials $ϕ_{1} (t), {(ϕ_{2} (t))}^{2}$ , and $ϕ_{2} (t)$ , respectively, where

$ϕ_{1} (t) = t^{3} - t + 3 and ϕ_{2} (t) = t^{2} + 1.$

In the context of Theorem 7.17, $β$ is the disjoint union of the T-cyclic bases; that is,

$\begin{array}{rcl} β & = & β_{v_{1}} \cup β_{v_{3}} \cup β_{v_{7}} \\ = & {v_{1}, v_{2}} \cup {v_{3}, v_{4}, v_{5}, v_{6}} \cup {v_{7}, v_{8}} . \end{array}$

By Exercise 39 of Section 5.4, the characteristic polynomial f(t) of T is the product of the characteristic polynomials of the companion matrices:

$f (t) = ϕ_{1} (t) {(ϕ_{2} (t))}^{2} ϕ_{2} (t) = ϕ_{1} (t) {(ϕ_{2} (t))}^{3} .$

The rational canonical form C of the operator T in Example 1 is constructed from matrices of the form $C_{i}$ , each of which is the companion matrix of some power of a monic irreducible divisor of the characteristic polynomial of T. Furthermore, each such divisor is used in this way at least once.

In the course of showing that every linear operator T on a finite dimensional vector space has a rational canonical form C, we show that the companion matrices $C_{i}$ that constitute C are always constructed from powers of the monic irreducible divisors of the characteristic polynomial of T. A key role in our analysis is played by the subspaces $K_{ϕ}$ , where $ϕ (t)$ is an irreducible monic divisor of the minimal polynomial of T. Since the minimal polynomial of an operator divides the characteristic polynomial of the operator, every irreducible divisor of the former is also an irreducible divisor of the latter. We eventually show that the converse is also true; that is, the minimal polynomial and the characteristic polynomial have the same irreducible divisors.

We begin with a result that lists several properties of irreducible divisors of the minimal polynomial. The reader is advised to review the definition of T-annihilator and the accompanying Exercises 15 of Section 7.3.

Theorem 7.18.

Let T be a linear operator on a finite-dimensional vector space V, and suppose that

$p (t) = {(ϕ_{1} (t))}^{m_{1}} {(ϕ_{2} (t))}^{m_{2}} \dots {(ϕ_{k} (t))}^{m_{k}}$

is the minimal polynomial of T, where the $ϕ_{i} (t)$ ‘s $(1 \leq i \leq k)$ are the distinct irreducible monic factors of p(t) and the $m_{i}$ ’s are positive integers. Then the following statements are true.

(a) $K_{ϕ_{i}}$ is a nonzero T-invariant subspace of V for each i.
(b) If x is a nonzero vector in some $K_{ϕ_{i}}$ , then the T-annihilator of x is of the form ${(ϕ_{i} (t))}^{p}$ for some integer p.
(c) $K_{ϕ_{i}} \cap K_{ϕ_{j}} = {0} for i \neq j$ .
(d) $K_{ϕ_{i}}$ is invariant under $ϕ_{j} (T)$ for $i \neq j$ , and the restriction of $ϕ_{j} (T)$ to $K_{ϕ_{i}}$ is one-to-one and onto.
(e) $K_{ϕ_{i}} = N ({(ϕ_{i} (T))}^{m_{i}})$ for each i.

Proof.

If $k = 1$ , then (a), (b), and (e) are obvious, while (c) and (d) are vacuously true. Now suppose that $k > 1$ .

(a) The proof that $K_{ϕ_{i}}$ is a T-invariant subspace of V is left as an exercise. Let $f_{i} (t)$ be the polynomial obtained from p(t) by omitting the factor ${(ϕ_{i} (t))}^{m_{i}}$ . To prove that $K_{ϕ_{i}}$ is nonzero, first observe that $f_{i} (t)$ is a proper divisor of p(t); therefore there exists a vector $z \in V$ such that $x = f_{i} (T) (z) \neq 0$ . Then $x \in K_{ϕ_{i}}$ because

${(ϕ_{i} (T))}^{m_{i}} (x) = {(ϕ_{i} (T))}^{m_{i}} f_{i} (T) (z) = p (T) (z) = 0.$
(b) Assume the hypothesis. Then ${(ϕ_{i} (T))}^{q} (x) = 0$ for some positive integer q. Hence the T-annihilator of x divides ${(ϕ_{i} (t))}^{q}$ by Exercise 15(b) of Section 7.3, and the result follows.
(c) Assume $i \neq j$ . Let $x \in K_{ϕ_{i}} \cap K_{ϕ_{j}}$ , and suppose that $x \neq 0$ . By (b), the T-annihilator of x is a power of both $ϕ_{i} (t)$ and $ϕ_{j} (t)$ . But this is impossible because $ϕ_{i} (t)$ and $ϕ_{j} (t)$ are relatively prime (see Appendix E). We conclude that $x = 0$ .
(d) Assume $i \neq j$ . Since $K_{ϕ_{i}}$ is T-invariant, it is also $ϕ_{j} (T)$ -invariant. Suppose that $ϕ_{j} (T) (x) = 0$ for some $x \in K_{ϕ_{i}}$ . Then $x \in K_{ϕ_{i}} \cap K_{ϕ_{j}} = {0}$ by (c). Therefore the restriction of $ϕ_{j} (T)$ to $K_{ϕ_{i}}$ is one-to-one. Since V is finite-dimensional, this restriction is also onto.
(e) Suppose that $1 \leq i \leq k$ . Clearly, $N ({(ϕ_{i} (T))}^{m_{i}}) \subseteq K_{ϕ_{i}}$ . Let $f_{i} (t)$ be the polynomial defined in (a). Since $f_{i} (t)$ is a product of polynomials of the form $ϕ_{j} (t)$ for $j \neq i$ , we have by (d) that the restriction of $f_{i} (T)$ to $K_{ϕ_{i}}$ is onto. Let $x \in K_{ϕ_{i}}$ . Then there exists $y \in K_{ϕ_{i}}$ such that $f_{i} (T) (y) = x$ . Therefore

$({(ϕ_{i} (T))}^{m_{i}}) (x) = ({(ϕ_{i} (T))}^{m_{i}}) f_{i} (T) (y) = p (T) (y) = 0,$

and hence $x \in N ({(ϕ_{i} (T))}^{m_{i}})$ . Thus $K_{ϕ_{i}} = N ({(ϕ_{i} (T))}^{m_{i}})$ .

Since a rational canonical basis for an operator T is obtained from a union of T-cyclic bases, we need to know when such a union is linearly independent. The next major result, Theorem 7.19, reduces this problem to the study of T-cyclic bases within $K_{ϕ}$ , where $ϕ (t)$ is an irreducible monic divisor of the minimal polynomial of T. We begin with the following lemma.

Lemma.

Let T be a linear operator on a finite-dimensional vector space V, and suppose that

$p (t) = {(ϕ_{1} (t))}^{m_{1}} {(ϕ_{2} (t))}^{m_{2}} \dots {(ϕ_{k} (t))}^{m_{k}}$

is the minimal polynomial of T, where the $ϕ_{i}$ ’s $(1 \leq i \leq k)$ are the distinct irreducible monic factors of p(t) and the $m_{i}$ ’s are positive integers. For $1 \leq i \leq k$ , let $v_{i} \in K_{ϕ_{i}}$ be such that

$v_{1} + v_{2} + \dots + v_{k} = 0.$ (2)

Then $v_{i} = 0$ for all i.

Proof.

The result is trivial if $k = 1$ , so suppose that $k > 1$ . Consider any i. Let $f_{i} (t)$ be the polynomial obtained from p(t) by omitting the factor ${(ϕ_{i} (t))}^{m_{i}}$ . As a consequence of Theorem 7.18, $f_{i} (T)$ is one-to-one on $K_{ϕ_{i}}$ , and $f_{i} (T) (v_{j}) = 0$ for $i \neq j$ . Thus, applying $f_{i} (T)$ to (2), we obtain $f_{i} (T) (v_{i}) = 0$ , from which it follows that $v_{i} = 0$ .

Theorem 7.19.

Let T be a linear operator on a finite-dimensional vector space V, and suppose that

$p (t) = {(ϕ_{1} (t))}^{m_{1}} {(ϕ_{2} (t))}^{m_{2}} \dots {(ϕ_{k} (t))}^{m_{k}}$

is the minimal polynomial of T, where the $ϕ_{i}$ ‘s $(1 \leq i \leq k)$ are the distinct irreducible monic factors of p(t) and the $m_{i}$ ’s are positive integers. For $1 \leq i \leq k$ , let $S_{i}$ be a linearly independent subset of $K_{ϕ_{i}}$ . Then

$S_{i} \cap S_{j} = \emptyset$ for $i \neq j$
$S_{1} \cup S_{2} \cup \dots \cup S_{k}$ is linearly independent.

Proof.

If $k = 1$ , then (a) is vacuously true and (b) is obvious. Now suppose that $k > 1$ . Then (a) follows immediately from Theorem 7.18(c). Furthermore, the proof of (b) is identical to the proof of Theorem 5.5 (p. 261) with the eigenvectors replaced by the generalized eigenvectors.

In view of Theorem 7.19, we can focus on bases of individual spaces of the form $K_{ϕ}$ , where $ϕ (t)$ is an irreducible monic divisor of the minimal polynomial of T. The next several results give us ways to construct bases for these spaces that are unions of T-cyclic bases. These results serve the dual purposes of leading to the existence theorem for the rational canonical form and of providing methods for constructing rational canonical bases.

For Theorems 7.20 and 7.21 and the latter’s corollary, we fix a linear operator T on a finite-dimensional vector space V and an irreducible monic divisor $ϕ (t)$ of the minimal polynomial of T.

Theorem 7.20.

Let $v_{1}, v_{2}, \dots, v_{k}$ be distinct vectors in $K_{ϕ}$ such that

$S_{1} = β_{v_{1}} \cup β_{v_{2}} \cup \dots \cup β_{v_{k}}$

is linearly independent. For each i, suppose there exists $w_{i} \in V$ such that $ϕ (T) (w_{i}) = v_{i}$ . Then

$S_{2} = β_{w_{1}} \cup β_{w_{2}} \cup \dots \cup β_{w_{k}}$

is also linearly independent.

Proof.

Consider any linear combination of vectors in $S_{2}$ that sums to zero, say,

$\sum_{i = 1}^{k} \sum_{j = 0}^{n_{i}} a_{i j} T^{j} (w_{i}) = 0.$ (3)

For each i, let $f_{i} (t)$ be the polynomial defined by

$f_{i} (t) = \sum_{j = 0}^{n_{i}} a_{i j} t^{j} .$

Then (3) can be rewritten as

$\sum_{i = 1}^{k} f_{i} (T) (w_{i}) = 0.$ (4)

Apply $ϕ (T)$ to both sides of (4) to obtain

$\sum_{i = 1}^{k} ϕ (T) f_{i} (T) (w_{i}) = \sum_{i = 1}^{k} f_{i} (T) ϕ (T) (w_{i}) = \sum_{i = 1}^{k} f_{i} (T) (v_{i}) = 0.$

This last sum can be rewritten as a linear combination of the vectors in $S_{1}$ so that each $f_{i} (T) (v_{i})$ is a linear combination of the vectors in $β_{v_{i}}$ . Since $S_{1}$ is linearly independent, it follows that

$f_{i} (T) (v_{i}) = 0 for all i .$

Therefore the T-annihilator of $v_{i}$ divides $f_{i} (t)$ for all i. (See Exercise 15 of Section 7.3.) By Theorem 7.18(b), $ϕ (t)$ divides the T-annihilator of $v_{i}$ , and hence $ϕ (t)$ divides $f_{i} (t)$ for all i. Thus, for each i, there exists a polynomial $g_{i} (t)$ such that $f_{i} (t) = g_{i} (t) ϕ (t)$ . So (4) becomes

$\sum_{i = 1}^{k} g_{i} (T) ϕ (T) (w_{i}) = \sum_{i = 1}^{k} g_{i} (T) (v_{i}) = 0.$

Again, linear independence of $S_{1}$ requires that

$f_{i} (T) (w_{i}) = g_{i} (T) (v_{i}) = 0 for all i .$

But $f_{i} (T) (w_{i})$ is the result of grouping the terms of the linear combination in (3) that arise from the linearly independent set $β_{w_{i}}$ . We conclude that for each i, $a_{i j} = 0$ for all j. Therefore $S_{2}$ is linearly independent.

We now show that $K_{ϕ}$ has a basis consisting of a union of T-cycles.

Lemma.

Let W be a T-invariant subspace of $K_{ϕ}$ , and let $β$ be a basis for W. Then the following statements are true.

(a) Suppose that $x \in N (ϕ (T))$ , but $x \notin W$ . Then $β \cup β_{x}$ is linearly independent.
(b) For some $w_{1}, w_{2}, \dots, w_{s}$ in $N (ϕ (T))$ , $β$ can be extended to the linearly independent set

$β^{'} = β \cup β_{w_{1}} \cup β_{w_{2}} \cup \dots \cup β_{w_{s}},$

whose span contains $N (ϕ (T))$ .

Proof.

(a) Let $β = {v_{1}, v_{2}, \dots, v_{k}}$ , and suppose that

$\sum_{i = 1}^{k} a_{i} v_{i} + z = 0 and z = \sum_{j = 0}^{d - 1} b_{j} T^{j} (x),$

where d is the degree of $ϕ (t)$ . Then $z \in C_{x} \cap W$ , and hence $C_{z} \subseteq C_{x} \cap W$ . Suppose that $z \neq 0$ . Then z has $ϕ (t)$ as its T-annihilator, and therefore

$d = dim (C_{z}) \leq dim (C_{x} \cap W) \leq dim (C_{x}) = d .$

It follows that $C_{x} \cap W = C_{x}$ , and consequently $x \in W$ , contrary to hypothesis. Therefore $z = 0$ , from which it follows that $b_{j} = 0$ for all j. Since $β$ is linearly independent, it follows that $a_{i} = 0$ for all i. Thus $β \cup β_{x}$ is linearly independent.

(b) Suppose that W does not contain $N (ϕ (T))$ . Choose a vector $w_{1} \in N (ϕ (T))$ that is not in W. By (a), $β_{1} = β \cup β_{w_{1}}$ is linearly independent. Let $W_{1} = span (β_{1})$ . If $W_{1}$ does not contain $N (ϕ (T))$ , choose a vector $w_{2}$ in $N (ϕ (T))$ , but not in $W_{1}$ , so that $β_{2} = β_{1} \cup β_{w_{2}} = β \cup β_{w_{1}} \cup β_{w_{2}}$ is linearly independent. Continuing this process, we eventually obtain vectors $w_{1}, w_{2}, \dots, w_{s}$ in $N (ϕ (T))$ such that the union

$β^{'} = β \cup β_{w_{1}} \cup β_{w_{2}} \cup \dots \cup β_{w_{s}}$

is a linearly independent set whose span contains $N (ϕ (T))$ .

Theorem 7.21.

If the minimal polynomial of T is of the form $p (t) = {(ϕ (t))}^{m}$ , then there exists a rational canonical basis for T.

Proof.

The proof is by mathematical induction on m. Suppose that $m = 1$ . Apply (b) of the lemma to $W = {0}$ to obtain a linearly independent subset of V of the form $β_{v_{1}} \cup β_{v_{2}} \cup \dots \cup β_{v_{k}}$ , whose span contains $N (ϕ (T))$ . Since $V = N (ϕ (T))$ , this set is a rational canonical basis for V.

Now suppose that, for some integer $m > 1$ , the result is valid whenever the minimal polynomial of T is of the form ${(ϕ (t))}^{k}$ , where $k < m$ , and assume that the minimal polynomial of T is $p (t) = {(ϕ (t))}^{m}$ . Let $r = rank (ϕ (T))$ . Then $R (ϕ (T))$ is a T-invariant subspace of V, and the restriction of T to this subspace has ${(ϕ (t))}^{m - 1}$ as its minimal polynomial. Therefore we may apply the induction hypothesis to obtain a rational canonical basis for the restriction of T to R(T). Suppose that $v_{1}, v_{2}, \dots, v_{k}$ are the generating vectors of the T-cyclic bases that constitute this rational canonical basis. For each i, choose $w_{i}$ in V such that $v_{i} = ϕ (T) (w_{i})$ . By Theorem 7.20, the union $β$ of the sets $β_{w_{i}}$ is linearly independent. Let $W = span (β)$ . Then W contains $R (ϕ (T))$ . Apply (b) of the lemma and adjoin additional T-cyclic bases $β_{w_{k + 1}}, β_{w_{k + 2}}, \dots, β_{w_{s}}$ to $β$ , if necessary, where $w_{i}$ is in $N (ϕ (T))$ for $i \geq k$ , to obtain a linearly independent set

$β^{'} = β_{w_{1}} \cup β_{w_{2}} \cup \dots \cup β_{w_{k}} \cup \dots \cup β_{w_{s}}$

whose span $W^{'}$ contains both W and $N (ϕ (T))$ .

We show that $W^{'} = V$ . Let U denote the restriction of $ϕ (T)$ to $W^{'}$ , which is $ϕ (T)$ -invariant. By the way in which $W^{'}$ was obtained from $R (ϕ (T))$ , it follows that $R (U) = R (ϕ (T))$ and $N (U) = N (ϕ (T))$ . Therefore

$\begin{array}{rcl} dim (W^{'}) & = & rank (U) + nullity (U) \\ = & rank (ϕ (T)) + nullity (ϕ (T)) \\ = & dim (V) . \end{array}$

Thus $W^{'} = V$ , and $β^{'}$ is a rational canonical basis for T.

Corollary.

$K_{ϕ}$ has a basis consisting of the union of T-cyclic bases.

Proof.

Apply Theorem 7.21 to the restriction of T to $K_{ϕ}$ .

We are now ready to study the general case.

Theorem 7.22.

Every linear operator on a finite-dimensional vector space has a rational canonical basis and, hence, a rational canonical form.

Proof.

Let T be a linear operator on a finite-dimensional vector space V, and let $p (t) = {(ϕ_{1} (t))}^{m_{1}} {(ϕ_{2} (t))}^{m_{2}} \dots {(ϕ_{k} (t))}^{m_{k}}$ be the minimal polynomial of T, where the $ϕ_{i} (t)$ ’s are the distinct irreducible monic factors of p(t) and $m_{i} > 0$ for all i. The proof is by mathematical induction on k. The case $k = 1$ is proved in Theorem 7.21.

Suppose that the result is valid whenever the minimal polynomial contains fewer than k distinct irreducible factors for some $k > 1$ , and suppose that p(t) contains k distinct factors. Let U be the restriction of T to the T-invariant subspace $W = R ({(ϕ_{k} (T))}^{m_{k}})$ , and let q(t) be the minimal polynomial of U. Then q(t) divides p(t) by Exercise 10 of Section 7.3. Furthermore, $ϕ_{k} (t)$ does not divide q(t). For otherwise, there would exist a nonzero vector $x \in W$ such that $ϕ_{k} (U) (x) = 0$ and a vector $y \in V$ such that $x = {(ϕ_{k} (T))}^{m_{k}} (y)$ . It follows that ${(ϕ_{k} (T))}^{m_{k + 1}} (y) = 0$ , and hence $y \in K_{ϕ_{k}}$ and $x = {(ϕ_{k} (T))}^{m_{k}} (y) = 0$ by Theorem 7.18(e), a contradiction. Thus q(t) contains fewer than k distinct irreducible divisors. So by the induction hypothesis, U has a rational canonical basis $β_{1}$ consisting of a union of U-cyclic bases (and hence T-cyclic bases) of vectors from some of the subspaces $K_{ϕ_{i}}, 1 \leq i \leq k - 1$ . By the corollary to Theorem 7.21, $K_{ϕ_{k}}$ has a basis $β_{2}$ consisting of a union of T-cyclic bases. By Theorem 7.19, $β_{1}$ and $β_{2}$ are disjoint, and $β = β_{1} \cup β_{2}$ is linearly independent. Let s denote the number of vectors in $β$ .Then

$\begin{array}{rcl} s & = & \dim (R ({(ϕ_{k} (T))}^{m_{k}})) + \dim (K_{ϕ_{k}}) \\ = & rank ({(ϕ_{k} (T))}^{m_{k}}) + nullity ({(ϕ_{k} (T))}^{m_{k})} \\ = & n . \end{array}$

We conclude that $β$ is a basis for V. Therefore $β$ is a rational canonical basis, and T has a rational canonical form.

In our study of the rational canonical form, we relied on the minimal polynomial. We are now able to relate the rational canonical form to the characteristic polynomial.

Theorem 7.23.

Let T be a linear operator on an n-dimensional vector space V with characteristic polynomial

$f (t) = {(- 1)}^{n} {(ϕ_{1} (t))}^{n_{1}} {(ϕ_{2} (t))}^{n_{2}} \dots {(ϕ_{k} (t))}^{n_{k},}$

where the $ϕ_{i} (t)$ ’s $(1 \leq i \leq k)$ are distinct irreducible monic polynomials and the $n_{i}$ ’s are positive integers. Then the following statements are true.

(a) $ϕ_{1} (t), ϕ_{2} (t), \dots, ϕ_{k} (t)$ are the irreducible monic factors of the minimal polynomial.
(b) For each i, $\dim (K_{ϕ_{i}}) = d_{i} n_{i}$ , where $d_{i}$ is the degree of $ϕ_{i} (t)$ .
(c) If $β$ is a rational canonical basis for T, then $β_{i} = β \cap K_{ϕ_{i}}$ is a basis for $K_{ϕ_{i}}$ for each i.
(d) If $γ_{i}$ is a basis for $K_{ϕ_{i}}$ for each i, then $γ = γ_{1} \cup γ_{2} \cup \dots \cup γ_{k}$ is a basis for V. In particular, if each $γ_{i}$ is a disjoint union of T-cyclic bases, then $γ$ is a rational canonical basis for T.

Proof.

(a) By Theorem 7.22, T has a rational canonical form C. By Exercise 39 of Section 5.4, the characteristic polynomial of C, and hence of T, is the product of the characteristic polynomials of the companion matrices that compose C. Therefore each irreducible monic divisor $ϕ_{i} (t)$ of f(t) divides the characteristic polynomial of at least one of the companion matrices, and hence for some integer p, ${(ϕ_{i} (t))}^{p}$ is the T-annihilator of a nonzero vector of V. We conclude that ${(ϕ_{i} (t))}^{p}$ , and so $ϕ_{i} (t)$ , divides the minimal polynomial of T. Conversely, if $ϕ (t)$ is an irreducible monic polynomial that divides the minimal polynomial of T, then $ϕ (t)$ divides the characteristic polynomial of T because the minimal polynomial divides the characteristic polynomial.

(b), (c), and (d) Let $C = {[T]}_{β}$ , which is a rational canonical form of T. Consider any i $i (1 \leq i \leq k)$ . Since f(t) is the product of the characteristic polynomials of the companion matrices that compose C, we may multiply those characteristic polynomials that arise from the T-cyclic bases in $β_{i}$ to obtain the factor ${(ϕ_{i} (t))}^{n_{i}}$ of f(t). Since this polynomial has degree $n_{i} d_{i}$ , and the union of these bases is a linearly independent subset $β_{i}$ of $K_{ϕ_{i}}$ , we have

$n_{i} d_{i} \leq \dim (K_{ϕ_{i}}) .$

Furthermore, $n = \sum_{i = 1}^{k} d_{i} n_{i},$ because this sum is equal to the degree of f(t).

Now let s denote the number of vectors in $γ$ . By Theorem 7.19, $γ$ is linearly independent, and therefore

$n = \sum_{i = 1}^{k} d_{i} n_{i} \leq \sum_{i = 1}^{k} \dim (K_{ϕ_{i}}) = s \leq n .$

Hence $n = s$ , and $d_{i} n_{i} = \dim (K_{ϕ_{i}})$ for all i. It follows that $γ$ is a basis for V and $β_{i}$ is a basis for $K_{ϕ_{i}}$ for each i.

Uniqueness of the Rational Canonical Form

Having shown that a rational canonical form exists, we are now in a position to ask about the extent to which it is unique. Certainly, the rational canonical form of a linear operator T can be modified by permuting the T-cyclic bases that constitute the corresponding rational canonical basis. This has the effect of permuting the companion matrices that make up the rational canonical form. As in the case of the Jordan canonical form, we show that except for these permutations, the rational canonical form is unique, although the rational canonical bases are not.

To simplify this task, we adopt the convention of ordering every rational canonical basis so that all the T-cyclic bases associated with the same irreducible monic divisor of the characteristic polynomial are grouped together. Furthermore, within each such grouping, we arrange the T-cyclic bases in decreasing order of size. Our task is to show that, subject to this order, the rational canonical form of a linear operator is unique up to the arrangement of the irreducible monic divisors.

As in the case of the Jordan canonical form, we introduce arrays of dots from which we can reconstruct the rational canonical form. For the Jordan canonical form, we devised a dot diagram for each eigenvalue of the given operator. In the case of the rational canonical form, we define a dot diagram for each irreducible monic divisor of the characteristic polynomial of the given operator. A proof that the resulting dot diagrams are completely determined by the operator is also a proof that the rational canonical form is unique.

In what follows, T is a linear operator on a finite-dimensional vector space with rational canonical basis $β, ϕ (t)$ is an irreducible monic divisor of the characteristic polynomial of $β_{v_{1}}, β_{v_{2}}, \dots, β_{v_{k}}$ are the T-cyclic bases of $β$ that are contained in $K_{ϕ}$ ; and d is the degree of $ϕ (t)$ . For each j, let ${(ϕ (t))}^{p_{j}}$ be the annihilator of $v_{j}$ . This polynomial has degree $d p_{j}$ ; therefore, by Exercise 15 of Section 7.3, $β_{v_{j}}$ contains $d_{p_{j}}$ vectors. Furthermore, $p_{1} \geq p_{2} \geq \dots \geq p_{k}$ since the T-cyclic bases are arranged in decreasing order of size. We define the dot diagram of $ϕ (t)$ to be the array consisting of k columns of dots with $p_{j}$ dots in the jth column, arranged so that the jth column begins at the top and terminates after $p_{j}$ dots. For example, if $k = 3, p_{1} = 4, p_{2} = 2$ , and $p_{3} = 2$ , then the dot diagram is

A dot diagram with 4 rows and 3 columns.

7.4-13 Full Alternative Text

Although each column of a dot diagram corresponds to a T-cyclic basis $β_{v_{i}}$ in $K_{ϕ}$ , there are fewer dots in the column than there are vectors in the basis.

Example 2

Recall the linear operator T of Example 1 with the rational canonical basis $β$ and the rational canonical form $C = {[T]}_{β}$ . Since there are two irreducible monic divisors of the characteristic polynomial of T, $ϕ_{1} (t) = t^{2} - t + 3$ and $ϕ_{2} (t) = t^{2} + 1$ , there are two dot diagrams to consider. Because $ϕ_{1} (t)$ is the T-annihilator of $v_{1}$ and $β_{v_{1}}$ is a basis for $K_{ϕ_{1}}$ , the dot diagram for $ϕ_{1} (t)$ consists of a single dot. The other two T-cyclic bases, $β_{v_{3}}$ and $β_{v_{7}}$ , lie in $K_{ϕ_{2}}$ . Since $v_{3}$ has T-annihilator ${(ϕ_{2} (t))}^{2}$ and $v_{7}$ has T-annihilator $ϕ_{2} (t)$ , in the dot diagram of $ϕ_{2} (t)$ we have $p_{1} = 2$ and $p_{2} = 1$ . These diagrams are as follows:

A diagram of a dot features 1 dot at the top. Text Below reads, Dot diagram for phi sub t.

A dot diagram contains 2 columns and 2 rows.

7.4-15 Full Alternative Text

In practice, we obtain the rational canonical form of a linear operator from the information provided by dot diagrams. This is illustrated in the next example.

Example 3

Let T be a linear operator on a finite-dimensional vector space over R, and suppose that the irreducible monic divisors of the characteristic polynomial of T are

$ϕ_{1} (t) = t - 1, ϕ_{2} (t) = t^{2} + 2, and ϕ_{3} (t) = t^{2} + t + 1.$

Suppose, furthermore, that the dot diagrams associated with these divisors are as follows:

A dot diagram contains 2 columns and 2 rows. The top row on the left is a dot. To the right of the dot is a dot. The second row from the top left is a dot. Below the diagram is the label Diagram for psi sub 1 t.

A dot diagram consists of 1 row of two dots. Text Below reads, Diagram for psi sub 2 t.

A dot diagram contains 1 dot at the top in the middle. Below the diagram is the label Diagram for psi sub 3 t.

Since the dot diagram for $ϕ_{1} (t)$ has two columns, it contributes two companion matrices to the rational canonical form. The first column has two dots, and therefore corresponds to the $2 \times 2$ companion matrix of ${(ϕ_{1} (t))}^{2} = {(t - 1)}^{2}$ . The second column, with only one dot, corresponds to the $1 \times 1$ companion matrix of $ϕ_{1} (t) = t - 1$ . These two companion matrices are given by

$C_{1} = (\begin{array}{r} 0 & - 1 \\ 1 & 2 \end{array}) and C_{2} = (1) .$

The dot diagram for $ϕ_{2} (t) = t^{2} + 2$ consists of two columns. each containing a single dot; hence this diagram contributes two copies of the $2 \times 2$ companion matrix for $ϕ_{2} (t)$ , namely,

$C_{3} = C_{4} = (\begin{array}{r} 0 & - 2 \\ 1 & 0 \end{array}) .$

The dot diagram for $ϕ_{3} (t) = t^{2} + t + 1$ consists of a single column with a single dot contributing the single $2 \times 2$ companion matrix

$C_{5} = (\begin{matrix} 0 & - 1 \\ 1 & - 1 \end{matrix}) .$

Therefore the rational canonical form of T is the $9 \times 9$ matrix

A rational canonical form of T is a 9 by 9 matrix.

7.4-19 Full Alternative Text

We return to the general problem of finding dot diagrams. As we did before, we fix a linear operator T on a finite-dimensional vector space and an irreducible monic divisor $ϕ (t)$ of the characteristic polynomial of T. Let U denote the restriction of the linear operator $ϕ (T)$ to $K_{ϕ}$ . By Theorem 7.18(d), $U^{q} = T_{0}$ for some positive integer q. Consequently, by Exercise 12 of Section 7.2, the characteristic polynomial of U is ${(- 1)}^{m} t^{m}$ , where $m = dim (K_{ϕ})$ . Therefore $K_{ϕ}$ is the generalized eigenspace of U corresponding to $λ = 0$ , and U has a Jordan canonical form. The dot diagram associated with the Jordan canonical form of U gives us a key to understanding the dot diagram of T that is associated with $ϕ (t)$ . We now relate the two diagrams.

Let $β$ be a rational canonical basis for T, and $β_{v_{1}}, β_{v_{2}}, \dots, β_{v_{k}}$ be the T-cyclic bases of $β$ that are contained in $K_{ϕ}$ . Consider one of these T-cyclic bases $β_{v_{j}}$ , and suppose again that the T-annihilator of $v_{j}$ is ${(ϕ (t))}^{p_{j}}$ . Then $β_{v_{j}}$ consists of $d p_{j}$ vectors in $β$ . For $0 \leq i < d$ , let $γ_{i}$ be the cycle of generalized eigenvectors of U corresponding to $λ = 0$ with end vector $T^{i} (v_{j})$ , where $T^{0} (v_{j}) = b_{j}$ . Then

$γ_{i} = {{(ϕ (T))}^{p_{j} - 1} T^{i} (v_{j}), {(ϕ (T))}^{p_{j} - 2} T^{i} (v_{j}), \dots, (ϕ (T)) T^{i} (v_{j}), T^{i} (v_{j})} .$

By Theorem 7.1 (p. 478), $γ_{i}$ is a linearly independent subset of $C_{v_{i}}$ . Now let

$α_{j} = γ_{0} \cup γ_{1} \cup \dots \cup γ_{d - 1} .$

Notice that $α_{j}$ contains $d p_{j}$ vectors.

Lemma 1.

$α_{j}$ is an ordered basis for $C_{v_{j}}$ .

Proof. The key to this proof is Theorem 7.4 (p. 480). Since $α_{j}$ is the union of cycles of generalized eigenvectors of U corresponding to $λ = 0$ , it suffices to show that the set of initial vectors of these cycles

${{(ϕ (T))}^{p_{j} - 1} (v_{j}), {(ϕ (T))}^{p_{j} - 1} T (v_{j}), \dots, {(ϕ (T))}^{p_{j} - 1} T^{d - 1} (v_{j})}$

is linearly independent. Consider any linear combination of these vectors

$a_{0} {(ϕ (T))}^{p_{j} - 1} (v_{j}) + a_{1} {(ϕ (T))}^{p_{j} - 1} T (v_{j}) + \dots + a_{d - 1} {(ϕ (T))}^{p_{j} - 1} T^{d - 1} (v_{j}),$

where not all of the coefficients are zero. Let g(t) be the polynomial defined by $g (t) = a_{0} + a_{1} t + \dots + a_{d - 1} t^{d - 1}$ . Then g(t) is a nonzero polynomial of degree less than d, and hence ${(ϕ (t))}^{p_{j} - 1} g (t)$ is a nonzero polynomial with degree less than $d p_{j}$ . Since ${(ϕ (t))}^{p_{j}}$ is the T-annihilator of $v_{j}$ , it follows that ${(ϕ (T))}^{p_{j} - 1} g (T) (v_{j}) \neq 0$ . Therefore the set of initial vectors is linearly independent. So by Theorem 7.4, $α_{j}$ is linearly independent, and the $γ_{i}$ ’s are disjoint. Consequently, $α_{j}$ consists of $d p_{j}$ linearly independent vectors in $C_{v_{j}}$ , which has dimension $d p_{j}$ . We conclude that $α_{j}$ is a basis for $C_{v_{j}}$ .

Thus we may replace $β_{v_{j}}$ by $α_{j}$ as a basis for $C_{v_{j}}$ . We do this for each j to obtain a subset $α = α_{1} \cup α_{2} \cup \dots \cup α_{k}$ of $K_{ϕ}$ .

Lemma 2.

$α$ is a Jordan canonical basis for $K_{ϕ}$ .

Proof. Since $β_{v_{1}} \cup β_{v_{2}} \cup \dots \cup β_{v_{k}}$ is a basis for $K_{ϕ}$ , and since $span (α_{i}) = span (β_{v_{i}}) = C_{v_{i}}$ , Exercise 9 implies that $α$ is a basis for $K_{ϕ}$ . Because $α$ is a union of cycles of generalized eigenvectors of U, we conclude that $α$ is a Jordan canonical basis.

We are now in a position to relate the dot diagram of T corresponding to $ϕ (t)$ to the dot diagram of U, bearing in mind that in the first case we are considering a rational canonical form and in the second case we are considering a Jordan canonical form. For convenience, we designate the first diagram $D_{1}$ , and the second diagram $D_{2}$ . For each j, the presence of the T-cyclic basis $β_{x_{j}}$ results in a column of $p_{j}$ dots in $D_{1}$ . By Lemma 1, this basis is replaced by the union $α_{j}$ of d cycles of generalized eigenvectors of U, each of length $p_{j}$ , which becomes part of the Jordan canonical basis for U. In effect, $α_{j}$ determines d columns each containing $p_{j}$ dots in $D_{2}$ . So each column in $D_{1}$ determines d columns in $D_{2}$ of the same length, and all columns in $D_{2}$ are obtained in this way. Alternatively, each row in $D_{2}$ has d times as many dots as the corresponding row in $D_{1}$ . Since Theorem 7.10 (p. 493) gives us the number of dots in any row of $D_{2}$ , we may divide the appropriate expression in this theorem by d to obtain the number of dots in the corresponding row of $D_{1}$ . Thus we have the following result.

Theorem 7.24.

Let T be a linear operator on a finite-dimensional vector space V, let $ϕ (t)$ be an irreducible monic divisor of the characteristic polynomial of T of degree d, and let $r_{i}$ denote the number of dots in the ith row of the dot diagram for $ϕ (t)$ with respect to a rational canonical basis for T. Then

(a) $r_{1} = \frac{1}{d} [dim (V) - rank (ϕ (T))]$ ;
(b) $r_{i} = \frac{1}{d} [rank ({(ϕ (T))}^{i - 1}) - rank ({(ϕ (T))}^{i})]$ for $i > 1$ .

Thus the dot diagrams associated with a rational canonical form of an operator are completely determined by the operator. Since the rational canonical form is completely determined by its dot diagrams, we have the following uniqueness condition.

Corollary.

Under the conventions described earlier, the rational canonical form of a linear operator is unique up to the arrangement of the irreducible monic divisors of the characteristic polynomial.

Since the rational canonical form of a linear operator is unique, the polynomials corresponding to the companion matrices that determine this form are also unique. These polynomials, which are powers of the irreducible monic divisors, are called the elementary divisors of the linear operator. Since a companion matrix may occur more than once in a rational canonical form, the same is true for the elementary divisors. We call the number of such occurrences the multiplicity of the elementary divisor.

Conversely, the elementary divisors and their multiplicities determine the companion matrices and, therefore, the rational canonical form of a linear operator.

Example 4

Let

$β = {e^{x} \cos 2 x, e^{x} \sin 2 x, x e^{x} \cos 2 x, x e^{x} \sin 2 x}$

be viewed as a subset of $F (R, R)$ , the space of all real-valued functions defined on R, and let $V = span (β)$ . Then V is a four-dimensional subspace of $F (R, R)$ , and $β$ is an ordered basis for V. Let D be the linear operator on V defined by $D (y) = y^{'}$ , the derivative of y, and let $A = {[D]}_{β}$ . Then

$A = (\begin{array}{r} 1 & 2 & 1 & 0 \\ - 2 & 1 & 0 & 1 \\ 0 & 0 & 1 & 2 \\ 0 & 0 & - 2 & 1 \end{array}),$

and the characteristic polynomial of D, and hence of A, is

$f (t) = {(t^{2} - 2 t + 5)}^{2} .$

Thus $ϕ (t) = t^{2} - 2 t + 5$ is the only irreducible monic divisor of f(t). Since $ϕ (t)$ has degree 2 and V is four-dimensional, the dot diagram for $ϕ (t)$ contains only two dots. Therefore the dot diagram is determined by $r_{1}$ , the number of dots in the first row. Because ranks are preserved under matrix representations, we can use A in place of D in the formula given in Theorem 7.24. Now

$ϕ (A) = (\begin{array}{r} 0 & 0 & 0 & 4 \\ 0 & 0 & - 4 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{array}),$

and so

$r_{1} = \frac{1}{2} [4 - rank (ϕ (A))] = \frac{1}{2} [4 - 2] = 1.$

It follows that the second dot lies in the second row, and the dot diagram is as follows:

Hence V is a D-cyclic space generated by a single function with D-annihilator ${(ϕ (t))}^{2}$ . Furthermore, its rational canonical form is given by the companion matrix of ${(ϕ (t))}^{2} = t^{4} - 4 t^{3} + 14 t^{2} - 20 t + 25$ , which is

$(\begin{array}{r} 0 & 0 & 0 & - 25 \\ 1 & 0 & 0 & 20 \\ 0 & 1 & 0 & - 14 \\ 0 & 0 & 1 & 4 \end{array}) .$

Thus ${(ϕ (t))}^{2}$ is the only elementary divisor of D, and it has multiplicity 1. For the cyclic generator, it suffices to find a function g in V for which $ϕ (D) (g) \neq 0$ . Since $ϕ (A) (e_{3}) \neq 0$ , it follows that $ϕ (D) (x e^{x} \cos 2 x) \neq 0$ ; therefore $g (x) = x e^{x} \cos 2 x$ can be chosen as the cyclic generator. Hence

$β_{g} = {x e^{x} \cos 2 x, D (x e^{x} \cos 2 x), D^{2} (x e^{x} \cos 2 x), D^{3} (x e^{x} \cos 2 x)}$

is a rational canonical basis for D. Notice that the function h defined by $h (x) = x e^{x} \sin 2 x$ can be chosen in place of g. This shows that the rational canonical basis is not unique.

It is convenient to refer to the rational canonical form and elementary divisors of a matrix, which are defined in the obvious way.

Definitions.

Let $A \in M_{n \times n} (F)$ . The rational canonical form of A is defined to be the rational canonical form of $L_{A}$ . Likewise, for A, the elementary divisors and their multiplicities are the same as those of $L_{A}$ .

Let A be an $n \times n$ matrix, let C be a rational canonical form of A, and let $β$ be the appropriate rational canonical basis for $L_{A}$ . Then $C = {[L_{A}]}_{β}$ , and therefore A is similar to C. In fact, if Q is the matrix whose columns are the vectors of $β$ in the same order, then $Q^{- 1} A Q = C$ .

Example 5

For the following real matrix A, we find the rational canonical form C of A and a matrix Q such that $Q^{- 1} A Q = C$ .

$A = (\begin{array}{r} 0 & 2 & 0 & - 6 & 2 \\ 1 & - 2 & 0 & 0 & 2 \\ 1 & 0 & 1 & - 3 & 2 \\ 1 & - 2 & 1 & - 1 & 2 \\ 1 & - 4 & 3 & - 3 & 4 \end{array})$

The characteristic polynomial of A is $f (t) = - {(t^{2} + 2)}^{2} (t - 2)$ ; therefore $ϕ_{1} (t) = t^{2} + 2$ and $ϕ_{2} (t) = t - 2$ are the distinct irreducible monic divisors of f(t). By Theorem 7.23, $dim (K_{ϕ_{1}}) = 4$ and $dim (K_{ϕ_{2}}) = 1$ . Since the degree of $ϕ_{1} (t)$ is 2, the total number of dots in the dot diagram of $ϕ_{1} (t)$ is $4 / 2 = 2$ , and the number of dots $r_{1}$ in the first row is given by

$\begin{array}{rcl} r_{1} & = & \frac{1}{2} [dim (R^{5}) - rank (ϕ_{1} (A))] \\ = & \frac{1}{2} [5 - rank (A^{2} + 2 I)] \\ = & \frac{1}{2} [5 - 1] = 2. \end{array}$

Thus the dot diagram of $ϕ_{1} (t)$ is

and each column contributes the companion matrix

$(\begin{array}{r} 0 & - 2 \\ 1 & 0 \end{array})$

for $ϕ_{1} (t) = t^{2} + 2$ to the rational canonical form C. Consequently $ϕ_{1} (t)$ is an elementary divisor with multiplicity 2. Since $dim (K_{ϕ_{2}}) = 1$ , the dot diagram of $ϕ_{2} (t) = t - 2$ consists of a single dot, which contributes the $1 \times 1$ matrix (2). Hence $ϕ_{2} (t)$ is an elementary divisor with multiplicity 1. Therefore the rational canonical form C is

A diagram of the rational canonical form of C is a 5 X 5 matrix.

7.4-22 Full Alternative Text

We can infer from the dot diagram of $ϕ_{1} (t)$ that if $β$ is a rational canonical basis for $L_{A}$ , then $β \cap K_{ϕ_{1}}$ is the union of two cyclic bases $β_{v_{1}}$ and $β_{v_{2}}$ , where $v_{1}$ and $v_{2}$ each have annihilator $ϕ_{1} (t)$ . It follows that both $v_{1}$ and $v_{2}$ lie in $N (ϕ_{1} (L_{A}))$ . It can be shown that

${(\begin{array}{r} 1 \\ 0 \\ 0 \\ 0 \\ 0 \end{array}), (\begin{array}{r} 0 \\ 1 \\ 0 \\ 0 \\ 0 \end{array}), (\begin{array}{r} 0 \\ 0 \\ 2 \\ 1 \\ 0 \end{array}), (\begin{array}{r} 0 \\ 0 \\ - 1 \\ 0 \\ 1 \end{array})}$

is a basis for $N (ϕ_{1} (L_{A}))$ . Setting $v_{1} = e_{1}$ , we see that

$A v_{1} = (\begin{array}{r} 0 \\ 1 \\ 1 \\ 1 \\ 1 \end{array}) .$

Next choose $v_{2}$ in $K_{ϕ_{1}} = N (ϕ (L_{A}))$ , but not in the span of $β_{v_{1}} = {v_{1}, A v_{1}}$ . For example, $v_{2} = e_{2}$ . Then it can be seen that

$A v_{2} = (\begin{array}{r} 2 \\ - 2 \\ 0 \\ - 2 \\ - 4 \end{array}),$

and $β_{v_{1}} \cup β_{v_{2}}$ is a basis for $K_{ϕ_{1}}$ .

Since the dot diagram of $ϕ_{2} (t) = t - 2$ consists of a single dot, any nonzero vector in $K_{ϕ_{2}}$ is an eigenvector of A corresponding to the eigenvalue $λ = 2$ . For example, choose

$v_{3} = (\begin{array}{r} 0 \\ 1 \\ 1 \\ 1 \\ 2 \end{array}) .$

By Theorem 7.23, $β = {v_{1}, A v_{1}, v_{2}, A v_{2}, v_{3}}$ is a rational canonical basis for $L_{A}$ . So setting

$Q = (\begin{array}{r} 1 & 0 & 0 & 2 & 0 \\ 0 & 1 & 1 & - 2 & 1 \\ 0 & 1 & 0 & 0 & 1 \\ 0 & 1 & 0 & - 2 & 1 \\ 0 & 1 & 0 & - 4 & 2 \end{array}),$

we have $Q^{- 1} A Q = C$ .

Example 6

For the following matrix A, we find the rational canonical form C and a matrix Q such that $Q^{- 1} A Q = C$ .

$A = (\begin{array}{r} 2 & 1 & 0 & 0 \\ 0 & 2 & 1 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 2 \end{array})$

Since the characteristic polynomial of A is $f (t) = {(t - 2)}^{4}$ , the only irreducible monic divisor of f(t) is $ϕ (t) = t - 2$ , and so $K_{ϕ} = R^{4}$ . In this case, $ϕ (t)$ has degree 1; hence in applying Theorem 7.24 to compute the dot diagram for $ϕ (t)$ , we obtain

$\begin{array}{rcl} r_{1} & = & 4 - rank (ϕ (A)) = 4 - 2 = 2, \\ r_{2} & = & rank (ϕ (A)) - rank ({(ϕ (A))}^{2}) = 2 - 1 = 1, \end{array}$

and

$r_{3} = rank ({(ϕ (A))}^{2}) - rank ({(ϕ (A))}^{3}) = 1 - 0 = 1,$

where $r_{i}$ is the number of dots in the ith row of the dot diagram. Since there are $dim (R^{4}) = 4$ dots in the diagram, we may terminate these computations with $r_{3}$ . Thus the dot diagram for A is

Since ${(t - 2)}^{3}$ has the companion matrix

$(\begin{array}{r} 0 & 0 & 8 \\ 1 & 0 & - 12 \\ 0 & 1 & 6 \end{array})$

and $(t - 2)$ has the companion matrix (2), the rational canonical form of A is given by

$C = (\begin{array}{r} 0 & 0 & 8 & 0 \\ 1 & 0 & - 12 & 0 \\ 0 & 1 & 6 & 0 \\ 0 & 0 & 0 & 2 \end{array}) .$

Next we find a rational canonical basis for $L_{A}$ . The preceding dot diagram indicates that there are two vectors $v_{1}$ and $v_{2}$ in $R^{4}$ with annihilators ${(ϕ (t))}^{3}$ and $ϕ (t)$ , respectively, and such that

$β = β_{v_{1}} \cup β_{v_{2}} = {v_{1}, A v_{1}, A^{2} v_{1}, v_{2}}$

is a rational canonical basis for $L_{A}$ . Furthermore, $v_{1} \notin N ({(L_{A} - 2 I)}^{2})$ , and $v_{2} \in N (L_{A} - 2 I)$ . It can easily be shown that

$N (L_{A} - 2 I) = span ({e_{1}, e_{4}})$

and

$N ({(L_{A} - 2 I)}^{2}) = span ({e_{1}, e_{2}, e_{4}}) .$

The standard vector $e_{3}$ meets the criteria for $v_{1}$ ; so we set $v_{1} = e_{3}$ . It follows that

$A v_{1} = (\begin{matrix} 0 \\ 1 \\ 2 \\ 0 \end{matrix}) and A^{2} v_{1} = (\begin{matrix} 1 \\ 4 \\ 4 \\ 0 \end{matrix}) .$

Next we choose a vector $v_{2} \in N (L_{A} - 2 I)$ that is not in the span of $β_{v_{1}}$ . Clearly, $v_{2} = e_{4}$ satisfies this condition. Thus

${(\begin{array}{r} 0 \\ 0 \\ 1 \\ 0 \end{array}), (\begin{array}{r} 0 \\ 1 \\ 2 \\ 0 \end{array}), (\begin{array}{r} 1 \\ 4 \\ 4 \\ 0 \end{array}), (\begin{array}{r} 0 \\ 0 \\ 0 \\ 1 \end{array})}$

is a rational canonical basis for $L_{A}$ .

Finally, let Q be the matrix whose columns are the vectors of $β$ in the same order:

$Q = (\begin{array}{r} 0 & 0 & 1 & 0 \\ 0 & 1 & 4 & 0 \\ 1 & 2 & 4 & 0 \\ 0 & 0 & 0 & 1 \end{array}) .$

Then $C = Q^{- 1} A Q$ .

Direct Sums*

The next theorem is a simple consequence of Theorem 7.23.

Theorem 7.25. (Primary Decomposition Theorem)

Let T be a linear operator on an n-dimensional vector space V with characteristic polynomial

$f (t) = {(- 1)}^{n} {(ϕ_{1} (t))}^{n_{1}} {(ϕ_{2} (t))}^{n_{2}} \dots {(ϕ_{k} (t))}^{n_{k}},$

where the $ϕ_{i} (t)' s (1 \leq i \leq k)$ are distinct irreducible monic polynomials and the $n_{i}$ ’s are positive integers. Then the following statements are true.

$V = K_{ϕ_{1}} \oplus K_{ϕ_{2}} \oplus \dots \oplus K_{ϕ_{k}}$ .
If $T_{i} (1 \leq i \leq k)$ is the restriction of T to $K_{ϕ_{i}}$ and $C_{i}$ is the rational canonical form of $T_{i}$ , then $C_{1} \oplus C_{2} \oplus \dots \oplus C_{k}$ is the rational canonical form of T.

Proof.

Exercise.

The next theorem is a simple consequence of Theorem 7.17.

Theorem 7.26.

Let T be a linear operator on a finite-dimensional vector space V. Then V is a direct sum of T-cyclic subspaces $C_{v_{i}}$ , where each $v_{i}$ lies in $K_{ϕ}$ for some irreducible monic divisor $ϕ (t)$ of the characteristic polynomial of T.

Proof.

Exercise.

Exercises

Label the following statements as true or false.
1. (a) Every rational canonical basis for a linear operator T is the union of T-cyclic bases.
2. (b) If a basis is the union of T-cyclic bases for a linear operator T, then it is a rational canonical basis for T.
3. (c) There exist square matrices having no rational canonical form.
4. (d) A square matrix is similar to its rational canonical form.
5. (e) For any linear operator T on a finite-dimensional vector space, any irreducible factor of the characteristic polynomial of T divides the minimal polynomial of T.
6. (f) Let $ϕ (t)$ be an irreducible monic divisor of the characteristic polynomial of a linear operator T. The dots in the diagram used to compute the rational canonical form of the restriction of T to $K_{ϕ}$ are in one-to-one correspondence with the vectors in a basis for $K_{ϕ}$ .
7. (g) If a matrix has a Jordan canonical form, then its Jordan canonical form and rational canonical form are similar.
For each of the following matrices $A \in M_{n \times n} (F)$ , find the rational canonical form C of A and a matrix $Q \in M_{n \times n} (F)$ such that $Q^{- 1} A Q = C$ .
1. (a) $A = (\begin{array}{r} 3 & 1 & 0 \\ 0 & 3 & 1 \\ 0 & 0 & 3 \end{array}) F = R$
2. (b) $A = (\begin{array}{r} 0 & - 1 \\ 1 & - 1 \end{array}) F = R$
3. (c) $A = (\begin{array}{r} 0 & - 1 \\ 1 & - 1 \end{array}) F = C$
4. (d) $A = (\begin{array}{r} 0 & - 7 & 14 & - 6 \\ 1 & - 4 & 6 & - 3 \\ 0 & - 4 & 9 & - 4 \\ 0 & - 4 & 11 & - 5 \end{array}) F = R$
5. (e) $A = (\begin{array}{r} 0 & - 4 & 12 & - 7 \\ 1 & - 1 & 3 & - 3 \\ 0 & - 1 & 6 & - 4 \\ 0 & - 1 & 8 & - 5 \end{array}) F = R$
For each of the following linear operators T, find the elementary divisors, the rational canonical form C, and a rational canonical basis $β$ .
1. (a) T is the linear operator on $P_{3} (R)$ defined by
  
  $T (f (x)) = f (0) x - f^{'} (1) .$
2. (b) Let $S = {\sin x, \cos x, x \sin x, x \cos x}$ , a subset of $F (R, R)$ , and let $V = span (S)$ . Define T to be the linear operator on V such that
  
  $T (f) = f^{'} .$
3. (c) T is the linear operator on $M_{2 \times 2} (R)$ defined by
  
  $T (A) = (\begin{array}{r} 0 & 1 \\ - 1 & 1 \end{array}) \cdot A .$
4. (d) Let $S = {\sin x \sin y, \sin x \cos y, \cos x \sin y, \cos x \cos y}$ , a subset of $F (R \times R, R)$ , and let $V = span (S)$ . Define T to be the linear operator on V such that
  
  $T (f) (x, y) = \frac{\partial f (x, y)}{\partial x} + \frac{\partial f (x, y)}{\partial y} .$
Let T be a linear operator on a finite-dimensional vector space V with minimal polynomial ${(ϕ (t))}^{m}$ for some positive integer m.
1. (a) Prove that $R (ϕ (T)) \subseteq N ({(ϕ (T))}^{m - 1})$ .
2. (b) Give an example to show that the subspaces in (a) need not be equal.
3. (c) Prove that the minimal polynomial of the restriction of T to $R (ϕ (T))$ equals ${(ϕ (t))}^{m - 1}$ .
Let T be a linear operator on a finite-dimensional vector space. Prove that the rational canonical form of T is a diagonal matrix if and only if T is diagonalizable. Visit goo.gl/tK8pru for a solution.
Let T be a linear operator on a finite-dimensional vector space V with characteristic polynomial $f (t) = {(- 1)}^{n} ϕ_{1} (t) ϕ_{2} (t)$ , where $ϕ_{1} (t)$ and $ϕ_{2} (t)$ are distinct irreducible monic polynomials and $n = dim (V)$ .
1. (a) Prove that there exist $v_{1}, v_{2} \in V$ such that $v_{1}$ has T-annihilator $ϕ_{1} (t), v_{2}$ has T-annihilator $ϕ_{2} (t)$ , and $β_{v_{1}} \cup β_{v_{2}}$ is a basis for V.
2. (b) Prove that there is a vector $v_{3} \in V$ with T-annihilator $ϕ_{1} (t) ϕ_{2} (t)$ such that $β_{v_{3}}$ is a basis for V.
3. (c) Describe the difference between the matrix representation of T with respect to $β_{v_{1}} \cup β_{v_{2}}$ and the matrix representation of T with respect to $β_{v_{3}}$ .
Thus, to assure the uniqueness of the rational canonical form, we require that the generators of the T-cyclic bases that constitute a rational canonical basis have T-annihilators equal to powers of irreducible monic factors of the characteristic polynomial of T.
Let T be a linear operator on a finite-dimensional vector space with minimal polynomial

$f (t) = {(ϕ_{1} (t))}^{m_{1}} {(ϕ_{2} (t))}^{m_{2}} \dots {(ϕ_{k} (t))}^{m_{k}},$

where the $ϕ_{i} (t)$ ’s are distinct irreducible monic factors of f(t). Prove that for each i, $m_{i}$ is the number of entries in the first column of the dot diagram for $ϕ_{i} (t)$ .
Let T be a linear operator on a finite-dimensional vector space V. Prove that for any irreducible polynomial $ϕ (t)$ , if $ϕ (T)$ is not one-to-one, then $ϕ (t)$ divides the characteristic polynomial of T. Hint: Apply Exercise 15 of Section 7.3.
Let V be a vector space and $β_{1}, β_{2}, \dots, β_{k}$ be disjoint subsets of V whose union is a basis for V. Now suppose that $γ_{1}, γ_{2}, \dots, γ_{k}$ are linearly independent subsets of V such that $span (γ_{i}) = span (β_{i})$ for all i. Prove that $γ_{1} \cup γ_{2} \cup \dots \cup γ_{k}$ is also a basis for V.
Let T be a linear operator on a finite-dimensional vector space, and suppose that $ϕ (t)$ is an irreducible monic factor of the characteristic polynomial of T. Prove that if $ϕ (t)$ is the T-annihilator of vectors x and y, then $x \in C_{y}$ if and only if $C_{x} = C_{y}$ .

Exercises 11 and 12 are concerned with direct sums.

Prove Theorem 7.25.
Prove Theorem 7.26.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 7.4* The Rational Canonical Form

Create new playlist

Sign In

Sign Up

Table of Contents for
7.4* The Rational Canonical Form