5.4 Invariant Subspaces and the Cayley-Hamilton Theorem

In Section 5.1, we observed that if v is an eigenvector of a linear operator T, then T maps the span of {v} into itself. Subspaces that are mapped into themselves are of great importance in the study of linear operators (see, e.g., Exercises 29-33 of Section 2.1).

Definition.

Let T be a linear operator on a vector space V. A subspace W of V is called a T-invariant subspace of V if T(W) ⊆ W, that is, if T(v) ∈ W for all v ∈ W.

Example 1

Suppose that T is a linear operator on a vector space V. Then the following subspaces of V are T-invariant:

  1. (1) {0}

  2. (2) V

  3. (3) R(T)

  4. (4) N(T)

  5. (5) E_λ, for any eigenvalue λ of T.

The proofs that these subspaces are T-invariant are left as exercises (see Exercise 3).

Example 2

Let T be the linear operator on R³ defined by

T(a, b, c) = (a + b, b + c, 0).

Then the xy-plane = {(x, y, 0): x, y ∈ R} and the x-axis = {(x, 0, 0): x ∈ R} are T-invariant subspaces of R³.
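As a quick sanity check of Example 2 (not part of the text), one can represent T by its standard matrix and confirm numerically that vectors in the xy-plane are sent back into the xy-plane; the sample vectors below are arbitrary choices.

```python
import numpy as np

# Standard matrix of T(a, b, c) = (a + b, b + c, 0) from Example 2.
T = np.array([[1, 1, 0],
              [0, 1, 1],
              [0, 0, 0]])

# A few vectors in the xy-plane (third coordinate zero).
vectors = [np.array([x, y, 0]) for x, y in [(1, 0), (0, 1), (2, -3), (5, 7)]]

# T-invariance of the xy-plane: every image should again have third coordinate zero.
print(all((T @ v)[2] == 0 for v in vectors))   # expected: True
```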

Let T be a linear operator on a vector space V, and let x be a nonzero vector in V. The subspace

W = span({x, T(x), T²(x), …})

is called the T-cyclic subspace of V generated by x. It is a simple matter to show that W is T-invariant. In fact, W is the “smallest” T-invariant subspace of V containing x. That is, any T-invariant subspace of V containing x must also contain W (see Exercise 11). Cyclic subspaces have various uses. We apply them in this section to establish the Cayley-Hamilton theorem. In Exercise 31, we outline a method for using cyclic subspaces to compute the characteristic polynomial of a linear operator without resorting to determinants. Cyclic subspaces also play an important role in Chapter 7, where we study matrix representations of nondiagonalizable linear operators.

Example 3

Let T be the linear operator on R³ defined by

T(a, b, c) = (−b + c, a + c, 3c).

We determine the T-cyclic subspace generated by e_1 = (1, 0, 0). Since

T(e_1) = T(1, 0, 0) = (0, 1, 0) = e_2

and

T²(e_1) = T(T(e_1)) = T(e_2) = (−1, 0, 0) = −e_1,

it follows that

span({e_1, T(e_1), T²(e_1), …}) = span({e_1, e_2}) = {(s, t, 0): s, t ∈ R}.
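A T-cyclic subspace can be found mechanically by applying T to the generator until the next iterate becomes linearly dependent on those already collected. The sketch below (not from the text) does this for the operator of Example 3, using a rank test as the dependence check; the function name is our own.

```python
import numpy as np

def cyclic_basis(T, x):
    """Basis {x, T(x), ..., T^{k-1}(x)} of the T-cyclic subspace generated by x,
    where T is given by a square matrix and x is a nonzero vector."""
    basis = [x]
    v = T @ x
    # Adjoin T^j(x) as long as it is independent of the vectors collected so far.
    while np.linalg.matrix_rank(np.column_stack(basis + [v])) > len(basis):
        basis.append(v)
        v = T @ v
    return basis

# Standard matrix of T(a, b, c) = (-b + c, a + c, 3c) from Example 3.
T = np.array([[0.0, -1.0, 1.0],
              [1.0,  0.0, 1.0],
              [0.0,  0.0, 3.0]])
e1 = np.array([1.0, 0.0, 0.0])

print(cyclic_basis(T, e1))   # [e1, e2]: the cyclic subspace is the xy-plane
```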

Example 4

Let T be the linear operator on P(R) defined by T(f(x)) = f′(x). Then the T-cyclic subspace generated by x² is span({x², 2x, 2}) = P_2(R).
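A short symbolic check of Example 4 (not in the text, and assuming sympy is available) simply differentiates x² repeatedly; the iterates x², 2x, 2 span P_2(R), and every further iterate is 0.

```python
import sympy as sp

x = sp.symbols('x')

# T(f) = f'; starting from x^2, collect the iterates x^2, 2x, 2, 0.
iterates = [x**2]
for _ in range(3):
    iterates.append(sp.diff(iterates[-1], x))

print(iterates)   # [x**2, 2*x, 2, 0]
```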

The existence of a T-invariant subspace provides the opportunity to define a new linear operator whose domain is this subspace. If T is a linear operator on V and W is a T-invariant subspace of V, then the restriction T_W of T to W (see Appendix B) is a mapping from W to W, and it follows that T_W is a linear operator on W (see Exercise 7). As a linear operator, T_W inherits certain properties from its parent operator T. The following result illustrates one way in which the two operators are linked.

Theorem 5.20.

Let T be a linear operator on a finite-dimensional vector space V, and let W be a T-invariant subspace of V. Then the characteristic polynomial of T_W divides the characteristic polynomial of T.

Proof.

Choose an ordered basis γ = {v_1, v_2, …, v_k} for W, and extend it to an ordered basis β = {v_1, v_2, …, v_k, v_{k+1}, …, v_n} for V. Let A = [T]_β and B_1 = [T_W]_γ. Then, by Exercise 12, A can be written in the form

\[
A = \begin{pmatrix} B_1 & B_2 \\ O & B_3 \end{pmatrix}.
\]

Let f(t) be the characteristic polynomial of T and g(t) the characteristic polynomial of T_W. Then

\[
f(t) = \det(A - tI_n) = \det\begin{pmatrix} B_1 - tI_k & B_2 \\ O & B_3 - tI_{n-k} \end{pmatrix} = g(t) \cdot \det(B_3 - tI_{n-k})
\]

by Exercise 21 of Section 4.3. Thus g(t) divides f(t).

Example 5

Let T be the linear operator on R⁴ defined by

T(a, b, c, d) = (a + b + 2c − d, b + d, 2c − d, c + d),

and let W = {(t, s, 0, 0): t, s ∈ R}. Observe that W is a T-invariant subspace of R⁴ because, for any vector (a, b, 0, 0) ∈ W,

T(a, b, 0, 0) = (a + b, b, 0, 0) ∈ W.

Let γ = {e_1, e_2}, which is an ordered basis for W. Extend γ to the standard ordered basis β for R⁴. Then

\[
B_1 = [T_W]_\gamma = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}
\qquad\text{and}\qquad
A = [T]_\beta = \begin{pmatrix} 1 & 1 & 2 & -1 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 2 & -1 \\ 0 & 0 & 1 & 1 \end{pmatrix}
\]

in the notation of Theorem 5.20. Let f(t) be the characteristic polynomial of T and g(t) be the characteristic polynomial of T_W. Then

\[
\begin{aligned}
f(t) = \det(A - tI_4) &= \det\begin{pmatrix} 1-t & 1 & 2 & -1 \\ 0 & 1-t & 0 & 1 \\ 0 & 0 & 2-t & -1 \\ 0 & 0 & 1 & 1-t \end{pmatrix} \\
&= \det\begin{pmatrix} 1-t & 1 \\ 0 & 1-t \end{pmatrix} \cdot \det\begin{pmatrix} 2-t & -1 \\ 1 & 1-t \end{pmatrix}
= g(t) \cdot \det\begin{pmatrix} 2-t & -1 \\ 1 & 1-t \end{pmatrix}.
\end{aligned}
\]
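The factorization in Example 5 can also be confirmed numerically. The sketch below (not part of the text) uses np.poly, which returns the coefficients of det(tI − M); since multiplying polynomials corresponds to convolving coefficient vectors, the coefficients of the characteristic polynomial of A should be the convolution of those of B_1 and B_3.

```python
import numpy as np

# [T]_beta and its diagonal blocks from Example 5.
A = np.array([[1, 1, 2, -1],
              [0, 1, 0,  1],
              [0, 0, 2, -1],
              [0, 0, 1,  1]], dtype=float)
B1 = A[:2, :2]   # [T_W]_gamma
B3 = A[2:, 2:]

# The characteristic polynomial of the block upper triangular matrix A factors
# as the product of the characteristic polynomials of B1 and B3.
print(np.allclose(np.poly(A), np.convolve(np.poly(B1), np.poly(B3))))   # True
```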

In view of Theorem 5.20, we may use the characteristic polynomial of T_W to gain information about the characteristic polynomial of T itself. In this regard, cyclic subspaces are useful because the characteristic polynomial of the restriction of a linear operator T to a cyclic subspace is readily computable.

Theorem 5.21.

Let T be a linear operator on a finite-dimensional vector space V, and let W denote the T-cyclic subspace of V generated by a nonzero vector v ∈ V. Let k = dim(W). Then

  1. (a) {v, T(v), T²(v), …, T^{k−1}(v)} is a basis for W.

  2. (b) If a_0 v + a_1 T(v) + ⋯ + a_{k−1}T^{k−1}(v) + T^k(v) = 0, then the characteristic polynomial of T_W is f(t) = (−1)^k(a_0 + a_1 t + ⋯ + a_{k−1}t^{k−1} + t^k).

Proof.

(a) Since v ≠ 0, the set {v} is linearly independent. Let j be the largest positive integer for which

β = {v, T(v), …, T^{j−1}(v)}

is linearly independent. Such a j must exist because V is finite-dimensional. Let Z = span(β). Then β is a basis for Z. Furthermore, T^j(v) ∈ Z by Theorem 1.7 (p. 40). We use this information to show that Z is a T-invariant subspace of V. Let w ∈ Z. Since w is a linear combination of the vectors of β, there exist scalars b_0, b_1, …, b_{j−1} such that

w = b_0 v + b_1 T(v) + ⋯ + b_{j−1}T^{j−1}(v),

and hence

T(w) = b_0 T(v) + b_1 T²(v) + ⋯ + b_{j−1}T^j(v).

Thus T(w) is a linear combination of vectors in Z, and hence belongs to Z. So Z is T-invariant. Furthermore, v ∈ Z. By Exercise 11, W is the smallest T-invariant subspace of V that contains v, so that W ⊆ Z. Clearly, Z ⊆ W, and so we conclude that Z = W. It follows that β is a basis for W, and therefore dim(W) = j. Thus j = k. This proves (a).

(b) Now view β (from (a)) as an ordered basis for W. Let a_0, a_1, …, a_{k−1} be the scalars such that

a_0 v + a_1 T(v) + ⋯ + a_{k−1}T^{k−1}(v) + T^k(v) = 0.

Observe that

\[
[T_W]_\beta = \begin{pmatrix}
0 & 0 & \cdots & 0 & -a_0 \\
1 & 0 & \cdots & 0 & -a_1 \\
\vdots & \vdots & & \vdots & \vdots \\
0 & 0 & \cdots & 1 & -a_{k-1}
\end{pmatrix},
\]

which has the characteristic polynomial

f(t) = (−1)^k(a_0 + a_1 t + ⋯ + a_{k−1}t^{k−1} + t^k)

by Exercise 19. Thus f(t) is the characteristic polynomial of T_W, proving (b).
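The matrix appearing in this proof is a companion matrix, and the fact cited from Exercise 19 is easy to test numerically. The sketch below (not from the text) builds the matrix for arbitrarily chosen scalars a_0, …, a_{k−1} and compares its characteristic polynomial, computed by np.poly as det(tI − M), with t^k + a_{k−1}t^{k−1} + ⋯ + a_0, which is (−1)^k f(t).

```python
import numpy as np

a = np.array([5.0, -2.0, 3.0, 1.0])   # a_0, a_1, a_2, a_3 (illustrative values), so k = 4
k = len(a)

# [T_W]_beta from the proof: 1's on the subdiagonal, last column -a_0, ..., -a_{k-1}.
M = np.zeros((k, k))
M[1:, :-1] = np.eye(k - 1)
M[:, -1] = -a

# np.poly(M) lists the coefficients of det(tI - M), highest power first:
# expected [1, a_{k-1}, ..., a_1, a_0].
print(np.allclose(np.poly(M), np.concatenate(([1.0], a[::-1]))))   # True
```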

Example 6

Let T be the linear operator of Example 3, and let W = span({e_1, e_2}), the T-cyclic subspace generated by e_1. We compute the characteristic polynomial f(t) of T_W in two ways: by means of Theorem 5.21 and by means of determinants.

(a) By means of Theorem 5.21. From Example 3, we have that {e_1, e_2} generates W and that T²(e_1) = −e_1. Hence

1·e_1 + 0·T(e_1) + T²(e_1) = 0.

Therefore, by Theorem 5.21(b),

f(t) = (−1)²(1 + 0·t + t²) = t² + 1.

(b) By means of determinants. Let β = {e_1, e_2}, which is an ordered basis for W. Since T(e_1) = e_2 and T(e_2) = −e_1, we have

\[
[T_W]_\beta = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}
\]

and therefore,

\[
f(t) = \det\begin{pmatrix} -t & -1 \\ 1 & -t \end{pmatrix} = t^2 + 1.
\]
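As a further numerical cross-check of Example 6 (not in the text), np.poly applied to the matrix in part (b) returns the coefficients of t² + 1, up to rounding.

```python
import numpy as np

M = np.array([[0.0, -1.0],
              [1.0,  0.0]])   # [T_W]_beta from Example 6(b)

# Coefficients of det(tI - M), highest power first: t^2 + 0*t + 1.
print(np.poly(M))   # approximately [1., 0., 1.]
```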

The Cayley-Hamilton Theorem

As an illustration of the importance of Theorem 5.21, we prove a well-known result that is used in Chapter 7. The reader should refer to Appendix E for the definition of f(T), where T is a linear operator and f(x) is a polynomial.

Theorem 5.22. (Cayley-Hamilton)

Let T be a linear operator on a finite-dimensional vector space V, and let f(t) be the characteristic polynomial of T. Then f(T) = T_0, the zero transformation. That is, T “satisfies” its characteristic equation.

Proof.

We show that f(T)(v) = 0 for all v ∈ V. This is obvious if v = 0 because f(T) is linear; so suppose that v ≠ 0. Let W be the T-cyclic subspace generated by v, and suppose that dim(W) = k. By Theorem 5.21(a), there exist scalars a_0, a_1, …, a_{k−1} such that

a_0 v + a_1 T(v) + ⋯ + a_{k−1}T^{k−1}(v) + T^k(v) = 0.

Hence Theorem 5.21(b) implies that

g(t) = (−1)^k(a_0 + a_1 t + ⋯ + a_{k−1}t^{k−1} + t^k)

is the characteristic polynomial of T_W. Combining these two equations yields

g(T)(v) = (−1)^k(a_0 I + a_1 T + ⋯ + a_{k−1}T^{k−1} + T^k)(v) = 0.

By Theorem 5.20, g(t) divides f(t); hence there exists a polynomial q(t) such that f(t) = q(t)g(t). So

f(T)(v) = q(T)g(T)(v) = q(T)(g(T)(v)) = q(T)(0) = 0.

Example 7

Let T be the linear operator on R² defined by T(a, b) = (a + 2b, −2a + b), and let β = {e_1, e_2}. Then

\[
A = \begin{pmatrix} 1 & 2 \\ -2 & 1 \end{pmatrix},
\]

where A = [T]_β. The characteristic polynomial of T is, therefore,

\[
f(t) = \det(A - tI) = \det\begin{pmatrix} 1-t & 2 \\ -2 & 1-t \end{pmatrix} = t^2 - 2t + 5.
\]

It is easily verified that T_0 = f(T) = T² − 2T + 5I. Similarly,

\[
f(A) = A^2 - 2A + 5I = \begin{pmatrix} -3 & 4 \\ -4 & -3 \end{pmatrix} + \begin{pmatrix} -2 & -4 \\ 4 & -2 \end{pmatrix} + \begin{pmatrix} 5 & 0 \\ 0 & 5 \end{pmatrix} = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}.
\]
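The computation in Example 7 can be reproduced numerically. The sketch below (not part of the text) evaluates the matrix polynomial f(A) = A² − 2A + 5I by Horner's rule, using the coefficients [1, −2, 5] returned by np.poly (for a 2 × 2 matrix, det(tI − A) and det(A − tI) agree).

```python
import numpy as np

A = np.array([[ 1.0, 2.0],
              [-2.0, 1.0]])

coeffs = np.poly(A)        # [1., -2., 5.], the coefficients of t^2 - 2t + 5

# Horner evaluation of f(A) = A^2 - 2A + 5I as a matrix polynomial.
f_of_A = np.zeros_like(A)
for c in coeffs:
    f_of_A = f_of_A @ A + c * np.eye(2)

print(f_of_A)              # the 2x2 zero matrix, up to rounding
```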

Example 7 suggests the following result.

Corollary (Cayley-Hamilton Theorem for Matrices).

Let A be an n × n matrix, and let f(t) be the characteristic polynomial of A. Then f(A) = O, the n × n zero matrix.

Proof.

See Exercise 15.

Invariant Subspaces and Direct Sums

It is useful to decompose a finite-dimensional vector space V into a direct sum of as many T-invariant subspaces as possible because the behavior of T on V can be inferred from its behavior on the direct summands. For example, T is diagonalizable if and only if V can be decomposed into a direct sum of one-dimensional T-invariant subspaces (see Exercise 35). In Chapter 7, we consider alternate ways of decomposing V into direct sums of T-invariant subspaces if T is not diagonalizable. We proceed to gather a few facts about direct sums of T-invariant subspaces that are used in Section 7.4. The first of these facts is about characteristic polynomials.

Theorem 5.23.

Let T be a linear operator on a finite-dimensional vector space V, and suppose that V = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_k, where W_i is a T-invariant subspace of V for each i (1 ≤ i ≤ k). Suppose that f_i(t) is the characteristic polynomial of T_{W_i} (1 ≤ i ≤ k). Then f_1(t)·f_2(t)·⋯·f_k(t) is the characteristic polynomial of T.

Proof.

The proof is by mathematical induction on k. In what follows, f(t) denotes the characteristic polynomial of T. Suppose first that k = 2. Let β_1 be an ordered basis for W_1, β_2 an ordered basis for W_2, and β = β_1 ∪ β_2. Then β is an ordered basis for V by Theorem 5.9(d) (p. 275). Let A = [T]_β, B_1 = [T_{W_1}]_{β_1}, and B_2 = [T_{W_2}]_{β_2}. By Exercise 33, it follows that

\[
A = \begin{pmatrix} B_1 & O \\ O' & B_2 \end{pmatrix},
\]

where O and O′ are zero matrices of the appropriate sizes. Then

f(t) = det(A − tI) = det(B_1 − tI)·det(B_2 − tI) = f_1(t)·f_2(t)

as in the proof of Theorem 5.20, proving the result for k = 2.

Now assume that the theorem is valid for k − 1 summands, where k − 1 ≥ 2, and suppose that V is a direct sum of k subspaces, say,

V = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_k.

Let W = W_1 + W_2 + ⋯ + W_{k−1}. It is easily verified that W is T-invariant and that V = W ⊕ W_k. So by the case k = 2, f(t) = g(t)·f_k(t), where g(t) is the characteristic polynomial of T_W. Clearly W = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_{k−1}, and therefore g(t) = f_1(t)·f_2(t)·⋯·f_{k−1}(t) by the induction hypothesis. We conclude that f(t) = g(t)·f_k(t) = f_1(t)·f_2(t)·⋯·f_k(t).

As an illustration of this result, suppose that T is a diagonalizable linear operator on a finite-dimensional vector space V with distinct eigenvalues λ_1, λ_2, …, λ_k. By Theorem 5.10 (p. 277), V is a direct sum of the eigenspaces of T. Since each eigenspace is T-invariant, we may view this situation in the context of Theorem 5.23. For each eigenvalue λ_i, the restriction of T to E_{λ_i} has characteristic polynomial (λ_i − t)^{m_i}, where m_i is the dimension of E_{λ_i}. By Theorem 5.23, the characteristic polynomial f(t) of T is the product

f(t) = (λ_1 − t)^{m_1}(λ_2 − t)^{m_2} ⋯ (λ_k − t)^{m_k}.

It follows that the multiplicity of each eigenvalue is equal to the dimension of the corresponding eigenspace, as expected.

Example 8

Let T be the linear operator on R⁴ defined by

T(a, b, c, d) = (2a − b, a + b, c − d, c + d),

and let W_1 = {(s, t, 0, 0): s, t ∈ R} and W_2 = {(0, 0, s, t): s, t ∈ R}. Notice that W_1 and W_2 are each T-invariant and that R⁴ = W_1 ⊕ W_2. Let β_1 = {e_1, e_2}, β_2 = {e_3, e_4}, and β = β_1 ∪ β_2 = {e_1, e_2, e_3, e_4}. Then β_1 is an ordered basis for W_1, β_2 is an ordered basis for W_2, and β is an ordered basis for R⁴. Let A = [T]_β, B_1 = [T_{W_1}]_{β_1}, and B_2 = [T_{W_2}]_{β_2}. Then

\[
B_1 = \begin{pmatrix} 2 & -1 \\ 1 & 1 \end{pmatrix}, \qquad B_2 = \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix},
\]

and

\[
A = \begin{pmatrix} B_1 & O \\ O & B_2 \end{pmatrix} = \begin{pmatrix} 2 & -1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & -1 \\ 0 & 0 & 1 & 1 \end{pmatrix}.
\]

Let f(t), f_1(t), and f_2(t) denote the characteristic polynomials of T, T_{W_1}, and T_{W_2}, respectively. Then

f(t) = det(A − tI) = det(B_1 − tI)·det(B_2 − tI) = f_1(t)·f_2(t).
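A numerical version of this check (not in the text) assembles A as a block diagonal matrix and compares coefficient vectors, again using the fact that polynomial multiplication corresponds to convolution of coefficients.

```python
import numpy as np

B1 = np.array([[2.0, -1.0],
               [1.0,  1.0]])
B2 = np.array([[1.0, -1.0],
               [1.0,  1.0]])

# A = B1 (+) B2, the block diagonal matrix of Example 8.
A = np.block([[B1, np.zeros((2, 2))],
              [np.zeros((2, 2)), B2]])

# f(t) = f1(t) * f2(t): compare the coefficients of det(tI - .) on both sides.
print(np.allclose(np.poly(A), np.convolve(np.poly(B1), np.poly(B2))))   # True
```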

The matrix A in Example 8 can be obtained by joining the matrices B_1 and B_2 in the manner explained in the next definition.

Definition.

Let B_1 ∈ M_{m×m}(F), and let B_2 ∈ M_{n×n}(F). We define the direct sum of B_1 and B_2, denoted B_1 ⊕ B_2, as the (m + n) × (m + n) matrix A such that

\[
A_{ij} = \begin{cases}
(B_1)_{ij} & \text{for } 1 \le i, j \le m \\
(B_2)_{(i-m),(j-m)} & \text{for } m+1 \le i, j \le n+m \\
0 & \text{otherwise.}
\end{cases}
\]

If B_1, B_2, …, B_k are square matrices with entries from F, then we define the direct sum of B_1, B_2, …, B_k recursively by

B_1 ⊕ B_2 ⊕ ⋯ ⊕ B_k = (B_1 ⊕ B_2 ⊕ ⋯ ⊕ B_{k−1}) ⊕ B_k.

If A = B_1 ⊕ B_2 ⊕ ⋯ ⊕ B_k, then we often write

\[
A = \begin{pmatrix}
B_1 & O & \cdots & O \\
O & B_2 & \cdots & O \\
\vdots & \vdots & & \vdots \\
O & O & \cdots & B_k
\end{pmatrix}.
\]

Example 9

Let

\[
B_1 = \begin{pmatrix} 1 & 2 \\ 1 & 1 \end{pmatrix}, \qquad B_2 = \begin{pmatrix} 3 \end{pmatrix}, \qquad\text{and}\qquad B_3 = \begin{pmatrix} 1 & 2 & 1 \\ 1 & 2 & 3 \\ 1 & 1 & 1 \end{pmatrix}.
\]

Then

\[
B_1 \oplus B_2 \oplus B_3 = \begin{pmatrix}
1 & 2 & 0 & 0 & 0 & 0 \\
1 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 3 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 2 & 1 \\
0 & 0 & 0 & 1 & 2 & 3 \\
0 & 0 & 0 & 1 & 1 & 1
\end{pmatrix}.
\]
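For computations, the direct sum of matrices is exactly what scipy.linalg.block_diag produces. The sketch below (not part of the text) rebuilds the matrix of Example 9 from the blocks as reconstructed above.

```python
import numpy as np
from scipy.linalg import block_diag

B1 = np.array([[1, 2],
               [1, 1]])
B2 = np.array([[3]])
B3 = np.array([[1, 2, 1],
               [1, 2, 3],
               [1, 1, 1]])

# block_diag places its arguments along the diagonal and fills in zeros,
# which is the direct sum B1 (+) B2 (+) B3.
A = block_diag(B1, B2, B3)
print(A)
```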

The final result of this section relates direct sums of matrices to direct sums of invariant subspaces. It is an extension of Exercise 33 to the case k ≥ 2.

Theorem 5.24.

Let T be a linear operator on a finite-dimensional vector space V, and let W_1, W_2, …, W_k be T-invariant subspaces of V such that V = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_k. For each i, let β_i be an ordered basis for W_i, and let β = β_1 ∪ β_2 ∪ ⋯ ∪ β_k. Let A = [T]_β and B_i = [T_{W_i}]_{β_i} for i = 1, 2, …, k. Then A = B_1 ⊕ B_2 ⊕ ⋯ ⊕ B_k.

Proof.

See Exercise 34.

Exercises

  1. Label the following statements as true or false.

    1. (a) There exists a linear operator T with no T-invariant subspace.

    2. (b) If T is a linear operator on a finite-dimensional vector space V and W is a T-invariant subspace of V, then the characteristic polynomial of T_W divides the characteristic polynomial of T.

    3. (c) Let T be a linear operator on a finite-dimensional vector space V, and let v and w be in V. If W is the T-cyclic subspace generated by v, W′ is the T-cyclic subspace generated by w, and W = W′, then v = w.

    4. (d) If T is a linear operator on a finite-dimensional vector space V, then for any v ∈ V the T-cyclic subspace generated by v is the same as the T-cyclic subspace generated by T(v).

    5. (e) Let T be a linear operator on an n-dimensional vector space. Then there exists a polynomial g(t) of degree n such that g(T) = T_0.

    6. (f) Any polynomial of degree n with leading coefficient (−1)^n is the characteristic polynomial of some linear operator.

    7. (g) If T is a linear operator on a finite-dimensional vector space V, and if V is the direct sum of k T-invariant subspaces, then there is an ordered basis β for V such that [T]_β is a direct sum of k matrices.

  2. For each of the following linear operators T on the vector space V, determine whether the given subspace W is a T-invariant subspace of V.

    1. (a) V = P_3(R), T(f(x)) = f′(x), and W = P_2(R)

    2. (b) V = P(R), T(f(x)) = xf(x), and W = P_2(R)

    3. (c) V = R³, T(a, b, c) = (a + b + c, a + b + c, a + b + c), and W = {(t, t, t): t ∈ R}

    4. (d) V = C([0, 1]), T(f(t)) = [∫_0^1 f(x) dx]·t, and W = {f ∈ V: f(t) = at + b for some a and b}

    5. (f) V = M_{2×2}(R), T(A) = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}A, and W = {A ∈ V: Aᵗ = A}

  3. Let T be a linear operator on a finite-dimensional vector space V. Prove that the following subspaces are T-invariant.

    1. (a) {0} and V

    2. (b) N(T) and R(T)

    3. (c) E_λ, for any eigenvalue λ of T

  4. Let T be a linear operator on a vector space V, and let W be a T-invariant subspace of V. Prove that W is g(T)-invariant for any polynomial g(t).

  5. Let T be a linear operator on a vector space V. Prove that the intersection of any collection of T-invariant subspaces of V is a T-invariant subspace of V.

  6. For each linear operator T on the vector space V, find an ordered basis for the T-cyclic subspace generated by the vector z.

    1. (a) V = R⁴, T(a, b, c, d) = (a + b, b − c, a + c, a + d), and z = e_1.

    2. (b) V = P_3(R), T(f(x)) = f″(x), and z = x³.

    3. (c) V = M_{2×2}(R), T(A) = Aᵗ, and z = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.

    4. (d) V = M_{2×2}(R), T(A) = \begin{pmatrix} 1 & 1 \\ 2 & 2 \end{pmatrix}A, and z = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.

  7. Prove that the restriction of a linear operator T to a T-invariant subspace is a linear operator on that subspace.

  8. Let T be a linear operator on a vector space with a T-invariant subspace W. Prove that if v is an eigenvector of T_W with corresponding eigenvalue λ, then v is also an eigenvector of T with corresponding eigenvalue λ.

  9. For each linear operator T and cyclic subspace W in Exercise 6, compute the characteristic polynomial of T_W in two ways, as in Example 6.

  10. For each linear operator in Exercise 6, find the characteristic polynomial f(t) of T, and verify that the characteristic polynomial of T_W (computed in Exercise 9) divides f(t).

  11. Let T be a linear operator on a vector space V, let v be a nonzero vector in V, and let W be the T-cyclic subspace of V generated by v. Prove that

    1. (a) W is T-invariant.

    2. (b) Any T-invariant subspace of V containing v also contains W.

  12. Prove that A = \begin{pmatrix} B_1 & B_2 \\ O & B_3 \end{pmatrix} in the proof of Theorem 5.20.

  13. Let T be a linear operator on a vector space V, let v be a nonzero vector in V, and let W be the T-cyclic subspace of V generated by v. For any w ∈ V, prove that w ∈ W if and only if there exists a polynomial g(t) such that w = g(T)(v).

  14. Prove that the polynomial g(t) of Exercise 13 can always be chosen so that its degree is less than dim(W).

  15. Use the Cayley-Hamilton theorem (Theorem 5.22) to prove its corollary for matrices. Warning: If f(t) = det(A − tI) is the characteristic polynomial of A, it is tempting to “prove” that f(A) = O by saying “f(A) = det(A − AI) = det(O) = 0.” Why is this argument incorrect? Visit goo.gl/ZMVn9i for a solution.

  16. Let T be a linear operator on a finite-dimensional vector space V.

    1. (a) Prove that if the characteristic polynomial of T splits, then so does the characteristic polynomial of the restriction of T to any T-invariant subspace of V.

    2. (b) Deduce that if the characteristic polynomial of T splits, then any nontrivial T-invariant subspace of V contains an eigenvector of T.

  17. Let A be an n × n matrix. Prove that

     dim(span({I_n, A, A², …})) ≤ n.
  18. Let A be an n × n matrix with characteristic polynomial

     f(t) = (−1)^n t^n + a_{n−1}t^{n−1} + ⋯ + a_1 t + a_0.
    1. (a) Prove that A is invertible if and only if a_0 ≠ 0.

    2. (b) Prove that if A is invertible, then

       A⁻¹ = (−1/a_0)[(−1)^n A^{n−1} + a_{n−1}A^{n−2} + ⋯ + a_1 I_n].
    3. (c) Use (b) to compute A⁻¹ for A = \begin{pmatrix} 1 & 2 & 1 \\ 0 & 2 & 3 \\ 0 & 0 & 1 \end{pmatrix}.
  19. Let A denote the k × k matrix

     \[
     \begin{pmatrix}
     0 & 0 & \cdots & 0 & -a_0 \\
     1 & 0 & \cdots & 0 & -a_1 \\
     0 & 1 & \cdots & 0 & -a_2 \\
     \vdots & \vdots & & \vdots & \vdots \\
     0 & 0 & \cdots & 1 & -a_{k-1}
     \end{pmatrix},
     \]

    where a_0, a_1, …, a_{k−1} are arbitrary scalars. Prove that the characteristic polynomial of A is

    (−1)^k(a_0 + a_1 t + ⋯ + a_{k−1}t^{k−1} + t^k).

    Hint: Use mathematical induction on k, computing the determinant by cofactor expansion along the first row.

  20. Let T be a linear operator on a vector space V, and suppose that V is a T-cyclic subspace of itself. Prove that if U is a linear operator on V, then UT = TU if and only if U = g(T) for some polynomial g(t). Hint: Suppose that V is generated by v. Choose g(t) according to Exercise 13 so that g(T)(v) = U(v).

  21. Let T be a linear operator on a two-dimensional vector space V. Prove that either V is a T-cyclic subspace of itself or T = cI for some scalar c.

  22. Let T be a linear operator on a two-dimensional vector space V and suppose that T ≠ cI for any scalar c. Show that if U is any linear operator on V such that UT = TU, then U = g(T) for some polynomial g(t).

  23. Let T be a linear operator on a finite-dimensional vector space V, and let W be a T-invariant subspace of V. Suppose that v_1, v_2, …, v_k are eigenvectors of T corresponding to distinct eigenvalues. Prove that if v_1 + v_2 + ⋯ + v_k is in W, then v_i ∈ W for all i. Hint: Use mathematical induction on k.

  24. Prove that the restriction of a diagonalizable linear operator T to any nontrivial T-invariant subspace is also diagonalizable. Hint: Use the result of Exercise 23.

  25. (a) Prove the converse to Exercise 19(a) of Section 5.2: If T and U are diagonalizable linear operators on a finite-dimensional vector space V such that UT = TU, then T and U are simultaneously diagonalizable. (See the definitions in the exercises of Section 5.2.) Hint: For any eigenvalue λ of T, show that E_λ is U-invariant, and apply Exercise 24 to obtain a basis for E_λ of eigenvectors of U.

      (b) State and prove a matrix version of (a).

  26. Let T be a linear operator on an n-dimensional vector space V such that T has n distinct eigenvalues. Prove that V is a T-cyclic subspace of itself. Hint: Use Exercise 23 to find a vector v such that {v, T(v), …, T^{n−1}(v)} is linearly independent.

Exercises 27 through 31 require familiarity with quotient spaces as defined in Exercise 31 of Section 1.3. Before attempting these exercises, the reader should first review the other exercises treating quotient spaces: Exercise 35 of Section 1.6, Exercise 42 of Section 2.1, and Exercise 24 of Section 2.4.

For the purposes of Exercises 27 through 31, T is a fixed linear operator on a finite-dimensional vector space V, and W is a nonzero T-invariant subspace of V. We require the following definition.

Definition.

Let T be a linear operator on a vector space V, and let W be a T-invariant subspace of V. Define T̄: V/W → V/W by

T̄(v + W) = T(v) + W for any v + W ∈ V/W.
  27. (a) Prove that T̄ is well defined. That is, show that T̄(v + W) = T̄(v′ + W) whenever v + W = v′ + W.

      (b) Prove that T̄ is a linear operator on V/W.

      (c) Let η: V → V/W be the linear transformation defined in Exercise 42 of Section 2.1 by η(v) = v + W. Show that the diagram of Figure 5.6 commutes; that is, prove that ηT = T̄η. (This exercise does not require the assumption that V is finite-dimensional.)

    Figure 5.6: the commutative diagram ηT = T̄η (T acting on V, T̄ acting on V/W, and η: V → V/W on both sides).

  28. Let f(t), g(t), and h(t) be the characteristic polynomials of T, T_W, and T̄, respectively. Prove that f(t) = g(t)h(t). Hint: Extend an ordered basis γ = {v_1, v_2, …, v_k} for W to an ordered basis β = {v_1, v_2, …, v_k, v_{k+1}, …, v_n} for V. Then show that the collection of cosets α = {v_{k+1} + W, v_{k+2} + W, …, v_n + W} is an ordered basis for V/W, and prove that

     \[
     [T]_\beta = \begin{pmatrix} B_1 & B_2 \\ O & B_3 \end{pmatrix},
     \]

     where B_1 = [T_W]_γ and B_3 = [T̄]_α.

  29. Use the hint in Exercise 28 to prove that if T is diagonalizable, then so is T̄.

  30. Prove that if both T_W and T̄ are diagonalizable and have no common eigenvalues, then T is diagonalizable.

The results of Theorem 5.21 and Exercise 28 are useful in devising methods for computing characteristic polynomials without the use of determinants. This is illustrated in the next exercise.

  31. Let A = \begin{pmatrix} 1 & 1 & 3 \\ 2 & 3 & 4 \\ 1 & 2 & 1 \end{pmatrix}, let T = L_A, and let W be the cyclic subspace of R³ generated by e_1.

    1. (a) Use Theorem 5.21 to compute the characteristic polynomial of T_W.

    2. (b) Show that {e_2 + W} is a basis for R³/W, and use this fact to compute the characteristic polynomial of T̄.

    3. (c) Use the results of (a) and (b) to find the characteristic polynomial of A.

Exercises 32 through 39 are concerned with direct sums.

  32. Let T be a linear operator on a vector space V, and let W_1, W_2, …, W_k be T-invariant subspaces of V. Prove that W_1 + W_2 + ⋯ + W_k is also a T-invariant subspace of V.

  33. Give a direct proof of Theorem 5.24 for the case k = 2. (This result is used in the proof of Theorem 5.23.)

  34. Prove Theorem 5.24. Hint: Begin with Exercise 33 and extend it using mathematical induction on k, the number of subspaces.

  35. Let T be a linear operator on a finite-dimensional vector space V. Prove that T is diagonalizable if and only if V is the direct sum of one-dimensional T-invariant subspaces.

  36. Let T be a linear operator on a finite-dimensional vector space V, and let W_1, W_2, …, W_k be T-invariant subspaces of V such that V = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_k. Prove that

     det(T) = det(T_{W_1})·det(T_{W_2})·⋯·det(T_{W_k}).
  37. Let T be a linear operator on a finite-dimensional vector space V, and let W_1, W_2, …, W_k be T-invariant subspaces of V such that V = W_1 ⊕ W_2 ⊕ ⋯ ⊕ W_k. Prove that T is diagonalizable if and only if T_{W_i} is diagonalizable for all i.

  38. Let C be a collection of diagonalizable linear operators on a finite-dimensional vector space V. Prove that there is an ordered basis β such that [T]_β is a diagonal matrix for all T ∈ C if and only if the operators of C commute under composition. (This is an extension of Exercise 25.) Hints for the case that the operators commute: The result is trivial if each operator has only one eigenvalue. Otherwise, establish the general result by mathematical induction on dim(V), using the fact that V is the direct sum of the eigenspaces of some operator in C that has more than one eigenvalue.

  39. Let B_1, B_2, …, B_k be square matrices with entries in the same field, and let A = B_1 ⊕ B_2 ⊕ ⋯ ⊕ B_k. Prove that the characteristic polynomial of A is the product of the characteristic polynomials of the B_i's.

  40. Let

     \[
     A = \begin{pmatrix}
     1 & 2 & \cdots & n \\
     n+1 & n+2 & \cdots & 2n \\
     \vdots & \vdots & & \vdots \\
     n^2-n+1 & n^2-n+2 & \cdots & n^2
     \end{pmatrix}.
     \]

     Find the characteristic polynomial of A. Hint: First prove that A has rank 2 and that span({(1, 1, …, 1), (1, 2, …, n)}) is L_A-invariant.

  41. Let A ∈ M_{n×n}(R) be the matrix defined by A_{ij} = 1 for all i and j. Find the characteristic polynomial of A.
