7.3 The Minimal Polynomial

The Cayley-Hamilton theorem (Theorem 5.22 p. 315) tells us that for any linear operator T on an n-dimensional vector space, there is a polynomial f(t) of degree n such that f(T)=T0, namely, the characteristic polynomial of T. Hence there is a polynomial of least degree with this property, and this degree is at most n. If g(t) is such a polynomial, we can divide g(t) by its leading coefficient to obtain another polynomial p(t) of the same degree with leading coefficient 1, that is, p(t) is a monic polynomial. (See Appendix E.)

Definition.

Let T be a linear operator on a finite-dimensional vector space. A polynomial p(t) is called a minimal polynomial of T if p(t) is a monic polynomial of least positive degree for which p(T)=T0.

The preceding discussion shows that every linear operator on a finite-dimensional vector space has a minimal polynomial. The next result shows that it is unique, and hence we can speak of the minimal polynomial of T.

Theorem 7.12.

Let p(t) be a minimal polynomial of a linear operator T on a finite-dimensional vector space V.

  1. (a) For any polynomial g(t), if g(T)=T0, then p(t) divides g(t). In particular, p(t) divides the characteristic polynomial of T.

  2. (b) The minimal polynomial of T is unique.

Proof.

(a) Let g(t) be a polynomial for which g(T)=T0. By the division algorithm for polynomials (Theorem E.1 of Appendix E, p. 555), there exist polynomials q(t) and r(t) such that

g(t)=q(t)p(t)+r(t),(1)

where r(t) has degree less than the degree of p(t). Substituting T into (1) and using that g(T)=p(T)=T0, we have r(T)=T0. Since r(t) has degree less than p(t) and p(t) is the minimal polynomial of T, r(t) must be the zero polynomial. Thus (1) simplifies to g(t)=q(t)p(t), proving (a).

(b) Suppose that p1(t) and p2(t) are each minimal polynomials of T. Then p1(t) divides p2(t) by (a). Since p1(t) and p2(t) have the same degree, we have that p2(t)=cp1(t) for some nonzero scalar c. Because p1(t) and p2(t) are monic, c=1; hence p1(t)=p2(t).

The minimal polynomial of a linear operator has an obvious analog for a matrix.

Definition.

Let AMn×n(F). The minimal polynomial p(t) of A is the monic polynomial of least positive degree for which p(A)=O.

The following results are now immediate.

Theorem 7.13.

Let T be a linear operator on a finite-dimensional vector space V, and let β be an ordered basis for V. Then the minimal polynomial of T is the same as the minimal polynomial of [T]β.

Proof.

Exercise.

Corollary.

For any AMn×n(F), the minimal polynomial of A is the same as the minimal polynomial of LA.

Proof.

Exercise.

In view of the preceding theorem and corollary, Theorem 7.12 and all subsequent theorems in this section that are stated for operators are also valid for matrices.

For the remainder of this section, we study primarily minimal polynomials of operators (and hence matrices) whose characteristic polynomials split. A more general treatment of minimal polynomials is given in Section 7.4.

Theorem 7.14.

Let T be a linear operator on a finite-dimensional vector space V, and let p(t) be the minimal polynomial of T. A scalar λ is an eigenvalue of T if and only if p(λ)=0. Hence the characteristic polynomial and the minimal polynomial of T have the same zeros.

Proof.

Let f(t) be the characteristic polynomial of T. Since p(t) divides f(t), there exists a polynomial q(t) such that f(t)=q(t)p(t). If λ is a zero of p(t), then

f(λ)=q(λ)p(λ)=q(λ)0=0.

So λ is a zero of f(t); that is, λ is an eigenvalue of T.

Conversely, suppose that λ is an eigenvalue of T, and let xV be an eigenvector corresponding to λ. By Exercise 22 of Section 5.1, we have

0=T0(x)=p(T)(x)=p(λ)x.

Since x0, it follows that p(λ)=0, and so λ is a zero of p(t).

The following corollary is immediate.

Corollary.

Let T be a linear operator on a finite-dimensional vector space V with minimal polynomial p(t) and characteristic polynomial f(t). Suppose that f(t) factors as

f(t)=(λ1t)n1(λ2t)n2(λkt)nk,

where λ1, λ2, , λk are the distinct eigenvalues of T. Then there exist integers m1, m2, , mk such that 1mini for all i and

p(t)=(tλ1)m1(tλ2)m2(tλk)mk.

Example 1

We compute the minimal polynomial of the matrix

A=(310020112).

Since A has the characteristic polynomial

f(t)=det(3t1002t0112t)=(t2)2(t3),

the minimal polynomial of A must be either (t2)(t3) or (t2)2(t3) by the corollary to Theorem 7.14. Substituting A into p(t)=(t2)(t3), we find that p(A)=O; hence p(t) is the minimal polynomial of A.

Example 2

Let T be the linear operator on R2 defined by

T(a, b)=(2a+5b, 6a+b)

and β be the standard ordered basis for R2. Then

[T]β=(2561),

and hence the characteristic polynomial of T is

f(t)=det(2t561t)=(t7)(t+4).

Thus the minimal polynomial of T is also (t7)(t+4).

Example 3

Let D be the linear operator on P2(R) defined by D(g(x))=g(x), the derivative of g(x). We compute the minimal polynomial of T. Let β be the standard ordered basis for P2(R). Then

[D]β=(010002000),

and it follows that the characteristic polynomial of D is t3. So by the corollary to Theorem 7.14, the minimal polynomial of D is t, t2, or t3. Since D2(x2)=20, it follows that D2T0; hence the minimal polynomial of D must be t3.

In Example 3, it is easily verified that P2(R) is a D-cyclic subspace (of itself). Here the minimal and characteristic polynomials are of the same degree. This is no coincidence.

Theorem 7.15.

Let T be a linear operator on an n-dimensional vector space V such that V is a T-cyclic subspace of itself. Then the characteristic polynomial f(t) and the minimal polynomial p(t) have the same degree, and hence f(t)=(1)np(t).

Proof.

Since V is a T-cyclic space, there exists an xV such that

β={x, T(x), , Tn1(x)}

is a basis for V (Theorem 5.21 p. 314). Let

g(t)=a0+a1t++aktk,

be a polynomial of degree k<n. Then ak0 and

g(T)(x)=a0x+a1T(x)++akTk(x),

and so g(T)(x) is a linear combination of the vectors of β having at least one nonzero coefficient, namely, ak. Since β is linearly independent, it follows that g(T)(x)0; hence g(T)T0. Therefore the minimal polynomial of T has degree n, which is also the degree of the characteristic polynomial of T.

Theorem 7.15 gives a condition under which the degree of the minimal polynomial of an operator is as large as possible. We now investigate the other extreme. By Theorem 7.14, the degree of the minimal polynomial of an operator must be greater than or equal to the number of distinct eigenvalues of the operator. The next result shows that the operators for which the degree of the minimal polynomial is as small as possible are precisely the diagonalizable operators.

Theorem 7.16.

Let T be a linear operator on a finite-dimensional vector space V. Then T is diagonalizable if and only if the minimal polynomial of T is of the form

p(t)=(tλ1)(tλ2)(tλk),

where λ1, λ2, , λk are the distinct eigenvalues of T.

Proof.

Suppose that T is diagonalizable. Let λ1, λ2, , λk be the distinct eigenvalues of T, and define

p(t)=(tλ1)(tλ2)(tλk).

By Theorem 7.14, p(t) divides the minimal polynomial of T. Let β={v1, v2, , vn} be a basis for V consisting of eigenvectors of T, and consider any viβ. Then (TλjI)(vi)=0 for some eigenvalue λj. Since tλj divides p(t), there is a polynomial qj(t) such that p(t)=qj(t)(tλj). Hence

p(T)(vi)=qj(T)(TλjI)(vi)=0.

It follows that p(T)=T0, since p(T) takes each vector in a basis for V into 0. Therefore p(t) is the minimal polynomial of T.

Conversely, suppose that there are distinct scalars λ1, λ2, , λk such that the minimal polynomial p(t) of T factors as

p(t)=(tλ1)(tλ2)(tλk).

By Theorem 7.14, the λi’s are eigenvalues of T. We apply mathematical induction on n=dim(V). Clearly T is diagonalizable for n=1. Now assume that T is diagonalizable whenever dim(V)<n for some n>1, and let dim(V)=n and W=R(TλkI). Obviously WV, because λk is an eigenvalue of T. If W={0}, then T=λkI, which is clearly diagonalizable. So suppose that 0<dim(W)<n. Then W is T-invariant, and for any xW,

(Tλ1I)(Tλ2I)(Tλk1I)(x)=0.

It follows that the minimal polynomial of TW divides the polynomial (tλ1)(tλ2)(tλk1). Hence by the induction hypothesis, TW is diagonalizable. Furthermore, λk is not an eigenvalue of TW by Theorem 7.14. Therefore WN(TλkI)={0}. Now let β1={v1, v2, , vm} be a basis for W consisting of eigenvectors of Tw (and hence of T), and let β2={w1, w2, , wp} be a basis for N(TλkI), the eigenspace of T corresponding to λk. Then β1 and β2 are disjoint by the previous comment. Moreover, m+p=n by the dimension theorem applied to TλkI. We show that β=β1β2 is linearly independent. Consider scalars a1, a2, , am and b1, b2, , bp such that

a1v1+a2v2++amvm+b1w1+b2w2++bpwp=0.

Let

x=i=1maiviandy=i=1pbiwi.

Then xW, yN(TλkI), and x+y=0. It follows that x=yWN(TλkI), and therefore x=0. Since β1 is linearly independent, we have that a1=a2==am=0. Similarly, b1=b2==bp=0, and we conclude that β is a linearly independent subset of V consisting of n eigenvectors. It follows that β is a basis for V consisting of eigenvectors of T, and consequently T is diagonalizable.

In addition to diagonalizable operators, there are methods for determining the minimal polynomial of any linear operator on a finite-dimensional vector space. In the case that the characteristic polynomial of the operator splits, the minimal polynomial can be described using the Jordan canonical form of the operator. (See Exercise 13.) In the case that the characteristic polynomial does not split, the minimal polynomial can be described using the rational canonical form, which we study in the next section. (See Exercise 7 of Section 7.4.)

Example 4

We determine all matrices AM2×2(R) for which A23A+2I=O. Let g(t)=t23t+2=(t1)(t2). Since g(A)=O, the minimal polynomial p(t) of A divides g(t). Hence the only possible candidates for p(t) are t1, t2, and (t1)(t2). If p(t)=t1 or p(t)=t2, then A=I or A=2I, respectively. If p(t)=(t1)(t2), then A is diagonalizable with eigenvalues 1 and 2, and hence A is similar to

D=(1002),

that is, A=QDQ1 for some invertible matrix Q.

Example 5

Let AMn×n(R) satisfy A3=A. We show that A is diagonalizable. Let g(t)=t3t=t(t+1)(t1). Then g(A)=O, and hence the minimal polynomial p(t) of A divides g(t). Since g(t) has no repeated factors, neither does p(t). Thus A is diagonalizable by Theorem 7.16.

Example 6

In Example 3, we saw that the minimal polynomial of the differential operator D on P2(R) is t3. Hence, by Theorem 7.16, D is not diagonalizable.

Exercises

  1. Label the following statements as true or false. Assume that all vector spaces are finite-dimensional.

    1. (a) Every linear operator T has a polynomial p(t) of largest degree for which p(T)=T0.

    2. (b) Every linear operator has a unique minimal polynomial.

    3. (c) The characteristic polynomial of a linear operator divides the minimal polynomial of that operator.

    4. (d) The minimal and the characteristic polynomials of any diagonalizable operator are equal.

    5. (e) Let T be a linear operator on an n-dimensional vector space V, p(t) be the minimal polynomial of T, and f(t) be the characteristic polynomial of T. Suppose that f(t) splits. Then f(t) divides [p(t)]n.

    6. (f) The minimal polynomial of a linear operator always has the same degree as the characteristic polynomial of the operator.

    7. (g) A linear operator is diagonalizable if its minimal polynomial splits.

    8. (h) Let T be a linear operator on a vector space V such that V is a T-cyclic subspace of itself. Then the degree of the minimal polynomial of T equals dim(V).

    9. (i) Let T be a linear operator on a vector space V such that T has n distinct eigenvalues, where n=dim(V). Then the degree of the minimal polynomial of T equals n.

  2. Find the minimal polynomial of each of the following matrices.

    1. (a) (2112)

    2. (b) (1101)

    3. (c) (4145142164)

    4. (d) (301222101)

  3. For each linear operator T on V, find the minimal polynomial of T.

    1. (a) V=R2 and T(a, b)=(a+b, ab)

    2. (b) V=P2(R) and T(g(x))=g(x)+2g(x)

    3. (c) V=P2(R) and T(f(x))=xf(x)+f(x)+2f(x)

    4. (d) V=Mn×n(R) and T(A)=At Hint: Note that T2=I.

  4. Determine which of the matrices and operators in Exercises 2 and 3 are diagonalizable.

  5. Describe all linear operators T on R2 such that T is diagonalizable and T32T2+T=T0.

  6. Prove Theorem 7.13 and its corollary.

  7. Prove the corollary to Theorem 7.14.

  8. Let T be a linear operator on a finite-dimensional vector space, and let p(t) be the minimal polynomial of T. Prove the following results.

    1. (a) T is invertible if and only if p(0)0.

    2. (b) If T is invertible and p(t)=tm+am1tm1++a1t+a0, then

      T1=1a0(Tm1+am1Tm2++a2T+a1I).
  9. Let T be a diagonalizable linear operator on a finite-dimensional vector space V. Prove that V is a T-cyclic subspace if and only if each of the eigenspaces of T is one-dimensional.

  10. Let T be a linear operator on a finite-dimensional vector space V, and suppose that W is a T-invariant subspace of V. Prove that the minimal polynomial of TW divides the minimal polynomial of T.

  11. Let g(t) be the auxiliary polynomial associated with a homogeneous linear differential equation with constant coefficients (as defined in Section 2.7), and let V denote the solution space of this differential equation. Prove the following results.

    1. (a) V is a D-invariant subspace, where D is the differentiation operator on C.

    2. (b) The minimal polynomial of DV (the restriction of D to V) is g(t).

    3. (c) If the degree of g(t) is n, then the characteristic polynomial of DV is (1)ng(t).

    Hint: Use Theorem 2.32 (p. 135) for (b) and (c).

  12. Let D be the differentiation operator on P(R), the space of polynomials over R. Prove that there exists no polynomial g(t) for which g(D)=T0. Hence D has no minimal polynomial.

  13. Let T be a linear operator on a finite-dimensional vector space, and suppose that the characteristic polynomial of T splits. Let λ1, λ2, , λk be the distinct eigenvalues of T, and for each i let pi be the number of rows in the largest Jordan block corresponding to λi in a Jordan canonical form of T. Prove that the minimal polynomial of T is

    (tλ1)pi(tλ2)p2(tλk)pk.

    The following exercise requires knowledge of direct sums (see Section 5.2).

  14. Let T be a linear operator on a finite-dimensional vector space V, and let W1 and W2 be T-invariant subspaces of V such that V=W1W2. Suppose that p1(t) and p2(t) are the minimal polynomials of TW1 and TW2, respectively. Either prove that the minimal polynomial f(t) of T always equals p1(t)p2(t) or give an example in which f(t)p1(t)p2(t).

Exercise 15 uses the following definition.

Definition.

Let T be a linear operator on a finite-dimensional vector space V, and let x be a nonzero vector in V. The polynomial p(t) is called a T-annihilator of x if p(t) is a monic polynomial of least degree for which p(T)(x)=0.

  1. †Let T be a linear operator on a finite-dimensional vector space V, and let x be a nonzero vector in V. Prove the following results.

    1. (a) The vector x has a unique T-annihilator.

    2. (b) The T-annihilator of x divides any polynomial g(t) for which g(T)=T0.

    3. (c) If p(t) is the T-annihilator of x and W is the T-cyclic subspace generated by x, then p(t) is the minimal polynomial of TW, and dim(W) equals the degree of p(t).

    4. (d) The degree of the T-annihilator of x is 1 if and only if x is an eigenvector of T.

    Visit goo.gl/8KD6Gw for a solution.

  2. Let T be a linear operator on a finite-dimensional vector space V, and let W1 be a T-invariant subspace of V. Let xV such that xW1. Prove the following results.

    1. (a) There exists a unique monic polynomial g1(t) of least positive degree such that g1(T)(x)W1.

    2. (b) If h(t) is a polynomial for which h(T)(x)W1, then g1(t) divides h(t).

    3. (c) g1(t) divides the minimal and the characteristic polynomials of T.

    4. (d) Let W2 be a T-invariant subspace of V such that W2W1, and let g2(t) be the unique monic polynomial of least degree such that g2(T)(x)W2. Then g1(t) divides g2(t).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.223.196.59