Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

8.2 The Jordan Canonical Form

In this section, we will show that any linear operator L on an n-dimensional vector space V can be represented by a block diagonal matrix whose diagonal blocks are simple Jordan matrices. We will apply this result to solving systems of linear differential equations of the form $Y' = AY$ $Y' = AY$ , where A is defective.

Let us begin by considering the case where L has more than one distinct eigenvalue. We wish to show that if L has distinct eigenvalues $λ_{1}, \dots, λ_{k}$ $λ_{1}, \dots, λ_{k}$ , then V can be decomposed into a direct sum of invariant subspaces $S_{1}, \dots, S_{k}$ $S_{1}, \dots, S_{k}$ such that $L - λ_{i} ⌶$ $L - λ_{i} ⌶$ is nilpotent on $S_{i}$ $S_{i}$ for each $i = 1, \dots, k$ $i = 1, \dots, k$ . To do this, we must first prove the following lemma and theorem.

Lemma 8.2.1

If L is a linear operator mapping an n-dimensional vector space V into itself, then there exists a positive integer $k_{0}$ $k_{0}$ such that $ker (L^{k_{0}}) = ker (L^{k_{0} + k})$ $ker (L^{k_{0}}) = ker (L^{k_{0} + k})$ for all $k > 0$ $k > 0$ .

Proof

If $i < j$ $i < j$ , then clearly ker $(L^{i})$ $(L^{i})$ is a subspace of ker $(L^{j})$ $(L^{j})$ . We claim that if ker $(L^{i}) = ker (L^{i + 1})$ $(L^{i}) = ker (L^{i + 1})$ for some i, then ker $(L^{i}) = ker (L^{i + k})$ $(L^{i}) = ker (L^{i + k})$ for all $k \geq 1$ $k \geq 1$ . We will prove this by induction on k. In the case $k = 1$ $k = 1$ , there is nothing to prove. Assume that for some $k > 1$ $k > 1$ the result holds all indices less than k. If $v \in ker (L^{i + k})$ $v \in ker (L^{i + k})$ , then

0 = L^{i + k} (v) = L^{i + k - 1} (L (v))

$0 = L^{i + k} (v) = L^{i + k - 1} (L (v))$

Thus, $L (v) \in ker (L^{i + k - 1})$ $L (v) \in ker (L^{i + k - 1})$ . By the induction hypothesis, $ker (L^{i + k - 1}) = ker (L^{i})$ $ker (L^{i + k - 1}) = ker (L^{i})$ . Therefore, $L (v) \in ker (L^{i})$ $L (v) \in ker (L^{i})$ and hence $v \in ker (L^{i + 1})$ $v \in ker (L^{i + 1})$ . Since $ker (L^{i + 1}) = ker (L^{i})$ $ker (L^{i + 1}) = ker (L^{i})$ , it follows that $v \in ker (L^{i})$ $v \in ker (L^{i})$ and hence $ker (L^{i}) = ker (L^{i + k})$ $ker (L^{i}) = ker (L^{i + k})$ . Thus, if $ker (L^{i + 1}) = ker (L^{i})$ $ker (L^{i + 1}) = ker (L^{i})$ for some i, then

ker (L^{i}) = ker (L^{i + 1}) = ker (L^{i + 1}) = \dots

$ker (L^{i}) = ker (L^{i + 1}) = ker (L^{i + 1}) = \dots$

Since V is finite dimensional, the dimension of $ker (L^{k})$ $ker (L^{k})$ cannot keep increasing as k increases. Thus for some $k_{0}$ $k_{0}$ , we must have dim $(ker (L^{k_{0}})) = dim (ker (L^{k_{0} + 1}))$ $(ker (L^{k_{0}})) = dim (ker (L^{k_{0} + 1}))$ and hence $ker (L^{k_{0}})$ $ker (L^{k_{0}})$ and $ker (L^{k_{0} + 1})$ $ker (L^{k_{0} + 1})$ must be equal. It follows that

ker (L^{k_{0}}) = ker (L^{k_{0} + 1}) = ker (L^{k_{0} + 2}) = \dots

$ker (L^{k_{0}}) = ker (L^{k_{0} + 1}) = ker (L^{k_{0} + 2}) = \dots$

∎

Theorem 8.2.2

If L is a linear transformation on an n-dimensional vector space V, then there exist invariant subspaces X and Y such that $V = X \oplus Y$ $V = X \oplus Y$ , L is nilpotent on X, and $L_{[Y]}$ $L_{[Y]}$ is invertible.

Proof

Choose $k_{0}$ $k_{0}$ to be the smallest positive integer such that $ker (L^{k_{0}}) = ker (L^{k_{0} + 1})$ $ker (L^{k_{0}}) = ker (L^{k_{0} + 1})$ . It follows from Lemma 8.2.1 that $ker (L^{k_{0}}) = ker (L^{k_{0} + j})$ $ker (L^{k_{0}}) = ker (L^{k_{0} + j})$ for all $j \geq 1$ $j \geq 1$ . Let $X = ker (L^{k_{0}})$ $X = ker (L^{k_{0}})$ . Clearly, X is invariant under L for if $x \in X$ $x \in X$ , then $L (x) \in ker (L^{k_{0} - 1})$ $L (x) \in ker (L^{k_{0} - 1})$ , which is a proper subspace of $ker (L^{k_{0}})$ $ker (L^{k_{0}})$ . Let $Y = R (L^{k_{0}})$ $Y = R (L^{k_{0}})$ . If $w \in X \cap Y$ $w \in X \cap Y$ , then ${w = L}^{k_{0}} (v)$ ${w = L}^{k_{0}} (v)$ for some v and hence

0 = L^{k_{0}} (w) = L^{k_{0}} (L^{k_{0}} (v)) = L^{2 k_{0}} (v)

$0 = L^{k_{0}} (w) = L^{k_{0}} (L^{k_{0}} (v)) = L^{2 k_{0}} (v)$

Thus, $v \in ker (L^{2 k_{0}}) = ker (L^{k_{0}})$ $v \in ker (L^{2 k_{0}}) = ker (L^{k_{0}})$ and hence

w = L^{k_{0}} (v) = 0

$w = L^{k_{0}} (v) = 0$

Therefore, $X \cap Y = {0}$ $X \cap Y = {0}$ . We claim $V = X \oplus Y$ $V = X \oplus Y$ . Let ${x_{1}, \dots, x_{r}}$ ${x_{1}, \dots, x_{r}}$ be a basis for X and let ${y_{1}, \dots, y_{n - r}}$ ${y_{1}, \dots, y_{n - r}}$ be a basis for Y. By Lemma 8.2.1, it suffices to show that $x_{1}, \dots, x_{r}, y_{1}, \dots, y_{n - r}$ $x_{1}, \dots, x_{r}, y_{1}, \dots, y_{n - r}$ are linearly independent and hence form a basis for V. If

\begin{matrix} α_{1} x_{1} + \dots + α_{r} x_{r} + β_{1} y_{1} + \dots + β_{n - r} y_{n - r} = 0 \end{matrix}

$\begin{matrix} α_{1} x_{1} + \dots + α_{r} x_{r} + β_{1} y_{1} + \dots + β_{n - r} y_{n - r} = 0 \end{matrix}$ (1)

then applying $L^{k_{0}}$ $L^{k_{0}}$ to both sides gives

\begin{matrix} β_{1} L^{k_{0}} (y_{1}) + \dots + β_{n - r} L^{k_{0}} (y_{n - r}) = 0 \end{matrix}

$\begin{matrix} β_{1} L^{k_{0}} (y_{1}) + \dots + β_{n - r} L^{k_{0}} (y_{n - r}) = 0 \end{matrix}$

\begin{matrix} L^{k_{0}} (β_{1} y_{1} + \dots + β_{n - r} y_{n - r}) = 0 \end{matrix}

$\begin{matrix} L^{k_{0}} (β_{1} y_{1} + \dots + β_{n - r} y_{n - r}) = 0 \end{matrix}$

Therefore, $\begin{matrix} β_{1} y_{1} + \dots + β_{n - r} y_{n - r} \in X \cap Y \end{matrix}$ $\begin{matrix} β_{1} y_{1} + \dots + β_{n - r} y_{n - r} \in X \cap Y \end{matrix}$ and hence

\begin{matrix} β_{1} y_{1} + \dots + β_{n - r} y_{n - r} = 0 \end{matrix}

$\begin{matrix} β_{1} y_{1} + \dots + β_{n - r} y_{n - r} = 0 \end{matrix}$

Since the $y_{i}$ $y_{i}$ ’s are linearly independent, it follows that

β_{1} = β_{2} = \dots = β_{n - r} = 0

$β_{1} = β_{2} = \dots = β_{n - r} = 0$

and hence (1) simplifies to

α_{1} x_{1} + \dots + α_{r} x_{r} = 0

$α_{1} x_{1} + \dots + α_{r} x_{r} = 0$

Since the $x_{i}$ $x_{i}$ ’s are linearly independent, it follows that

α_{1} = α_{2} = \dots = α_{r} = 0

$α_{1} = α_{2} = \dots = α_{r} = 0$

Thus, $x_{1}, \dots, x_{r}, y_{1}, \dots, y_{n - r}$ $x_{1}, \dots, x_{r}, y_{1}, \dots, y_{n - r}$ are linearly independent and therefore $V = X \oplus Y$ $V = X \oplus Y$ . L is invariant and nilpotent on X. We claim that L is invariant and invertible on Y. Let $y \in Y$ $y \in Y$ ; then $y = L^{k_{0}} (v)$ $y = L^{k_{0}} (v)$ for some $v \in V$ $v \in V$ . Thus,

L (y) = L (L^{k_{0}} (v)) = L^{k_{0} + 1} (v) = L^{k_{0}} (L (v))

$L (y) = L (L^{k_{0}} (v)) = L^{k_{0} + 1} (v) = L^{k_{0}} (L (v))$

Therefore, $L (y) \in Y$ $L (y) \in Y$ and hence Y is invariant under L. To prove $L_{[Y]}$ $L_{[Y]}$ is invertible, it suffices to show that

ker (L_{[Y]}) = Y \cap ker (L) = {0}

$ker (L_{[Y]}) = Y \cap ker (L) = {0}$

This, however, follows immediately since $ker (L) \subset X$ $ker (L) \subset X$ and $X \cap Y = {0}$ $X \cap Y = {0}$ .

∎

We are now ready to prove the main result of this section.

Theorem 8.2.3

Let L be a linear operator mapping a finite dimensional vector space V into itself. If $λ_{1}, \dots, λ_{k}$ $λ_{1}, \dots, λ_{k}$ are the distinct eigenvalues of L, then V can be decomposed into a direct sum

X_{1} \oplus X_{2} \oplus \dots \oplus X_{k}

$X_{1} \oplus X_{2} \oplus \dots \oplus X_{k}$

such that $L - λ_{i} ⌶$ $L - λ_{i} ⌶$ is nilpotent on $X_{i}$ $X_{i}$ and the dimension of $X_{i}$ $X_{i}$ equals the multiplicity of $λ_{i}$ $λ_{i}$ .

Proof

Let $L_{1} = L - λ_{1} ⌶$ $L_{1} = L - λ_{1} ⌶$ . By Theorem 8.2.2, there exist subspaces $X_{1}$ $X_{1}$ and $Y_{1}$ $Y_{1}$ that are invariant under $L_{1}$ $L_{1}$ such that $V = X_{1} \oplus Y_{1}, L_{1}$ $V = X_{1} \oplus Y_{1}, L_{1}$ is nilpotent on $X_{1}$ $X_{1}$ , and $L_{1 [Y]}$ $L_{1 [Y]}$ is invertible. It follows that $X_{1}$ $X_{1}$ and $Y_{1}$ $Y_{1}$ are also invariant under L. By Corollary 8.1.2, $L_{[X_{1}]}$ $L_{[X_{1}]}$ can be represented by a block diagonal matrix $A_{1}$ $A_{1}$ , where diagonal blocks are simple Jordan matrices whose diagonal elements all equal $λ_{1}$ $λ_{1}$ . Thus,

det (A_{1} - λ I) = (λ_{1} - λ)^{m_{1}}

$det (A_{1} - λ I) = (λ_{1} - λ)^{m_{1}}$

where $m_{1}$ $m_{1}$ is the dimension of $X_{1}$ $X_{1}$ . Let $B_{1}$ $B_{1}$ be a matrix representing $L_{[Y_{1}]}$ $L_{[Y_{1}]}$ . Since $L_{1}$ $L_{1}$ is invertible on $Y_{1}$ $Y_{1}$ , it follows that $λ_{1}$ $λ_{1}$ is not an eigenvalue of $B_{1}$ $B_{1}$ . Thus,

\det (B_{1} - λ I) = q (λ)

$\det (B_{1} - λ I) = q (λ)$

where $q (λ_{1}) \neq 0$ $q (λ_{1}) \neq 0$ . It follows from Lemma 8.1.2 that the operator L on V can be represented by the matrix

A = [\begin{matrix} A_{1} \\ B_{1} \end{matrix}]

$A = [\begin{matrix} A_{1} \\ B_{1} \end{matrix}]$

Thus, if each eigenvalue $λ_{i}$ $λ_{i}$ of L has multiplicity $r_{i}$ $r_{i}$ , then

\begin{matrix} (λ_{1} - λ)^{r_{1}} {(λ_{2} - λ)}^{r_{2}} \dots {(λ_{k} - λ)}^{r_{k}} & = & \det (A - λ I) \\ = & \det (A - λ I) \det (B_{1} - λ I) \\ = & (λ_{1} - λ)^{m_{1}} q (λ) \end{matrix}

$\begin{matrix} (λ_{1} - λ)^{r_{1}} {(λ_{2} - λ)}^{r_{2}} \dots {(λ_{k} - λ)}^{r_{k}} & = & \det (A - λ I) \\ = & \det (A - λ I) \det (B_{1} - λ I) \\ = & (λ_{1} - λ)^{m_{1}} q (λ) \end{matrix}$

Therefore, $r_{1} = m_{1}$ $r_{1} = m_{1}$ and

q (λ) = (λ_{2} - λ)^{r_{2}} \dots {(λ_{k} - λ)}^{r_{k}}

$q (λ) = (λ_{2} - λ)^{r_{2}} \dots {(λ_{k} - λ)}^{r_{k}}$

If we consider the operator $L_{2} = L - λ_{2} ⌶$ $L_{2} = L - λ_{2} ⌶$ on the vector space $Y_{1}$ $Y_{1}$ , then we can decompose $Y_{1}$ $Y_{1}$ into a direct sum $X_{2} \oplus Y_{2}$ $X_{2} \oplus Y_{2}$ such that $X_{2}$ $X_{2}$ and $Y_{2}$ $Y_{2}$ are invariant under $L, L_{2}$ $L, L_{2}$ is nilpotent on $X_{2}$ $X_{2}$ , and $L_{[Y_{2}]}$ $L_{[Y_{2}]}$ is invertible. Indeed, we can continue this process of decomposing $Y_{i}$ $Y_{i}$ into a direct sum $X_{i + 1} \oplus Y_{i + 1}$ $X_{i + 1} \oplus Y_{i + 1}$ until we obtain a direct sum of the form

V = X_{1} \oplus X_{2} \oplus \dots \oplus X_{k - 1} \oplus Y_{k - 1}

$V = X_{1} \oplus X_{2} \oplus \dots \oplus X_{k - 1} \oplus Y_{k - 1}$

The vector space $Y_{k - 1}$ $Y_{k - 1}$ will be of dimension $r_{k}$ $r_{k}$ with a single eigenvalue $λ_{k}$ $λ_{k}$ . Thus, if we set $X_{k} = Y_{k - 1}$ $X_{k} = Y_{k - 1}$ , then $L - λ_{k} ⌶$ $L - λ_{k} ⌶$ will be nilpotent on $X_{k}$ $X_{k}$ and we will have the desired decomposition of V.

∎

It follows from Theorem 8.2.3 that each operator L mapping an n-dimensional vector space V into itself can be represented by a block diagonal matrix of the form

J = [\begin{matrix} A_{1} \\ A_{2} \\ ⋱ \\ A_{k} \end{matrix}]

$J = [\begin{matrix} A_{1} \\ A_{2} \\ ⋱ \\ A_{k} \end{matrix}]$

where each $A_{i}$ $A_{i}$ is an $r_{i} \times r_{i}$ $r_{i} \times r_{i}$ block diagonal matrix $(r_{i} = multiplicity of λ_{i})$ $(r_{i} = multiplicity of λ_{i})$ whose blocks consist of simple Jordan matrices with $λ_{i}$ $λ_{i}$ ’s along the main diagonal.

If A is an $n \times n$ $n \times n$ matrix, then A represents the operator $L_{A}$ $L_{A}$ with respect to the standard basis on $R^{n}$ $R^{n}$ , where $L_{A}$ $L_{A}$ is defined by

\begin{matrix} L_{A} (x) = A x & for each \end{matrix} x \in R^{n}

$\begin{matrix} L_{A} (x) = A x & for each \end{matrix} x \in R^{n}$

By the preceding remarks, $L_{A}$ $L_{A}$ can be represented by a matrix J of the form just described. It follows that A is similar to J. Thus, each $n \times n$ $n \times n$ matrix A with distinct eigenvalues $λ_{1}, \dots, λ_{k}$ $λ_{1}, \dots, λ_{k}$ is similar to a matrix J of the form

\begin{matrix} J = [\begin{matrix} A_{1} \\ A_{2} \\ ⋱ \\ A_{k} \end{matrix}] \end{matrix}

$\begin{matrix} J = [\begin{matrix} A_{1} \\ A_{2} \\ ⋱ \\ A_{k} \end{matrix}] \end{matrix}$ (2)

where $A_{i}$ $A_{i}$ is an $r_{i} \times r_{i}$ $r_{i} \times r_{i}$ matrix $(r_{i} = multiplicity of λ_{i})$ $(r_{i} = multiplicity of λ_{i})$ of the form

\begin{matrix} A_{i} = [\begin{matrix} J_{1} (λ_{i}) \\ J_{2} (λ_{i}) \\ ⋱ \\ J_{s} (λ_{i}) \end{matrix}] \end{matrix}

$\begin{matrix} A_{i} = [\begin{matrix} J_{1} (λ_{i}) \\ J_{2} (λ_{i}) \\ ⋱ \\ J_{s} (λ_{i}) \end{matrix}] \end{matrix}$ (3)

with the $\begin{matrix} J (λ_{i}) \end{matrix}$ $\begin{matrix} J (λ_{i}) \end{matrix}$ ’s being simple Jordan matrices. The matrix J defined by (2) and (3) is called the Jordan canonical form of A. The Jordan canonical form of a matrix is unique except for a reordering of the simple Jordan blocks along the diagonal.

Example 1 1

Find the Jordan canonical form of the matrix

A = [\begin{matrix} - 3 & 1 & 0 & 1 & 1 \\ - 3 & 1 & 0 & 1 & 1 \\ - 4 & 1 & 0 & 2 & 1 \\ - 3 & 1 & 0 & 1 & 1 \\ - 4 & 1 & 0 & 1 & 2 \end{matrix}]

$A = [\begin{matrix} - 3 & 1 & 0 & 1 & 1 \\ - 3 & 1 & 0 & 1 & 1 \\ - 4 & 1 & 0 & 2 & 1 \\ - 3 & 1 & 0 & 1 & 1 \\ - 4 & 1 & 0 & 1 & 2 \end{matrix}]$

SOLUTION

The characteristic polynomial of A is

| A - λ I | = λ^{4} (1 - λ)

$| A - λ I | = λ^{4} (1 - λ)$

The eigenspace corresponding to $λ = 1$ $λ = 1$ is spanned by $x_{1} = (1, 1, 1, 1, 2)^{T}$ $x_{1} = (1, 1, 1, 1, 2)^{T}$ and the eigen-space corresponding to $λ = 0$ $λ = 0$ is spanned by $x_{2} = {(1, 1, 0, 1, 1)}^{T}$ $x_{2} = {(1, 1, 0, 1, 1)}^{T}$ and $x_{3} = {(0, 0, 1, 0, 0)}^{T}$ $x_{3} = {(0, 0, 1, 0, 0)}^{T}$ . Thus, the Jordan canonical form of A then will consist of three simple Jordan blocks. Except for a reordering of the blocks, there are only two possibilities:

To determine which of these forms is correct, we compute $(A - 0 I)^{2} = A^{2}$ $(A - 0 I)^{2} = A^{2}$ .

A^{2} = [\begin{matrix} - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 2 & 0 & 0 & 0 & 2 \end{matrix}]

$A^{2} = [\begin{matrix} - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 1 & 0 & 0 & 0 & 1 \\ - 2 & 0 & 0 & 0 & 2 \end{matrix}]$

Next we consider the systems

A^{2} x = x_{i}

$A^{2} x = x_{i}$

for $i = 2, 3$ $i = 2, 3$ . Since these systems turn out to be inconsistent, the Jordan canonical form of A cannot have any $3 \times 3$ $3 \times 3$ simple Jordan blocks and, consequently, it must be of the form

To find X, we must solve

A x = x_{i}

$A x = x_{i}$

for $i = 2, 3$ $i = 2, 3$ . The system $A x = x_{2}$ $A x = x_{2}$ has infinitely many solutions. We need choose only one of these, say, $x_{4} = (1, 3, 0, 0, 1)^{T}$ $x_{4} = (1, 3, 0, 0, 1)^{T}$ . Similarly, $A x = x_{3}$ $A x = x_{3}$ has infinitely many solutions, one of which is $x_{5} = (1, 0, 0, 2, 1)^{T}$ $x_{5} = (1, 0, 0, 2, 1)^{T}$ . Let

X = [\begin{matrix} x_{1} & x_{2} & x_{3} & x_{4} & x_{5} \end{matrix}] = [\begin{matrix} 1 & 1 & 0 & 1 & 1 \\ 1 & 1 & 0 & 3 & 0 \\ 1 & 0 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 & 2 \\ 2 & 1 & 0 & 1 & 1 \end{matrix}]

$X = [\begin{matrix} x_{1} & x_{2} & x_{3} & x_{4} & x_{5} \end{matrix}] = [\begin{matrix} 1 & 1 & 0 & 1 & 1 \\ 1 & 1 & 0 & 3 & 0 \\ 1 & 0 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 & 2 \\ 2 & 1 & 0 & 1 & 1 \end{matrix}]$

The reader may verify that $X^{- 1} AX = J$ $X^{- 1} AX = J$ .

One of the main applications of the Jordan canonical form is in solving systems of linear differential equations that have defective coefficient matrices. Given such a system

Y' (t) = A Y (t)

$Y' (t) = A Y (t)$

we can simplify it by using the Jordan canonical form of A. Indeed, if $A = {XJX}^{- 1}$ $A = {XJX}^{- 1}$ , then

Y' = ({XJX}^{- 1}) Y

$Y' = ({XJX}^{- 1}) Y$

Thus, if we set $Z = X^{- 1} Y$ $Z = X^{- 1} Y$ , then $Y' = X Z'$ $Y' = X Z'$ and the system simplifies to

X Z' = XJ Z

$X Z' = XJ Z$

Multiplying by $X^{- 1}$ $X^{- 1}$ , we get

\begin{matrix} Z' = J Z \end{matrix}

$\begin{matrix} Z' = J Z \end{matrix}$ (4)

Because of the structure of J, this new system is much easier to solve. Indeed, solving (4) will only involve solving a number of smaller systems, each of the form

\begin{matrix} {\begin{matrix} z \end{matrix}}_{1}^{'} & = λ z_{1} + z_{2} \\ {\begin{matrix} z \end{matrix}}_{2}^{'} & = λ z_{2} + z_{3} \\ ⋮ \\ {\begin{matrix} z \end{matrix}}_{k - 1}^{'} & = λ z_{k - 1} + z_{k} \\ {\begin{matrix} z \end{matrix}}_{k}^{'} & = λ z_{k} \end{matrix}

$\begin{matrix} {\begin{matrix} z \end{matrix}}_{1}^{'} & = λ z_{1} + z_{2} \\ {\begin{matrix} z \end{matrix}}_{2}^{'} & = λ z_{2} + z_{3} \\ ⋮ \\ {\begin{matrix} z \end{matrix}}_{k - 1}^{'} & = λ z_{k - 1} + z_{k} \\ {\begin{matrix} z \end{matrix}}_{k}^{'} & = λ z_{k} \end{matrix}$

These equations can be solved one at a time starting with the last. The solution to the last equation is clearly

z_{k} = {ce}^{λ t}

$z_{k} = {ce}^{λ t}$

The solution to any equation of the form

z' (t) - λ z (t) = u (t)

$z' (t) - λ z (t) = u (t)$

is given by

z (t) = e^{λ t} \int e^{- λ t} u (t) dt

$z (t) = e^{λ t} \int e^{- λ t} u (t) dt$

Thus, we can solve

z_{k - 1}^{'} - λ z_{k - 1} = z_{k}

$z_{k - 1}^{'} - λ z_{k - 1} = z_{k}$

for $z_{k - 1}$ $z_{k - 1}$ and then solve

z_{k - 2}^{'} - λ z_{k - 2} = z_{k - 1}

$z_{k - 2}^{'} - λ z_{k - 2} = z_{k - 1}$

for $z_{k - 2}$ $z_{k - 2}$ , etc.

Example 2

Solve the initial value problem

\begin{matrix} \begin{matrix} [\begin{matrix} y_{1}^{'} \\ y_{2}^{'} \\ y_{3}^{'} \\ y_{4}^{'} \end{matrix}] \end{matrix} & = & [\begin{matrix} 1 & 0 & 0 & - 1 \\ 0 & 1 & 1 & 0 \\ 0 & - 1 & 1 & 2 \\ 1 & 0 & 2 & 1 \end{matrix}] [\begin{matrix} y_{1} \\ y_{2} \\ y_{3} \\ y_{4} \end{matrix}] \\ y_{1} (0) & = & y_{2} (0) = y_{3} (0) = y_{4} (0) = 2 \end{matrix}

$\begin{matrix} \begin{matrix} [\begin{matrix} y_{1}^{'} \\ y_{2}^{'} \\ y_{3}^{'} \\ y_{4}^{'} \end{matrix}] \end{matrix} & = & [\begin{matrix} 1 & 0 & 0 & - 1 \\ 0 & 1 & 1 & 0 \\ 0 & - 1 & 1 & 2 \\ 1 & 0 & 2 & 1 \end{matrix}] [\begin{matrix} y_{1} \\ y_{2} \\ y_{3} \\ y_{4} \end{matrix}] \\ y_{1} (0) & = & y_{2} (0) = y_{3} (0) = y_{4} (0) = 2 \end{matrix}$

SOLUTION

The coefficient matrix A has two distinct eigenvalues $λ_{1} = 0$ $λ_{1} = 0$ and $λ_{2} = 2$ $λ_{2} = 2$ , each of multiplicity 2. The corresponding eigenspaces are both dimension 1. Using the methods of this section, A can be factored into a product ${XJX}^{- 1}$ ${XJX}^{- 1}$ , where

J = [\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 2 & 1 \\ 0 & 0 & 0 & 2 \end{matrix}]

$J = [\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 2 & 1 \\ 0 & 0 & 0 & 2 \end{matrix}]$

The choice of X is not unique. The reader may verify that the one we have calculated:

X = [\begin{array}{r} 1 & 1 & - 1 & 1 \\ 1 & 1 & 1 & - 1 \\ - 1 & 0 & 1 & 0 \\ 1 & 0 & 1 & 0 \end{array}]

$X = [\begin{array}{r} 1 & 1 & - 1 & 1 \\ 1 & 1 & 1 & - 1 \\ - 1 & 0 & 1 & 0 \\ 1 & 0 & 1 & 0 \end{array}]$

does the job. If we now change variable and set $Z = X^{- 1} Y$ $Z = X^{- 1} Y$ , then we can rewrite the system in the form

Z' = JZ

$Z' = JZ$

The block structure of J allows us to break up the system into two simpler systems:

\begin{matrix} \begin{matrix} \begin{matrix} z_{1}^{'} \end{matrix} = z_{2} \\ {\begin{matrix} z \end{matrix}}_{2}^{'} = 0 \end{matrix} & and & \begin{matrix} z_{3}^{'} = 2 z_{3} + z_{4} \\ {\begin{matrix} z \end{matrix}}_{4}^{'} = 2 z_{4} \end{matrix} \end{matrix}

$\begin{matrix} \begin{matrix} \begin{matrix} z_{1}^{'} \end{matrix} = z_{2} \\ {\begin{matrix} z \end{matrix}}_{2}^{'} = 0 \end{matrix} & and & \begin{matrix} z_{3}^{'} = 2 z_{3} + z_{4} \\ {\begin{matrix} z \end{matrix}}_{4}^{'} = 2 z_{4} \end{matrix} \end{matrix}$

The first system is not difficult to solve.

\begin{matrix} z_{1} & = c_{1} t + c_{2} \\ z_{2} & = c_{1} & (c_{1} and c_{2} are  constants) \end{matrix}

$\begin{matrix} z_{1} & = c_{1} t + c_{2} \\ z_{2} & = c_{1} & (c_{1} and c_{2} are constants) \end{matrix}$

To solve the second system, we first solve

z_{4}^{'} = 2 z_{4}

$z_{4}^{'} = 2 z_{4}$

getting

z_{4} = c_{3} e^{2 t}

$z_{4} = c_{3} e^{2 t}$

Thus,

z_{3}^{'} - 2 z_{3} = c_{3} e^{2 t}

$z_{3}^{'} - 2 z_{3} = c_{3} e^{2 t}$

and hence

z_{3} = e^{2 t} \int e^{- 2 t} (c_{3} e^{2 t}) dt = e^{2 t} (c_{3} t + c_{4})

$z_{3} = e^{2 t} \int e^{- 2 t} (c_{3} e^{2 t}) dt = e^{2 t} (c_{3} t + c_{4})$

Finally, we have

Y = XZ= [\begin{array}{l} (c_{1} t + c_{2}) - c_{1} (c_{3} t + c_{4}) e^{2 t} + c_{3} e^{2 t} \\ (c_{1} t + c_{2}) + c_{1} (c_{3} t + c_{4}) e^{2 t} - c_{3} e^{2 t} \\ - (c_{1} t + c_{2}) + (c_{3} t + c_{4}) e^{2 t} \\ (c_{1} t + c_{2}) + (c_{3} t + c_{4}) e^{2 t} \end{array}]

$Y = XZ= [\begin{array}{l} (c_{1} t + c_{2}) - c_{1} (c_{3} t + c_{4}) e^{2 t} + c_{3} e^{2 t} \\ (c_{1} t + c_{2}) + c_{1} (c_{3} t + c_{4}) e^{2 t} - c_{3} e^{2 t} \\ - (c_{1} t + c_{2}) + (c_{3} t + c_{4}) e^{2 t} \\ (c_{1} t + c_{2}) + (c_{3} t + c_{4}) e^{2 t} \end{array}]$

If we set $t = 0$ $t = 0$ and use the initial conditions to solve for the $c_{i}$ $c_{i}$ ’s, we get

\begin{matrix} c_{1} = - 1, & c_{2} = c_{3} = c_{4} = 1 \end{matrix}

$\begin{matrix} c_{1} = - 1, & c_{2} = c_{3} = c_{4} = 1 \end{matrix}$

Thus, the solution to the initial value problem is

\begin{matrix} y_{1} & = & - t - {te}^{2 t} \\ y_{2} & = & - t + {te}^{2 t} \\ y_{3} & = & - 1 + t + (1 + t) e^{2 t} \\ y_{4} & = & 1 - t + (1 + t) e^{2 t} \end{matrix}

$\begin{matrix} y_{1} & = & - t - {te}^{2 t} \\ y_{2} & = & - t + {te}^{2 t} \\ y_{3} & = & - 1 + t + (1 + t) e^{2 t} \\ y_{4} & = & 1 - t + (1 + t) e^{2 t} \end{matrix}$

The Jordan canonical form not only provides a nice representation of an operator, but it also allows us to solve systems of the form $Y' = A Y$ $Y' = A Y$ even when the coefficient matrix is defective. From a theoretical point of view, its importance cannot be questioned. As far as practical applications go, however, it is generally not very useful.

If $n \geq 5$ $n \geq 5$ , it is usually necessary to calculate the eigenvalues of A by some numerical method. The calculated $λ_{i}$ $λ_{i}$ ’s are only approximations to the actual eigenvalues. Thus, we could have calculated values $λ_{1}^{'}$ $λ_{1}^{'}$ and $λ_{2}^{'}$ $λ_{2}^{'}$ , which are unequal while actually $λ_{1} = λ_{2}$ $λ_{1} = λ_{2}$ . So in practice, it may be difficult to determine the correct multiplicity of the eigenvalues. Furthermore, in order to solve $Y' = A Y$ $Y' = A Y$ , we need to find the similarity matrix X such that $A = {XJX}^{- 1}$ $A = {XJX}^{- 1}$ . However, when A has multiple eigenvalues, the matrix X may be very sensitive to perturbations and, in practice, one is not guaranteed that the entries of the computed similarity matrix will have any digits of accuracy whatsoever. A recommended alternative is to compute the matrix exponential $e^{A}$ $e^{A}$ and use it to solve the system $Y' = A Y$ $Y' = A Y$ .

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 8.2 The Jordan Canonical Form

Create new playlist

Sign In

Sign Up

Table of Contents for
8.2 The Jordan Canonical Form