Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

3.5 Change of Basis

Many applied problems can be simplified by changing from one coordinate system to another. Changing coordinate systems in a vector space is essentially the same as changing from one basis to another. For example, in describing the motion of a particle in the plane at a particular time, it is often convenient to use a basis for $ℝ^{2}$ $ℝ^{2}$ consisting of a unit tangent vector t and a unit normal vector n instead of the standard basis ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ .

In this section, we discuss the problem of switching from one coordinate system to another. We will show that this can be accomplished by multiplying a given coordinate vector x by a nonsingular matrix S. The product $y = S x$ $y = S x$ will be the coordinate vector for the new coordinate system.

Changing Coordinates in $ℝ^{2}$ $ℝ^{2}$

The standard basis for $ℝ^{2}$ $ℝ^{2}$ is ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ . Any vector x in $ℝ^{2}$ $ℝ^{2}$ can be expressed as a linear combination:

x = x_{1} e_{1} + x_{2} e_{2}

$x = x_{1} e_{1} + x_{2} e_{2}$

The scalars $x_{1}$ $x_{1}$ and $x_{2}$ $x_{2}$ can be thought of as the coordinates of x with respect to the standard basis. Actually, for any basis ${y, z}$ ${y, z}$ for $ℝ^{2}$ $ℝ^{2}$ , it follows from Theorem 3.3.2 that a given vector x can be represented uniquely as a linear combination:

x = α y + β z

$x = α y + β z$

The scalars $α$ $α$ and $β$ $β$ are the coordinates of x with respect to the basis ${y, z}$ ${y, z}$ . Let us order the basis elements so that y is considered the first basis vector and z is considered the second, and denote the ordered basis by $[y, z]$ $[y, z]$ . We can then refer to the vector ${(α, β)}^{T}$ ${(α, β)}^{T}$ as the coordinate vector of x with respect to $[y, z]$ $[y, z]$ . Note that, if we reverse the order of the basis vectors and take $[z, y]$ $[z, y]$ , then we must also reorder the coordinate vector. The coordinate vector of x with respect to $[z, y]$ $[z, y]$ will be ${(β, α)}^{T}$ ${(β, α)}^{T}$ . When we refer to a basis using subscripts, such as ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , the subscripts assign an ordering to the basis vectors.

Example 1

Let $y = {(2, 1)}^{T}$ $y = {(2, 1)}^{T}$ and $z = {(1, 4)}^{T}$ $z = {(1, 4)}^{T}$ . The vectors y and z are linearly independent and hence form a basis for $ℝ^{2}$ $ℝ^{2}$ . The vector $x = {(7, 7)}^{T}$ $x = {(7, 7)}^{T}$ can be written as a linear combination:

x = 3 y + z

$x = 3 y + z$

Thus, the coordinate vector of x with respect to $[y, z]$ $[y, z]$ is ${(3, 1)}^{T}$ ${(3, 1)}^{T}$ . Geometrically, the coordinate vector specifies how to get from the origin to the point $(7, 7)$ $(7, 7)$ by moving first in the direction of y and then in the direction of z. If, instead, we treat z as our first basis vector and y as the second basis vector, then

x = z + 3 y

$x = z + 3 y$

The coordinate vector of x with respect to the ordered basis $[z, y]$ $[z, y]$ is ${(1, 3)}^{T}$ ${(1, 3)}^{T}$ . Geometrically, this vector tells us how to get from the origin to $(7, 7)$ $(7, 7)$ by moving first in the direction of z and then in the direction of y (see Figure 3.5.1).

A graph displays a parallelogram in the first quadrant.

Figure 3.5.1. Full Alternative Text

As an example of a problem for which it is helpful to change coordinates, consider the following application.

Application 1

Population Migration

Suppose that the total population of a large metropolitan area remains relatively fixed; however, each year 6 percent of the people living in the city move to the suburbs and 2 percent of the people living in the suburbs move to the city. If, initially, 30 percent of the population lives in the city and 70 percent lives in the suburbs, what will these percentages be in 10 years? 30 years? 50 years? What are the long-term implications?

The changes in population can be determined by matrix multiplications. If we set

\begin{matrix} \begin{matrix} A = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] & and \end{matrix} & x_{0} = [\begin{matrix} 0.30 \\ 0.70 \end{matrix}] \end{matrix}

$\begin{matrix} \begin{matrix} A = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] & and \end{matrix} & x_{0} = [\begin{matrix} 0.30 \\ 0.70 \end{matrix}] \end{matrix}$

then the percentages of people living in the city and suburbs after one year can be calculated by setting $x_{1} = A x_{0}$ $x_{1} = A x_{0}$ . The percentages after two years can be calculated by setting $x_{2} = A x_{1} = A^{2} x_{0}$ $x_{2} = A x_{1} = A^{2} x_{0}$ . In general, the percentages after n years will be given by $x_{n} = A^{n} x_{0}$ $x_{n} = A^{n} x_{0}$ . If we calculate these percentages for $n = 10, 30$ $n = 10, 30$ , and 50 years and round to the nearest percent, we get

\begin{matrix} \begin{matrix} x_{10} = [\begin{matrix} 0.27 \\ 0.73 \end{matrix}] & x_{30} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}] \end{matrix} & x_{50} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}] \end{matrix}

$\begin{matrix} \begin{matrix} x_{10} = [\begin{matrix} 0.27 \\ 0.73 \end{matrix}] & x_{30} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}] \end{matrix} & x_{50} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}] \end{matrix}$

In fact, as n increases, the sequence of vectors $x_{n} = A^{n} x_{0}$ $x_{n} = A^{n} x_{0}$ converges to a limit $x = {(0.25, 0.75)}^{T}$ $x = {(0.25, 0.75)}^{T}$ . The limit vector x is called a steady-state vector for the process.

To understand why the process approaches a steady state, it is helpful to switch to a different coordinate system. For the new coordinate system, we will pick vectors $u_{1}$ $u_{1}$ and $u_{2}$ $u_{2}$ , for which it is easy to see the effect of multiplication by the matrix A. In particular, if we pick $u_{1}$ $u_{1}$ to be any multiple of the steady-state vector x, then $A u_{1}$ $A u_{1}$ will equal $u_{1}$ $u_{1}$ . Let us choose $u_{1} = {(1 3)}^{T}$ $u_{1} = {(1 3)}^{T}$ and $u_{2} = {(- 1 1)}^{T}$ $u_{2} = {(- 1 1)}^{T}$ . The second vector was chosen because the effect of multiplying by A is just to scale the vector by a factor of 0.92. Thus, our new basis vectors satisfy

\begin{array}{l} A u_{1} = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] [\begin{matrix} 1 \\ 3 \end{matrix}] = [\begin{matrix} 1 \\ 3 \end{matrix}] = u_{1} \\ A u_{2} = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] [\begin{array}{r} - 1 \\ 1 \end{array}] = [\begin{array}{r} - 0.92 \\ 0.92 \end{array}] = u_{2} \end{array}

$\begin{array}{l} A u_{1} = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] [\begin{matrix} 1 \\ 3 \end{matrix}] = [\begin{matrix} 1 \\ 3 \end{matrix}] = u_{1} \\ A u_{2} = [\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}] [\begin{array}{r} - 1 \\ 1 \end{array}] = [\begin{array}{r} - 0.92 \\ 0.92 \end{array}] = u_{2} \end{array}$

The initial vector $x_{0}$ $x_{0}$ can be written as a linear combination of the new basis vectors:

\begin{matrix} x_{0} = [\begin{matrix} 0.30 \\ 0.70 \end{matrix}] = 0.25 [\begin{matrix} 1 \\ 3 \end{matrix}] - 0.05 [\begin{array}{r} - 1 \\ 1 \end{array}] = 0.25 u_{1} \end{matrix} - 0.05 u_{2}

$\begin{matrix} x_{0} = [\begin{matrix} 0.30 \\ 0.70 \end{matrix}] = 0.25 [\begin{matrix} 1 \\ 3 \end{matrix}] - 0.05 [\begin{array}{r} - 1 \\ 1 \end{array}] = 0.25 u_{1} \end{matrix} - 0.05 u_{2}$

It follows that

\begin{matrix} x_{n} = A^{n} x_{0} = 0.25 u_{1} \end{matrix} - 0.05 {(0.92)}^{n} u_{2}

$\begin{matrix} x_{n} = A^{n} x_{0} = 0.25 u_{1} \end{matrix} - 0.05 {(0.92)}^{n} u_{2}$

The entries of the second component approach 0 as n gets large. In fact, for $n > 27$ $n > 27$ , the entries will be small enough so that the rounded values of $x_{n}$ $x_{n}$ are all equal to

0.25 u_{1} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}]

$0.25 u_{1} = [\begin{matrix} 0.25 \\ 0.75 \end{matrix}]$

This application is an example of a type of mathematical model called a Markov process. The sequence of vectors $x_{1}, x_{2}, \dots$ $x_{1}, x_{2}, \dots$ is called a Markov chain. The matrix A has a special structure in that its entries are nonnegative and its columns all add up to 1. Such matrices are called stochastic matrices. More precise definitions will be given later when we study these types of applications in Chapter 6. What we want to stress here is that the key to understanding such processes is to switch to a basis for which the effect of the matrix is quite simple. In particular, if A is $n \times n$ $n \times n$ , then we will want to choose basis vectors so that the effect of the matrix A on each basis vector $u_{j}$ $u_{j}$ is simply to scale it by some factor $λ_{j}$ $λ_{j}$ , that is,

\begin{matrix} A u_{j} = λ u_{j} & j = 1, 2, \dots, n \end{matrix}

$\begin{matrix} A u_{j} = λ u_{j} & j = 1, 2, \dots, n \end{matrix}$ (1)

In many applied problems involving an $n \times n$ $n \times n$ matrix A, the key to solving the problem often is to find basis vectors $u_{1}, \dots, u_{n}$ $u_{1}, \dots, u_{n}$ and scalars $λ_{1}, \dots, λ_{n}$ $λ_{1}, \dots, λ_{n}$ such that (1) is satisfied. The new basis vectors can be thought of as a natural coordinate system to use with the matrix A, and the scalars can be thought of as natural frequencies for the basis vectors. We will study these types of applications in more detail in Chapter 6.

Changing Coordinates

Once we have decided to work with a new basis, we have the problem of finding the coordinates with respect to that basis. Suppose, for example, that instead of using the standard basis ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ for $ℝ^{2}$ $ℝ^{2}$ , we wish to use a different basis, say,

\begin{matrix} u_{1} = [\begin{matrix} 3 \\ 2 \end{matrix}], & u_{2} = [\begin{matrix} 1 \\ 1 \end{matrix}] \end{matrix}

$\begin{matrix} u_{1} = [\begin{matrix} 3 \\ 2 \end{matrix}], & u_{2} = [\begin{matrix} 1 \\ 1 \end{matrix}] \end{matrix}$

Indeed, we may want to switch back and forth between the two coordinate systems. Let us consider the following two problems:

Given a vector $x = {(x_{1}, x_{2})}^{T}$ $x = {(x_{1}, x_{2})}^{T}$ , find its coordinates with respect to $u_{1}$ $u_{1}$ and $u_{2}$ $u_{2}$ .
Given a vector $c_{1} u_{1} + c_{2} u_{2}$ $c_{1} u_{1} + c_{2} u_{2}$ , find its coordinates with respect to $e_{1}$ $e_{1}$ and $e_{2}$ $e_{2}$ .

We will solve II first, since it turns out to be the easier problem. To switch bases from ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ to ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ , we must express the old basis elements $u_{1}$ $u_{1}$ and $u_{2}$ $u_{2}$ in terms of the new basis elements $e_{1}$ $e_{1}$ and $e_{2}$ $e_{2}$ .

\begin{matrix} u_{1} & = & {3 e}_{1} + 2 e_{2} \\ u_{2} & = & e_{1} + e_{2} \end{matrix}

$\begin{matrix} u_{1} & = & {3 e}_{1} + 2 e_{2} \\ u_{2} & = & e_{1} + e_{2} \end{matrix}$

It follows then that

\begin{matrix} c_{1} u_{1} + c_{2} u_{2} & = & (3 c_{1} e_{1} + 2 c_{1} e_{2}) + (c_{2} e_{1} + c_{2} e_{2}) \\ = & (3 c_{1} + c_{2}) e_{1} + (2 c_{1} + c_{2}) e_{1} \end{matrix}

$\begin{matrix} c_{1} u_{1} + c_{2} u_{2} & = & (3 c_{1} e_{1} + 2 c_{1} e_{2}) + (c_{2} e_{1} + c_{2} e_{2}) \\ = & (3 c_{1} + c_{2}) e_{1} + (2 c_{1} + c_{2}) e_{1} \end{matrix}$

Thus, the coordinate vector of $c_{1} u_{1} + c_{2} u_{2}$ $c_{1} u_{1} + c_{2} u_{2}$ with respect to ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ is

x = [\begin{matrix} 3 c_{1} + c_{2} \\ 2 c_{1} + c_{2} \end{matrix}] = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}] [\begin{matrix} c_{1} \\ c_{2} \end{matrix}]

$x = [\begin{matrix} 3 c_{1} + c_{2} \\ 2 c_{1} + c_{2} \end{matrix}] = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}] [\begin{matrix} c_{1} \\ c_{2} \end{matrix}]$

If we set

U = (u_{1}, u_{2}) = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}]

$U = (u_{1}, u_{2}) = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}]$

then, given any coordinate vector c with respect to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , to find the corresponding coordinate vector x with respect to ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ , we simply multiply U times c:

x = U c

$x = U c$ (2)

The matrix U is called the transition matrix from the ordered basis ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ to the standard basis ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ .

To solve problem I, we must find the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ . The matrix U in (2) is nonsingular, since its column vectors, $u_{1}$ $u_{1}$ and $u_{2}$ $u_{2}$ , are linearly independent. It follows from (2) that

c = U^{- 1} x

$c = U^{- 1} x$

Thus, given a vector

x = {(x_{1}, x_{2})}^{T} = x_{1} e_{1} + x_{2} e_{2}

$x = {(x_{1}, x_{2})}^{T} = x_{1} e_{1} + x_{2} e_{2}$

we need only multiply by $U^{- 1}$ $U^{- 1}$ to find its coordinate vector with respect to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ . $U^{- 1}$ $U^{- 1}$ is the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ .

Example 2

Let $u_{1} = {(3, 2)}^{T}, u_{2} = {(1, 1)}^{T}$ $u_{1} = {(3, 2)}^{T}, u_{2} = {(1, 1)}^{T}$ , and $x = (7, 4)^{T}$ $x = (7, 4)^{T}$ . Find the coordinates of x with respect to $u_{1}$ $u_{1}$ and $u_{2}$ $u_{2}$ .

SOLUTION

By the preceding discussion, the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ is the inverse of

U = (u_{1}, u_{2}) = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}]

$U = (u_{1}, u_{2}) = [\begin{matrix} 3 & 1 \\ 2 & 1 \end{matrix}]$

Thus,

c = U^{- 1} x = [\begin{matrix} 1 & - 1 \\ - 2 & 3 \end{matrix}] [\begin{matrix} 7 \\ 4 \end{matrix}] = [\begin{array}{r} 3 \\ - 2 \end{array}]

$c = U^{- 1} x = [\begin{matrix} 1 & - 1 \\ - 2 & 3 \end{matrix}] [\begin{matrix} 7 \\ 4 \end{matrix}] = [\begin{array}{r} 3 \\ - 2 \end{array}]$

is the desired coordinate vector and

x = 3 u_{1} - 2 u_{2}

$x = 3 u_{1} - 2 u_{2}$

Example 3

Let $b_{1} = {(1, - 1)}^{T}$ $b_{1} = {(1, - 1)}^{T}$ and $b_{2} = {(- 2, 3)}^{T}$ $b_{2} = {(- 2, 3)}^{T}$ . Find the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${b_{1}, b_{2}}$ ${b_{1}, b_{2}}$ and determine the coordinates of $x = {(1, 2)}^{T}$ $x = {(1, 2)}^{T}$ with respect to ${b_{1}, b_{2}}$ ${b_{1}, b_{2}}$ .

SOLUTION

The transition matrix from ${b_{1}, b_{2}}$ ${b_{1}, b_{2}}$ to ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ is

B = (b_{1}, b_{2}) = [\begin{matrix} 1 & - 2 \\ - 1 & 3 \end{matrix}]

$B = (b_{1}, b_{2}) = [\begin{matrix} 1 & - 2 \\ - 1 & 3 \end{matrix}]$

and hence the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${b_{1}, b_{2}}$ ${b_{1}, b_{2}}$ is

B^{- 1} = [\begin{matrix} 3 & 2 \\ 1 & 1 \end{matrix}]

$B^{- 1} = [\begin{matrix} 3 & 2 \\ 1 & 1 \end{matrix}]$

The coordinate vector of x with respect to ${b_{1}, b_{2}}$ ${b_{1}, b_{2}}$ is

c = B^{- 1} x = [\begin{matrix} 3 & 2 \\ 1 & 1 \end{matrix}] [\begin{matrix} 1 \\ 2 \end{matrix}] = [\begin{matrix} 7 \\ 3 \end{matrix}]

$c = B^{- 1} x = [\begin{matrix} 3 & 2 \\ 1 & 1 \end{matrix}] [\begin{matrix} 1 \\ 2 \end{matrix}] = [\begin{matrix} 7 \\ 3 \end{matrix}]$

and hence

x = 7 b_{1} + 3 b_{2}

$x = 7 b_{1} + 3 b_{2}$

Now let us consider the general problem of changing from one ordered basis ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ of $ℝ^{2}$ $ℝ^{2}$ to another ordered basis ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ . In this case, we assume that, for a given vector x, its coordinates with respect to ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ are known:

x = c_{1} v_{1} + c_{2} v_{2}

$x = c_{1} v_{1} + c_{2} v_{2}$

Now we wish to represent x as a sum $d_{1} u_{1} + d_{2} u_{2}$ $d_{1} u_{1} + d_{2} u_{2}$ . Thus, we must find scalars $d_{1}$ $d_{1}$ and $d_{2}$ $d_{2}$ so that

c_{1} v_{1} + c_{2} v_{2} = d_{1} u_{1} + d_{2} u_{2}

$c_{1} v_{1} + c_{2} v_{2} = d_{1} u_{1} + d_{2} u_{2}$ (3)

If we set $V = (v_{1}, v_{2})$ $V = (v_{1}, v_{2})$ and $U = (u_{1}, u_{2})$ $U = (u_{1}, u_{2})$ , then Equation (3) can be written in matrix form:

V c = U d

$V c = U d$

It follows that

d = U^{- 1} V c

$d = U^{- 1} V c$

Thus, given a vector x in $ℝ^{2}$ $ℝ^{2}$ and its coordinate vector c with respect to the ordered basis ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ , to find the coordinate vector of x with respect to the new basis ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , we simply multiply c by the transition matrix $S = U^{- 1} V$ $S = U^{- 1} V$ .

Example 4

Find the transition matrix corresponding to the change of basis from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , where

\begin{matrix} v_{1} = [\begin{matrix} 5 \\ 2 \end{matrix}], & v_{2} = [\begin{matrix} 7 \\ 3 \end{matrix}] & and & u_{1} = [\begin{matrix} 3 \\ 2 \end{matrix}], & u_{2} = [\begin{matrix} 1 \\ 1 \end{matrix}] \end{matrix}

$\begin{matrix} v_{1} = [\begin{matrix} 5 \\ 2 \end{matrix}], & v_{2} = [\begin{matrix} 7 \\ 3 \end{matrix}] & and & u_{1} = [\begin{matrix} 3 \\ 2 \end{matrix}], & u_{2} = [\begin{matrix} 1 \\ 1 \end{matrix}] \end{matrix}$

SOLUTION

The transition matrix from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ is given by

S = U^{- 1} V = [\begin{matrix} 1 & - 1 \\ - 2 & 3 \end{matrix}] [\begin{matrix} 5 & 7 \\ 2 & 3 \end{matrix}] = [\begin{matrix} 3 & 4 \\ - 4 & - 5 \end{matrix}]

$S = U^{- 1} V = [\begin{matrix} 1 & - 1 \\ - 2 & 3 \end{matrix}] [\begin{matrix} 5 & 7 \\ 2 & 3 \end{matrix}] = [\begin{matrix} 3 & 4 \\ - 4 & - 5 \end{matrix}]$

The change of basis from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ can also be viewed as a two-step process. First we change from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to the standard basis, ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ , and then we change from the standard basis to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ . Given a vector x in $ℝ^{2}$ $ℝ^{2}$ , if c is the coordinate vector of x with respect to ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ and d is the coordinate vector of x with respect to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , then

c_{1} v_{1} + c_{2} v_{2} = x_{1} e_{1} + x_{2} e_{2} = d_{1} u_{1} + d_{2} u_{2}

$c_{1} v_{1} + c_{2} v_{2} = x_{1} e_{1} + x_{2} e_{2} = d_{1} u_{1} + d_{2} u_{2}$

Since V is the transition matrix from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ and $U^{- 1}$ $U^{- 1}$ is the transition matrix from ${e_{1}, e_{2}}$ ${e_{1}, e_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ , it follows that

\begin{matrix} V c = x & and & U^{- 1} x = d \end{matrix}

$\begin{matrix} V c = x & and & U^{- 1} x = d \end{matrix}$

and hence

U^{- 1} V c = U^{- 1} x = d

$U^{- 1} V c = U^{- 1} x = d$

As before, we see that the transition matrix from ${v_{1}, v_{2}}$ ${v_{1}, v_{2}}$ to ${u_{1}, u_{2}}$ ${u_{1}, u_{2}}$ is $U^{- 1} V$ $U^{- 1} V$ (see Figure 3.5.2).

A diagram displays a right triangle formed by three vectors.

Figure 3.5.2. Full Alternative Text

Change of Basis for a General Vector Space

Everything we have done so far can easily be generalized to apply to any finite-dimensional vector space. We begin by defining coordinate vectors for an n-dimensional vector space.

The examples considered so far have all dealt with changing coordinates in $ℝ^{2}$ $ℝ^{2}$ . Similar techniques could be used for $ℝ^{n}$ $ℝ^{n}$ . In the case of $ℝ^{n}$ $ℝ^{n}$ , the transition matrices will be $n \times n$ $n \times n$ .

Example 5

v_{1} = [\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}], v_{2} = [\begin{matrix} 2 \\ 3 \\ 2 \end{matrix}], v_{3} = [\begin{matrix} 1 \\ 5 \\ 4 \end{matrix}]

$v_{1} = [\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}], v_{2} = [\begin{matrix} 2 \\ 3 \\ 2 \end{matrix}], v_{3} = [\begin{matrix} 1 \\ 5 \\ 4 \end{matrix}]$

and

u_{1} = [\begin{matrix} 1 \\ 1 \\ 0 \end{matrix}], u_{2} = [\begin{matrix} 1 \\ 2 \\ 0 \end{matrix}], u_{3} = [\begin{matrix} 1 \\ 2 \\ 1 \end{matrix}]

$u_{1} = [\begin{matrix} 1 \\ 1 \\ 0 \end{matrix}], u_{2} = [\begin{matrix} 1 \\ 2 \\ 0 \end{matrix}], u_{3} = [\begin{matrix} 1 \\ 2 \\ 1 \end{matrix}]$

then $E = {v_{1}, v_{2}, v_{3}}$ $E = {v_{1}, v_{2}, v_{3}}$ and $F = {u_{1}, u_{2}, u_{3}}$ $F = {u_{1}, u_{2}, u_{3}}$ are ordered bases for $ℝ^{3}$ $ℝ^{3}$ . Let

\begin{matrix} x = 3 v_{1} + 2 v_{2} - v_{3} & and & y = v_{1} - 3 v_{2} + 2 v_{3} \end{matrix}

$\begin{matrix} x = 3 v_{1} + 2 v_{2} - v_{3} & and & y = v_{1} - 3 v_{2} + 2 v_{3} \end{matrix}$

Find the transition matrix from E to F and use it to find the coordinates of x and y with respect to the ordered basis F.

SOLUTION

As in Example 4, the transition matrix is given by

U^{- 1} V = [\begin{matrix} 2 & - 1 & 0 \\ - 1 & 1 & - 1 \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 & 2 & 1 \\ 1 & 3 & 5 \\ 1 & 2 & 4 \end{matrix}] = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}]

$U^{- 1} V = [\begin{matrix} 2 & - 1 & 0 \\ - 1 & 1 & - 1 \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 & 2 & 1 \\ 1 & 3 & 5 \\ 1 & 2 & 4 \end{matrix}] = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}]$

The coordinate vectors of x and y with respect to the ordered basis F are given by

{[x]}_{F} = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}] [\begin{array}{r} 3 \\ 2 \\ - 1 \end{array}] = [\begin{array}{r} 8 \\ - 5 \\ 3 \end{array}]

${[x]}_{F} = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}] [\begin{array}{r} 3 \\ 2 \\ - 1 \end{array}] = [\begin{array}{r} 8 \\ - 5 \\ 3 \end{array}]$

and

{[y]}_{F} = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}] [\begin{array}{r} 1 \\ - 3 \\ 2 \end{array}] = [\begin{array}{r} - 8 \\ 2 \\ 3 \end{array}]

${[y]}_{F} = [\begin{matrix} 1 & 1 & - 3 \\ - 1 & - 1 & 0 \\ 1 & 2 & 4 \end{matrix}] [\begin{array}{r} 1 \\ - 3 \\ 2 \end{array}] = [\begin{array}{r} - 8 \\ 2 \\ 3 \end{array}]$

The reader may verify that

\begin{array}{r} 8 u_{1} & - & 5 u_{2} & + & 3 u_{3} & = & 3 v_{1} & + & 2 v_{2} & - & v_{3} \\ - 8 u_{1} & + & 2 u_{2} & + & 3 u_{3} & = & v_{1} & - & 3 v_{2} & + & 2 v_{3} \end{array}

$\begin{array}{r} 8 u_{1} & - & 5 u_{2} & + & 3 u_{3} & = & 3 v_{1} & + & 2 v_{2} & - & v_{3} \\ - 8 u_{1} & + & 2 u_{2} & + & 3 u_{3} & = & v_{1} & - & 3 v_{2} & + & 2 v_{3} \end{array}$

If V is any n-dimensional vector space, it is possible to change from one basis to another by means of an $n \times n$ $n \times n$ transition matrix. We will show that such a transition matrix is necessarily nonsingular. To see how this is done, let $E = {w_{1}, \dots, w_{n}}$ $E = {w_{1}, \dots, w_{n}}$ and $F = {v_{1}, \dots, v_{n}}$ $F = {v_{1}, \dots, v_{n}}$ be two ordered bases for V. The key step is to express each basis vector $w_{j}$ $w_{j}$ as a linear combination of the $v_{i} ’s$ $v_{i} ’s$ .

\begin{matrix} w_{1} & = s_{11} v_{1} + s_{21} v_{2} + \dots + s_{n 1} v_{n} \\ w_{2} & = s_{12} v_{1} + s_{22} v_{2} + \dots + s_{n 2} v_{n} \\ ⋮ \\ w_{n} & = s_{1 n} v_{1} + s_{2 n} v_{2} + \dots + s_{n n} v_{n} \end{matrix}

$\begin{matrix} w_{1} & = s_{11} v_{1} + s_{21} v_{2} + \dots + s_{n 1} v_{n} \\ w_{2} & = s_{12} v_{1} + s_{22} v_{2} + \dots + s_{n 2} v_{n} \\ ⋮ \\ w_{n} & = s_{1 n} v_{1} + s_{2 n} v_{2} + \dots + s_{n n} v_{n} \end{matrix}$ (4)

Let $v \in V$ $v \in V$ . If $x = [v]_{E}$ $x = [v]_{E}$ , it follows from (4) that

\begin{matrix} v & = & x_{1} w_{1} + x_{2} w_{2} + \dots + x_{n} w_{n} \\ = & (\sum_{j = 1}^{n} s_{1 j} x_{j}) v_{1} + (\sum_{j = 1}^{n} s_{2 j} x_{j}) v_{2} + \dots + (\sum_{j = 1}^{n} s_{n j} x_{j}) v_{n} \end{matrix}

$\begin{matrix} v & = & x_{1} w_{1} + x_{2} w_{2} + \dots + x_{n} w_{n} \\ = & (\sum_{j = 1}^{n} s_{1 j} x_{j}) v_{1} + (\sum_{j = 1}^{n} s_{2 j} x_{j}) v_{2} + \dots + (\sum_{j = 1}^{n} s_{n j} x_{j}) v_{n} \end{matrix}$

Thus, if $y = [v]_{F}$ $y = [v]_{F}$ , then

\begin{matrix} y_{i} = \sum_{j = 1}^{n} s_{i j} x_{j} & i = 1, \dots, n \end{matrix}

$\begin{matrix} y_{i} = \sum_{j = 1}^{n} s_{i j} x_{j} & i = 1, \dots, n \end{matrix}$

and hence,

y = S x

$y = S x$

The matrix S defined by (4) is referred to as the transition matrix. Once S has been determined, it is a simple matter to change coordinate systems. To find the coordinates of $v = x_{1} w_{1} + \dots + x_{n} w_{n}$ $v = x_{1} w_{1} + \dots + x_{n} w_{n}$ with respect to ${v_{1}, \dots, v_{n}}$ ${v_{1}, \dots, v_{n}}$ , we need only calculate $y = S x$ $y = S x$ .

The transition matrix S corresponding to the change of basis from ${w_{1}, \dots, w_{n}}$ ${w_{1}, \dots, w_{n}}$ to ${v_{1}, \dots, v_{n}}$ ${v_{1}, \dots, v_{n}}$ can be characterized by the condition

\begin{matrix} S x = y & if and only if & x_{1} w_{1} + \dots + x_{n} w_{n} = y_{1} v_{1} + \dots + y_{n} v_{n} \end{matrix}

$\begin{matrix} S x = y & if and only if & x_{1} w_{1} + \dots + x_{n} w_{n} = y_{1} v_{1} + \dots + y_{n} v_{n} \end{matrix}$ (5)

Taking $y = 0$ $y = 0$ in (5), we see that $S x = 0$ $S x = 0$ implies that

x_{1} w_{1} + \dots + x_{n} w_{n} = 0

$x_{1} w_{1} + \dots + x_{n} w_{n} = 0$

Since the $w_{i} ’s$ $w_{i} ’s$ are linearly independent, it follows that $x = 0$ $x = 0$ . Thus, the equation $S x = 0$ $S x = 0$ has only the trivial solution and hence the matrix S is nonsingular. The inverse matrix is characterized by the condition

\begin{matrix} S^{- 1} y = x & if and only if & y_{1} v_{1} + \dots + y_{n} v_{n} = x_{1} w_{1} + \dots + x_{n} w_{n} \end{matrix}

$\begin{matrix} S^{- 1} y = x & if and only if & y_{1} v_{1} + \dots + y_{n} v_{n} = x_{1} w_{1} + \dots + x_{n} w_{n} \end{matrix}$

Thus, $S^{- 1}$ $S^{- 1}$ is the transition matrix used to change basis from ${v_{1}, \dots, v_{n}}$ ${v_{1}, \dots, v_{n}}$ to ${w_{1}, \dots, w_{n}}$ ${w_{1}, \dots, w_{n}}$ .

Example 6

Suppose that in $P_{3}$ $P_{3}$ we want to change from the ordered basis $[1, x, x^{2}]$ $[1, x, x^{2}]$ to the ordered basis $[1, 2 x, 4 x^{2} - 2]$ $[1, 2 x, 4 x^{2} - 2]$ . Because $[1, x, x^{2}]$ $[1, x, x^{2}]$ is the standard basis for $P_{3}$ $P_{3}$ , it is easier to find the transition matrix from $[1, 2 x, 4 x^{2} - 2]$ $[1, 2 x, 4 x^{2} - 2]$ to $[1, x, x^{2}]$ $[1, x, x^{2}]$ . Since

\begin{matrix} 1 & = & 1 \cdot 1 + 0 x + 0 x^{2} \\ 2 x & = & 0 \cdot 1 + 2 x + 0 x^{2} \\ 4 x^{2} - 2 & = & - 2 \cdot 1 + 0 x + 4 x^{2} \end{matrix}

$\begin{matrix} 1 & = & 1 \cdot 1 + 0 x + 0 x^{2} \\ 2 x & = & 0 \cdot 1 + 2 x + 0 x^{2} \\ 4 x^{2} - 2 & = & - 2 \cdot 1 + 0 x + 4 x^{2} \end{matrix}$

the transition matrix is

S = [\begin{matrix} 1 & 0 & - 2 \\ 0 & 2 & 0 \\ 0 & 0 & 4 \end{matrix}]

$S = [\begin{matrix} 1 & 0 & - 2 \\ 0 & 2 & 0 \\ 0 & 0 & 4 \end{matrix}]$

The inverse of S will be the transition matrix from $[1, x, x^{2}]$ $[1, x, x^{2}]$ to $[1, 2 x, 4 x^{2} - 2]$ $[1, 2 x, 4 x^{2} - 2]$ :

S^{- 1} = [\begin{matrix} 1 & 0 & \frac{1}{2} \\ 0 & \frac{1}{2} & 0 \\ 0 & 0 & \frac{1}{4} \end{matrix}]

$S^{- 1} = [\begin{matrix} 1 & 0 & \frac{1}{2} \\ 0 & \frac{1}{2} & 0 \\ 0 & 0 & \frac{1}{4} \end{matrix}]$

Given any $p (x) = a + b x + c x^{2}$ $p (x) = a + b x + c x^{2}$ in $P_{3}$ $P_{3}$ , to find the coordinates of $p (x)$ $p (x)$ with respect to $[1, 2 x, 4 x^{2} - 2]$ $[1, 2 x, 4 x^{2} - 2]$ , we multiply

[\begin{matrix} 1 & 0 & \frac{1}{2} \\ 0 & \frac{1}{2} & 0 \\ 0 & 0 & \frac{1}{4} \end{matrix}] [\begin{matrix} a \\ b \\ c \end{matrix}] = [\begin{matrix} a + \frac{1}{2} c \\ \frac{1}{2} b \\ \frac{1}{4} c \end{matrix}]

$[\begin{matrix} 1 & 0 & \frac{1}{2} \\ 0 & \frac{1}{2} & 0 \\ 0 & 0 & \frac{1}{4} \end{matrix}] [\begin{matrix} a \\ b \\ c \end{matrix}] = [\begin{matrix} a + \frac{1}{2} c \\ \frac{1}{2} b \\ \frac{1}{4} c \end{matrix}]$

Thus,

p (x) = (a + \frac{1}{2} c) \cdot 1 + (\frac{1}{2} b) \cdot 2 x + \frac{1}{4} c \cdot (4 x^{2} - 2)

$p (x) = (a + \frac{1}{2} c) \cdot 1 + (\frac{1}{2} b) \cdot 2 x + \frac{1}{4} c \cdot (4 x^{2} - 2)$

We have seen that each transition matrix is nonsingular. Actually, any nonsingular matrix can be thought of as a transition matrix. If S is an $n \times n$ $n \times n$ nonsingular matrix and ${v_{1}, \dots, v_{n}}$ ${v_{1}, \dots, v_{n}}$ is an ordered basis for V, then define ${w_{1}, w_{2}, \dots, w_{n}}$ ${w_{1}, w_{2}, \dots, w_{n}}$ by (4). To see that the $w_{j} ’s$ $w_{j} ’s$ are linearly independent, suppose that

\sum_{j = 1}^{n} x_{j} w_{j} = 0

$\sum_{j = 1}^{n} x_{j} w_{j} = 0$

It follows from (4) that

\sum_{i = 1}^{n} (\sum_{j = 1}^{n} s_{i j} x_{j}) v_{j} = 0

$\sum_{i = 1}^{n} (\sum_{j = 1}^{n} s_{i j} x_{j}) v_{j} = 0$

By the linear independence of the $v_{i} ’s$ $v_{i} ’s$ , it follows that

\begin{matrix} {\begin{matrix} \sum \end{matrix}}_{j = 1}^{n} s_{i j} x_{j} = 0 & i = 1, \dots, n \end{matrix}

$\begin{matrix} {\begin{matrix} \sum \end{matrix}}_{j = 1}^{n} s_{i j} x_{j} = 0 & i = 1, \dots, n \end{matrix}$

or, equivalently,

S x = 0

$S x = 0$

Since S is nonsingular, x must equal 0. Therefore, $w_{1}, \dots, w_{n}$ $w_{1}, \dots, w_{n}$ are linearly independent and hence they form a basis for V. The matrix S is the transition matrix corresponding to the change from the ordered basis ${w_{1}, \dots, w_{n}}$ ${w_{1}, \dots, w_{n}}$ to ${v_{1}, \dots, v_{n}}$ ${v_{1}, \dots, v_{n}}$ .

In many applied problems, it is important to use the right type of basis for the particular application. In Chapter 5, we will see that the key to solving least squares problems is to switch to a special type of basis called an orthonormal basis. In Chapter 6, we will consider a number of applications involving the eigenvalues and eigenvectors associated with an $n \times n$ $n \times n$ matrix A. The key to solving these types of problems is to switch to a basis for $ℝ^{n}$ $ℝ^{n}$ consisting of eigenvectors of A.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 3.5 Change of Basis

Create new playlist

Sign In

Sign Up

Table of Contents for
3.5 Change of Basis