3.5 Change of Basis

Many applied problems can be simplified by changing from one coordinate system to another. Changing coordinate systems in a vector space is essentially the same as changing from one basis to another. For example, in describing the motion of a particle in the plane at a particular time, it is often convenient to use a basis for 2 consisting of a unit tangent vector t and a unit normal vector n instead of the standard basis {e1,e2}.

In this section, we discuss the problem of switching from one coordinate system to another. We will show that this can be accomplished by multiplying a given coordinate vector x by a nonsingular matrix S. The product y=Sx will be the coordinate vector for the new coordinate system.

Changing Coordinates in 2

The standard basis for 2 is {e1,e2}. Any vector x in 2 can be expressed as a linear combination:

x=x1e1+x2e2

The scalars x1 and x2 can be thought of as the coordinates of x with respect to the standard basis. Actually, for any basis {y,z} for 2, it follows from Theorem 3.3.2 that a given vector x can be represented uniquely as a linear combination:

x=αy+βz

The scalars α and β are the coordinates of x with respect to the basis {y,z}. Let us order the basis elements so that y is considered the first basis vector and z is considered the second, and denote the ordered basis by [y,z]. We can then refer to the vector (α,β)T as the coordinate vector of x with respect to [y,z]. Note that, if we reverse the order of the basis vectors and take [z,y], then we must also reorder the coordinate vector. The coordinate vector of x with respect to [z,y] will be (β,α)T. When we refer to a basis using subscripts, such as {u1,u2}, the subscripts assign an ordering to the basis vectors.

Example 1

Let y=(2,1)T and z=(1,4)T. The vectors y and z are linearly independent and hence form a basis for 2. The vector x=(7,7)T can be written as a linear combination:

x=3y+z

Thus, the coordinate vector of x with respect to [y,z] is (3,1)T. Geometrically, the coordinate vector specifies how to get from the origin to the point (7,7) by moving first in the direction of y and then in the direction of z. If, instead, we treat z as our first basis vector and y as the second basis vector, then

x=z+3y

The coordinate vector of x with respect to the ordered basis [z,y] is (1,3)T. Geometrically, this vector tells us how to get from the origin to (7,7) by moving first in the direction of z and then in the direction of y (see Figure 3.5.1).

Figure 3.5.1.

A graph displays a parallelogram in the first quadrant.

As an example of a problem for which it is helpful to change coordinates, consider the following application.

Changing Coordinates

Once we have decided to work with a new basis, we have the problem of finding the coordinates with respect to that basis. Suppose, for example, that instead of using the standard basis {e1,e2} for 2, we wish to use a different basis, say,

u1=[32],u2=[11]

Indeed, we may want to switch back and forth between the two coordinate systems. Let us consider the following two problems:

  1. Given a vector x=(x1,x2)T, find its coordinates with respect to u1 and u2.

  2. Given a vector c1u1+c2u2, find its coordinates with respect to e1 and e2.

We will solve II first, since it turns out to be the easier problem. To switch bases from {u1,u2} to {e1,e2}, we must express the old basis elements u1 and u2 in terms of the new basis elements e1 and e2.

u1=3e1+2e2u2=3e1+e2

It follows then that

c1u1+c2u2=(3c1e1+2c1e2)+(c2e1+c2e2)=(3c1+c2)e1+(2c1+c2)e1

Thus, the coordinate vector of c1u1+c2u2 with respect to {e1,e2} is

x=[3c1+c22c1+c2]=[3121][c1c2]

If we set

U=(u1,u2)=[3121]

then, given any coordinate vector c with respect to {u1,u2}, to find the corresponding coordinate vector x with respect to {e1,e2}, we simply multiply U times c:

x=Uc
(2)

The matrix U is called the transition matrix from the ordered basis {u1,u2} to the standard basis {e1,e2}.

To solve problem I, we must find the transition matrix from {e1,e2} to {u1,u2}. The matrix U in (2) is nonsingular, since its column vectors, u1 and u2, are linearly independent. It follows from (2) that

c=U1x

Thus, given a vector

x=(x1,x2)T=x1e1+x2e2

we need only multiply by U1 to find its coordinate vector with respect to {u1,u2}. U1 is the transition matrix from {e1,e2} to {u1,u2}.

Example 2

Let u1=(3,2)T,u2=(1,1)T, and x=(7,4)T. Find the coordinates of x with respect to u1 and u2.

SOLUTION

By the preceding discussion, the transition matrix from {e1,e2} to {u1,u2} is the inverse of

U=(u1,u2)=[3121]

Thus,

c=U1x=[1123] [74]=[32]

is the desired coordinate vector and

x=3u12u2

Example 3

Let b1=(1,1)T and b2=(2,3)T. Find the transition matrix from {e1,e2} to {b1,b2} and determine the coordinates of x=(1,2)T with respect to {b1,b2}.

SOLUTION

The transition matrix from {b1,b2} to {e1,e2} is

B=(b1,b2)=[1213]

and hence the transition matrix from {e1,e2} to {b1,b2} is

B1=[3211]

The coordinate vector of x with respect to {b1,b2} is

c=B1x=[3211] [12]=[73]

and hence

x=7b1+3b2

Now let us consider the general problem of changing from one ordered basis {v1,v2} of 2 to another ordered basis {u1,u2}. In this case, we assume that, for a given vector x, its coordinates with respect to {v1,v2} are known:

x=c1v1+c2v2

Now we wish to represent x as a sum d1u1+d2u2. Thus, we must find scalars d1 and d2 so that

c1v1+c2v2=d1u1+d2u2
(3)

If we set V=(v1,v2) and U=(u1,u2), then Equation (3) can be written in matrix form:

Vc=Ud

It follows that

d=U1Vc

Thus, given a vector x in 2 and its coordinate vector c with respect to the ordered basis {v1,v2}, to find the coordinate vector of x with respect to the new basis {u1,u2}, we simply multiply c by the transition matrix S=U1V.

Example 4

Find the transition matrix corresponding to the change of basis from {v1,v2} to {u1,u2}, where

v1=[52],v2=[73]andu1=[32],u2=[11]

SOLUTION

The transition matrix from {v1,v2} to {u1,u2} is given by

S=U1V=[1123][5723]=[3445]

The change of basis from {v1,v2} to {u1,u2} can also be viewed as a two-step process. First we change from {v1,v2} to the standard basis, {e1,e2}, and then we change from the standard basis to {u1,u2}. Given a vector x in 2, if c is the coordinate vector of x with respect to {v1,v2} and d is the coordinate vector of x with respect to {u1,u2}, then

c1v1+c2v2=x1e1+x2e2=d1u1+d2u2

Since V is the transition matrix from {v1,v2} to {e1,e2} and U1 is the transition matrix from {e1,e2} to {u1,u2}, it follows that

Vc=xandU1x=d

and hence

U1Vc=U1x=d

As before, we see that the transition matrix from {v1,v2} to {u1,u2} is U1V (see Figure 3.5.2).

Figure 3.5.2.

A diagram displays a right triangle formed by three vectors.

Change of Basis for a General Vector Space

Everything we have done so far can easily be generalized to apply to any finite-dimensional vector space. We begin by defining coordinate vectors for an n-dimensional vector space.

The examples considered so far have all dealt with changing coordinates in 2. Similar techniques could be used for n. In the case of n, the transition matrices will be n×n.

Example 5

If

v1=[111],v2=[232],v3=[154]

and

u1=[110],u2=[120],u3=[121]

then E={v1,v2,v3} and F={u1,u2,u3} are ordered bases for 3. Let

x=3v1+2v2v3andy=v13v2+2v3

Find the transition matrix from E to F and use it to find the coordinates of x and y with respect to the ordered basis F.

SOLUTION

As in Example 4, the transition matrix is given by

U1V=[210111001] [121135124]=[113110124]

The coordinate vectors of x and y with respect to the ordered basis F are given by

[x]F=[113110124] [321]=[853]

and

[y]F=[113110124] [132]=[823]

The reader may verify that

8u15u2+3u3=3v1+2v2v38u1+2u2+3u3=v13v2+2v3

If V is any n-dimensional vector space, it is possible to change from one basis to another by means of an n×n transition matrix. We will show that such a transition matrix is necessarily nonsingular. To see how this is done, let E={w1,,wn} and F={v1,,vn} be two ordered bases for V. The key step is to express each basis vector wj as a linear combination of the vi’s.

w1=s11v1+s21v2++sn1vnw2=s12v1+s22v2++sn2vnwn=s1nv1+s2nv2++snnvn
(4)

Let vV. If x=[v]E, it follows from (4) that

v=x1w1+x2w2++xnwn=(j=1ns1jxj)v1+(j=1ns2jxj)v2++(j=1nsnjxj)vn

Thus, if y=[v]F, then

yi=j=1nsijxji=1,,n

and hence,

y=Sx

The matrix S defined by (4) is referred to as the transition matrix. Once S has been determined, it is a simple matter to change coordinate systems. To find the coordinates of v=x1w1++xnwn with respect to {v1,,vn}, we need only calculate y=Sx.

The transition matrix S corresponding to the change of basis from {w1,,wn} to {v1,,vn} can be characterized by the condition

Sx=yif and only ifx1w1++xnwn=y1v1++ynvn
(5)

Taking y=0 in (5), we see that Sx=0 implies that

x1w1++xnwn=0

Since the wi’s are linearly independent, it follows that x=0. Thus, the equation Sx=0 has only the trivial solution and hence the matrix S is nonsingular. The inverse matrix is characterized by the condition

S1y=xif and only ify1v1++ynvn=x1w1++xnwn

Thus, S1 is the transition matrix used to change basis from {v1,,vn} to {w1,,wn}.

Example 6

Suppose that in P3 we want to change from the ordered basis [1,x,x2] to the ordered basis [1,2x,4x22]. Because [1,x,x2] is the standard basis for P3, it is easier to find the transition matrix from [1,2x,4x22] to [1,x,x2]. Since

1=11+0x+0x22x=01+2x+0x24x22=21+0x+4x2

the transition matrix is

S=[102020004]

The inverse of S will be the transition matrix from [1,x,x2] to [1,2x,4x22]:

S1=[101201200014]

Given any p(x)=a+bx+cx2 in P3, to find the coordinates of p(x) with respect to [1,2x,4x22], we multiply

[101201200014] [abc]=[a+12c12b14c]

Thus,

p(x)=(a+12c)1+(12b)2x+14c(4x22)

We have seen that each transition matrix is nonsingular. Actually, any nonsingular matrix can be thought of as a transition matrix. If S is an n×n nonsingular matrix and {v1,,vn} is an ordered basis for V, then define {w1,w2,,wn} by (4). To see that the wj’s are linearly independent, suppose that

j=1nxjwj=0

It follows from (4) that

i=1n(j=1nsijxj)vj=0

By the linear independence of the vi’s, it follows that

j=1nsijxj=0i=1,,n

or, equivalently,

Sx=0

Since S is nonsingular, x must equal 0. Therefore, w1,,wn are linearly independent and hence they form a basis for V. The matrix S is the transition matrix corresponding to the change from the ordered basis {w1,,wn} to {v1,,vn}.

In many applied problems, it is important to use the right type of basis for the particular application. In Chapter 5, we will see that the key to solving least squares problems is to switch to a special type of basis called an orthonormal basis. In Chapter 6, we will consider a number of applications involving the eigenvalues and eigenvectors associated with an n×n matrix A. The key to solving these types of problems is to switch to a basis for n consisting of eigenvectors of A.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.190.156.212