5.4 Inner Product Spaces

Scalar products are useful not only in n, but also in a wide variety of contexts. To generalize this concept to other vector spaces, we introduce the following definition.

Definition and Examples

A vector space V with an inner product is called an inner product space.

The Vector Space n

The standard inner product for n is the scalar product

x,y=xTy

Given a vector w with positive entries, we could also define an inner product on n by

x,y=i=1nxiyiwi
(1)

The entries wi are referred to as weights.

The Vector Space m×n

Given A and B in m×n, we can define an inner product by

A,B=i=1mj=1naijbij

We leave it to the reader to verify that (2) does indeed define an inner product on m×n.

The Vector Space C[a, b]

We may define an inner product on C[a, b] by

f,g=abf(x)g(x)dx
(3)

Note that

f,f=ab(f(x))2dx0

If f(x0)0 for some x0 in [a, b], then, since (f(x))2 is continuous, there exists a subinterval I of [a, b] containing x0 such that (f(x))2(f(x0))2/2 for all x in I. If we let p represent the length of I, then it follows that

f,f=ab(f(x))2dxI(f(x))2dx(f(x0))2P2>0

So if f,f=0, then f (x) must be identically zero on [a, b]. We leave it to the reader to verify that (3) satisfies the other two conditions specified in the definition of an inner product.

If w(x) is a positive continuous function on [a, b], then

f,g=abf(x)g(x)w(x)dx
(4)

also defines an inner product on C[a, b]. The function w(x) is called a weight function. Thus, it is possible to define many different inner products on C[a, b].

The Vector Space Pn

Let x1,x2,,xn be distinct real numbers. For each pair of polynomials in Pn, define

p,q=i=1nP(xi)g(xi)
(5)

It is easily seen that (5) satisfies conditions (ii) and (iii) of the definition of an inner product. To show that (i) holds, note that

p,p=i=1n(p(xi))20

If p,p=0, then x1,x2,,xn must be roots of p(x)=0. Since p(x) is of degree less than n, it must be the zero polynomial.

If w(x) is a positive function, then

p,q=i=1nP(xi)q(xi)w(xi)

also defines an inner product on Pn.

Basic Properties of Inner Product Spaces

The results presented in Section 5.1 for scalar products in n all generalize to inner product spaces. In particular, if v is a vector in an inner product space V, the length, or norm of v is given by

v=v,v

Two vectors u and v are said to be orthogonal if u,v=0. As in n, a pair of orthogonal vectors will satisfy the Pythagorean law.

Theorem 5.4.1 The Pythagorean Law

If u and v are orthogonal vectors in an inner product space V, then

u+v2=u2+v2

Proof

u+v2=u+v,u+v=u,u+2u,v+v,v=u2+v2

Interpreted in 2, this is just the familiar Pythagorean theorem as shown in Figure 5.4.1.

Figure 5.4.1.

Three vectors represent Pythagorean theorem.

Example 1

Consider the vector space C[1,1] with an inner product defined by (3). The vectors 1 and x are orthogonal, since

1,x=111xdx=0

To determine the lengths of these vectors, we compute

1,1=1111 dx=2x,x=11x2dx=23

It follows that

1=(1,1)1/2=2x=(x,x)1/2=63

Since 1 and x are orthogonal, they satisfy the Pythagorean law:

1+x2=12+x2=2+23=83

The reader may verify that

1+x2=1+x,1+x=11(1+x)2dx=83

Example 2

For the vector space C[π,π], if we use a constant weight function w(x)=1/π to define an inner product

f,g=1πππf(x)g(x)dx
(6)

then

cosx,sinx=1πππcosx sinx dx=0cosx,cosx=1πππcosx cosx dx=1sinx,sinx=1πππsinx sinx dx=1

Thus, cos x and sin x are orthogonal unit vectors with respect to this inner product. It follows from the Pythagorean law that

cosx+sinx=2

The inner product (6) plays a key role in Fourier analysis applications involving a trigonometric approximation of functions. We will look at some of these applications in Section 5.5.

For the vector space m×n, the norm derived from the inner product (2) is called the Frobenius norm and is denoted by F. Thus, if Am×n, then

AF=(A,A)1/2=(i=1mj=1naij2)1/2

Example 3

If

A=[111233]andB=[113034]

then

A,B=11+11+13+20+33+34=6

Hence, A is not orthogonal to B. The norms of these matrices are given by

AF=(1+1+1+1+9+9)1/2=5BF=(1+1+9+0+9+16)1/2=6

Example 4

In P5, define an inner product by (5) with xi=(i1)/4 for i=1,2,,5. The length of the function P(x)=4x is given by

4x=(4x,4x)1/2=(i=1516xi2)1/2=(i=15(i1)2)1/2=30

Observations

If v0 and p is the vector projection of u onto v, then

  1. up and p are orthogonal.

  2. u=p if and only if u is a scalar multiple of v.

Proof of Observation I

Since

p,p=αvv,αvv=(αv)2v,v=α2

and

u,p=(u,v)2v,v=α2

it follows that

up,p=u,pp,p=α2α2=0

Therefore, up and p are orthogonal.

Proof of Observation II

If u=βv, then the vector projection of u onto v is given by

p=βv,vv,vv=βv=u

Conversely, if u=p, it follows from (7) that

u=βvwhereβ=αv

Observations I and II are useful for establishing the following theorem.

Theorem 5.4.2 The Cauchy—Schwarz Inequality

If u and v are any two vectors in an inner product space V, then

|u,v|u  v
(8)

Equality holds if and only if u and v are linearly dependent.

Proof

If v=0, then

|u,v|=0=u   v

If v0, then let p be the vector projection of u onto v. Since p is orthogonal to up, it follows from the Pythagorean law that

p2+up2=u2

Thus,

(u,v)2v2=p2=u2up2

and hence

(u,v)2=u2v2up2v2u2v2

Therefore,

|u,v|u  v

Equality holds in (9) if and only if u=p. It follows from observation II that equality will hold in (8) if and only if v=p or u is a multiple of v. More simply stated, equality will hold if and only if u and v are linearly dependent.

One consequence of the Cauchy–Schwarz inequality is that if u and v are nonzero vectors, then

1u,vuv1

and hence there is a unique angle θ in [0, π] such that

cosθ=u,vuv
(10)

Thus, equation (10) can be used to define the angle θ between two nonzero vectors u and v.

Norms

The word norm in mathematics has its own meaning that is independent of an inner product and its use here should be justified.

The third condition is called the triangle inequality (see Figure 5.4.2).

Figure 5.4.2.

A triangle is formed by three vectors.

Theorem 5.4.3

If V is an inner product space, then the equation

v=v,vfor allvV

defines a norm on V.

Proof

It is easily seen that conditions I and II of the definition are satisfied. We leave this for the reader to verify and proceed to show that condition III is satisfied.

u+v2=u+v,u+v=u,u+2u,v+v,vu2+2u  v+v2(CauchySchwarz)=(u+v)2

Thus,

u+vu+v

It is possible to define many different norms on a given vector space. For example, in n we could define

x1=i=1n|xi|

for every x=(x1,x2,,xn)T. It is easily verified that 1 defines a norm on n. Another important norm on n is the uniform norm or infinity norm, which is defined by

x=max1in|xi|

More generally, we could define a norm on n by

xp=(i=1n|xi|p)1/p

for any real number p1. In particular, if p=2, then

x2=(i=1n|xi|2)1/2=x,x

The norm 2 is the norm on n derived from the inner product. If p2,p does not correspond to any inner product. In the case of a norm that is not derived from an inner product, the Pythagorean law will not hold. For example,

x1=[12]andx2=[42]

are orthogonal; however,

x12+x22=4+16=20

while

x1+x22=16

If, however, 2 is used, then

x122+x222=5+20=25=x1+x222

Example 5

Let x be the vector (4,5,3)T in 3. Compute x1,x2, and x.

x1=|4|+|5|+|3|=12x2=16+25+9=52x=max(|4|,|5|,|3|)=5

It is also possible to define different matrix norms for m×n. In Chapter 7, we will study other types of matrix norms that are useful in determining the sensitivity of linear systems.

In general, a norm provides a way of measuring the distance between vectors.

Many applications involve finding a unique closest vector in a subspace S to a given vector v in a vector space V. If the norm used for V is derived from an inner product, then the closest vector can be computed as a vector projection of v onto the subspace S. This type of approximation problem is discussed further in the next section.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.191.239.123