6.1 Inner Products and Norms

Many geometric notions such as angle, length, and perpendicularity in R2R2 and R3R3 may be extended to more general real and complex vector spaces. All of these ideas are related to the concept of inner product.

Definition.

Let V be a vector space over F. An inner product on V is a function that assigns, to every ordered pair of vectors x and y in V, a scalar in F, denoted x, yx, y, such that for all x, y, and z in V and all c in F, the following hold:

  1. (a) x+z, y=x, y+z, y.x+z, y=x, y+z, y.

  2. (b) cx, y=cx, y.cx, y=cx, y.

  3. (c) ¯x, y=y, xx, y¯¯¯¯¯¯¯¯¯=y, x, where the bar denotes complex conjugation.

  4. (d) If x0x0 then x, xx, x is a positive real number.

Note that (c) reduces to x, y=y, xx, y=y, x if F=R.F=R. Conditions (a) and (b) simply require that the inner product be linear in the first component.

It is easily shown that if a1, a2, , anFa1, a2, , anF and y, v1, v2, , vnV,y, v1, v2, , vnV, then

ni=1aivi, y=ni=1aivi, y.
i=1naivi, y=i=1naivi, y.

Example 1

For x=(a1, a2, , an)x=(a1, a2, , an) and y=(b1, b2, , bn)y=(b1, b2, , bn) in FnFn, define

x, y=ni=1aiˉbi.
x, y=i=1naib¯i.

The verification that , ,  satisfies conditions (a) through (d) is easy. For example, if z=(c1, c2, , cn)z=(c1, c2, , cn), we have for (a)

x+z, y=ni=1(ai+ci)ˉbi=ni=1aiˉbi+ni=1ciˉbi=x, y+z, y.
x+z, y==i=1n(ai+ci)b¯i=i=1naib¯i+i=1ncib¯ix, y+z, y.

Thus, for x=(1+i, 4)x=(1+i, 4) and y=(23i, 4+5i)y=(23i, 4+5i) in C2C2, x, y=(1+i)(2+3i)+4(45i)=1515i.x, y=(1+i)(2+3i)+4(45i)=1515i.

The inner product in Example 1 is called the standard inner product on FnFn. When F=RF=R, the conjugations are not needed, and in early courses this standard inner product is usually called the dot product and is denoted by xyxy instead of x, yx, y.

Example 2

If x, yx, y is any inner product on a vector space V and r>0r>0, we may define another inner product by the rule x, y=rx, yx, y=rx, y. If r0r0, then (d) would not hold.

Example 3

Let V=C([0, 1])V=C([0, 1]), the vector space of real-valued continuous functions on [0, l]. For f, gVf, gV, define f, g=10f(t)g(t) dtf, g=10f(t)g(t) dt. Since the preceding integral is linear in f, (a) and (b) are immediate, and (c) is trivial. If f0f0, then f2f2 is bounded away from zero on some subinterval of [0, 1] (continuity is used here), and hence f, f=10[f(t)]2 dt>0f, f=10[f(t)]2 dt>0.

Definition.

Let AMm×n(F)AMm×n(F). We define the conjugate transpose or adjoint of A to be the n×mn×m matrix A* such that (A*)ij=ˉAji(A*)ij=A¯¯¯ji for all i, j.

Example 4

Let

A=(i1+2i23+4i).
A=(i21+2i3+4i).

Then

A*=(i212i34i).
A*=(i12i234i).

Notice that if x and y are viewed as column vectors in FnFn, then x, y=y*x.x, y=y*x.

The conjugate transpose of a matrix plays a very important role in the remainder of this chapter. In the case that A has real entries, A* is simply the transpose of A.

Example 5

Let V=Mn×n(F)V=Mn×n(F), and define A, B=tr(B*A)A, B=tr(B*A) for A, BVA, BV. (Recall that the trace of a matrix A is defined by tr(A)=ni=1Aiitr(A)=ni=1Aii.) We verify that (a) and (d) of the definition of inner product hold and leave (b) and (c) to the reader. For this purpose, let A, B, CVA, B, CV. Then (using Exercise 6 of Section 1.3)

A+B, C=tr(C*(A+B))=tr(C*A+C*B)=tr(C*A)+tr(C*B)=A, C+B, C.
A+B, C==tr(C*(A+B))=tr(C*A+C*B)tr(C*A)+tr(C*B)=A, C+B, C.

Also

A, A=tr(A*A)=ni=1(A*A)ii=ni=1nk=1(A*)ikAki=ni=1nk=1ˉAkiAki=ni=1nk=1|Aki|2.
A, A==tr(A*A)=i=1n(A*A)ii=i=1nk=1n(A*)ikAkii=1nk=1nA¯¯¯kiAki=i=1nk=1n|Aki|2.

Now if AOAO, then Aki0Aki0 for some k and i. So A, A>0A, A>0.

The inner product on Mn×n(F)Mn×n(F) in Example 5 is called the Frobenius inner product.

A vector space V over F endowed with a specific inner product is called an inner product space. If F=CF=C, we call V a complex inner product space, whereas if F=RF=R, we call V a real inner product space.

It is clear that if V has an inner product x, yx, y and W is a subspace of V, then W is also an inner product space when the same function x, yx, y is restricted to the vectors x, yWx, yW.

Thus Examples 1, 3, and 5 also provide examples of inner product spaces. For the remainder of this chapter, FnFn denotes the inner product space with the standard inner product as defined in Example 1. Likewise, Mn×n(F)Mn×n(F) denotes the inner product space with the Frobenius inner product as defined in Example 5. The reader is cautioned that two distinct inner products on a given vector space yield two distinct inner product spaces. For instance, it can be shown that both

f(x), g(x)1=10f(t)g(t) dtandf(x), g(x)2=11f(t)g(t) dt
f(x), g(x)1=10f(t)g(t) dtandf(x), g(x)2=11f(t)g(t) dt

are inner products on the vector space P(R). Even though the underlying vector space is the same, however, these two inner products yield two different inner product spaces. For example, the polynomials f(x)=xf(x)=x and g(x)=x2g(x)=x2 are orthogonal in the second inner product space, but not in the first.

A very important inner product space that resembles C([0, 1]) is the space H of continuous complex-valued functions defined on the interval [0, 2π][0, 2π] with the inner product

f, g=12π2π0f(t)¯g(t) dt.
f, g=12π2π0f(t)g(t)¯¯¯¯¯¯ dt.

The reason for the constant 1/2π1/2π will become evident later. This inner product space, which arises often in the context of physical situations, is examined more closely in later sections.

At this point, we mention a few facts about integration of complex-valued functions. First, the imaginary number i can be treated as a constant under the integration sign. Second, every complex-valued function f may be written as f=f1+if2f=f1+if2, where f1f1 and f2f2 are real-valued functions. Thus we have

f dt=f1 dt+i f2 dtand¯f dt=ˉf dt.
f dt=f1 dt+i f2 dtandf¯¯¯¯¯¯ dt=f¯ dt.

From these properties, as well as the assumption of continuity, it follows that H is an inner product space (see Exercise 16(a)).

Some properties that follow easily from the definition of an inner product are contained in the next theorem.

Theorem 6.1.

Let V be an inner product space. Then for x, y, zVx, y, zV and cFcF, the following statements are true.

  1. (a) x, y+z=x, y+x, z.x, y+z=x, y+x, z.

  2. (b) x, cy=ˉcx, y.x, cy=c¯x, y.

  3. (c) x, 0=0, x=0.x, 0=0, x=0.

  4. (d) x, x=0x, x=0 if and only if x=0x=0.

  5. (e) if x, y=x, zx, y=x, z for all xVxV then y=zy=z.

Proof.

(a) We have

x, y+z=¯y+z, x=¯y, x+z, x=¯y, x+¯z, x=x, y+x, z.
x, y+z==y+z, x¯¯¯¯¯¯¯¯¯¯¯¯¯¯=y, x+z, x¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯y, x¯¯¯¯¯¯¯¯¯+z, x¯¯¯¯¯¯¯¯=x, y+x, z.

The proofs of (b), (c), (d), and (e) are left as exercises.

The reader should observe that (a) and (b) of Theorem 6.1 show that the inner product is conjugate linear in the second component.

In order to generalize the notion of length in R3R3 to arbitrary inner product spaces, we need only observe that the length of x=(a, b, c)R3x=(a, b, c)R3 is given by a2+b2+c2=x, xa2+b2+c2=x, x. This leads to the following definition.

Definition.

Let V be an inner product space. For xVxV, we define the norm or length of x by ||x||=x, x||x||=x, x.

Example 6

Let V=FnV=Fn. If x=(a1, a2 , an)x=(a1, a2 , an), then

||x||=||(a1, a2 , an)||=[ni=1|ai|2]1/2
||x||=||(a1, a2 , an)||=[i=1n|ai|2]1/2

is the Euclidean definition of length. Note that if n=1n=1, we have ||a||=|a|||a||=|a|.

As we might expect, the well-known properties of Euclidean length in R3R3 hold in general, as shown next.

Theorem 6.2.

Let V be an inner product space over F. Then for all x, yVx, yV and cFcF the following statements are true.

  1. (a) ||cx||=|c|||x||.||cx||=|c|||x||.

  2. (b) ||x||=0||x||=0 if and only if x=0x=0. In any case, ||x||0||x||0.

  3. (c) (Cauchy-Schwarz Inequality) |x, y|||x||||y|||x, y|||x||||y||.

  4. (d) (Triangle Inequality) ||x+y||||x||+||y||||x+y||||x||+||y||.

Proof.

We leave the proofs of (a) and (b) as exercises.

(c) If y=0y=0, then the result is immediate. So assume that y0y0. For any cFcF, we have

0xcy, xcy=x, xcycy, xcy=x, xˉcx, ycy, x+cˉcy, y.
0xcy, xcy==x, xcycy, xcyx, xc¯x, ycy, x+cc¯y, y.

In particular, if we set

c=x, yy, y,
c=x, yy, y,

then each of ˉcx, y, cy, xc¯x, y, cy, x, and cˉcy, ycc¯y, y equals x, yy, xy, y=|x, y|2||y||2x, yy, xy, y=|x, y|2||y||2. So the preceding inequality becomes

0x, x|x, y|2y, y=||x||2|x, y|2||y||2,
0x, x|x, y|2y, y=||x||2|x, y|2||y||2,

from which (c) follows.

(d) We have

||x+y||2=x+y, x+y=x, x+y, x+x, y+y, y=||x||2+2x, y+||y||2||x||2+2|x, y|+||y||2||x||2+2||x||||y||+||y||2=(||x||+||y||)2,

where x, y denotes the real part of the complex number x, y. Note that we used (c) to prove (d).

The case when equality results in (c) and (d) is considered in Exercise 15.

Example 7

For Fn, we may apply (c) and (d) of Theorem 6.2 to the standard inner product to obtain the following well-known inequalities:

|ni=1aiˉbi|[ni=1|ai|2]1/2[ni=1|bi|2]1/2

and

[ni=1|ai+bi|2]1/2[ni=1|ai|2]1/2+[ni=1|bi|2]1/2.

The reader may recall from earlier courses that, for x and y in R3 or R2 , we have that x, y=||x||||y||cos θ, where θ (0θπ) denotes the angle between x and y. This equation implies (c) immediately since |cos θ|1. Notice also that nonzero vectors x and y are perpendicular if and only if cos θ=0, that is, if and only if x, y=0.

We are now at the point where we can generalize the notion of perpendicularity to arbitrary inner product spaces.

Definitions.

Let V be an inner product space. Vectors x and y in V are orthogonal (or perpendicular) if x, y=0. A subset S of V is orthogonal if any two distinct vectors in S are orthogonal. A vector x in V is a unit vector if ||x||=1. Finally, a subset S of V is orthonormal if S is orthogonal and consists entirely of unit vectors.

Note that if S={v1, v2, }, then S is orthonormal if and only if vi, vj=δij, where δij denotes the Kronecker delta. Also, observe that multiplying vectors by nonzero scalars does not affect their orthogonality and that if x is any nonzero vector, then (1/||x||)x is a unit vector. The process of multiplying a nonzero vector by the reciprocal of its length is called normalizing.

Example 8

In F3, {(1, 1, 0), (1, 1, 1), (1, 1, 2)} is an orthogonal set of nonzero vectors, but it is not orthonormal; however, if we normalize the vectors in the set, we obtain the orthonormal set

{12(1, 1, 0), 13(1, 1, 1), 16(1, 1, 2)}.

Our next example is of an infinite orthonormal set that is important in analysis. This set is used in later examples in this chapter.

Example 9

Recall the inner product space H (defined on page 330). We introduce an important orthonormal subset S of H. For what follows, i is the imaginary number such that i2=1. For any integer n, let fn(t)=eint, where 0t2π. (Recall that eint=cos nt+i sin nt.) Now define S={fn:n is an integer}.

Clearly S is a subset of H. Using the property that ¯eit=eit for every real number t, we have, for mn,

fm, fn=12π2π0eimt¯eint dt=12π2π0ei(mn)t dt=12πi(mn)ei(mn)t|2π0=0.

Also,

fn, fn=12π2π0ei(nn)t dt=12π2π01 dt=1.

In other words, fm, fn=δmn.

Exercises

  1. Label the following statements as true or false.

    1. (a) An inner product is a scalar-valued function on the set of ordered pairs of vectors.

    2. (b) An inner product space must be over the field of real or complex numbers.

    3. (c) An inner product is linear in both components.

    4. (d) There is exactly one inner product on the vector space Rn.

    5. (e) The triangle inequality only holds in finite-dimensional inner product spaces.

    6. (f) Only square matrices have a conjugate-transpose.

    7. (g) If x, y, and z are vectors in an inner product space such that x, y=x, z, then y=z.

    8. (h) If x, y=0 for all x in an inner product space, then y=0.

  2. Let x=(2, 1+i, i) and y=(2i, 2, 1+2i) be vectors in C3. Compute x, y, ||x||, ||y||, and ||x+y||. Then verify both the Cauchy-Schwarz inequality and the triangle inequality.

  3. In C([0, 1]), let f(t)=t and g(t)=et. Compute f, g (as defined in Example 3), ||f||, ||g||, and ||f+g||. Then verify both the Cauchy-Schwarz inequality and the triangle inequality.

    1. (a) Complete the proof in Example 5 that ,  is an inner product (the Frobenius inner product) on Mn×n(F).

    2. (b) Use the Frobenius inner product to compute ||A||, ||B||, and A, B for

      A=(12+i3i)andB=(1+i0ii).
  4. In C2, show that x, y=xAy* is an inner product, where

    A=(1ii2).

    Compute x, y for x=(1i, 2+3i) and y=(2+i, 32i).

  5. Complete the proof of Theorem 6.1.

  6. Complete the proof of Theorem 6.2.

  7. Provide reasons why each of the following is not an inner product on the given vector spaces.

    1. (a) (a, b), (c, d)=acbd on R2

    2. (b) A, B=tr(A+B) on M2×2(R)

    3. (c) f(x), g(x)=10f(t)g(t) dt on P(R), where ‘ denotes differentiation.

  8. Let β be a basis for a finite-dimensional inner product space.

    1. (a) Prove that if x, z=0 for all zβ, then x=0.

    2. (b) Prove that if x, z=y, z for all zβ, then x=y.

  9. † Let V be an inner product space, and suppose that x and y are orthogonal vectors in V. Prove that ||x+y||2=||x||2+||y||2. Deduce the Pythagorean theorem in R2. Visit goo.gl/1iTZzC for a solution.

  10. Prove the parallelogram law on an inner product space V; that is, show that

    ||x+y||2+||xy||2=2||x||2+2||y||2for all x, yV.

    What does this equation state about parallelograms in R2?

  11. † Let {v1, v2, , vk} be an orthogonal set in V, and let a1, a2, , ak be scalars. Prove that

    ki=1aivi2=ki=1|ai|2||vi||2.
  12. Suppose that , 1 and , 2 are two inner products on a vector space V. Prove that , =, 1+, 2 is another inner product on V.

  13. Let A and B be n×n matrices, and let c be a scalar. Prove that (A+cB)*=A*+ˉcB*.

    1. (a) Prove that if V is an inner product space, then |x, y|=||x||||y|| if and only if one of the vectors x or y is a multiple of the other. Hint: If the identity holds and y0, let

      a=x, y||y||2,

      and let z=xay. Prove that y and z are orthogonal and

      |a|=||x||||y||.

      Then apply Exercise 10 to ||x||2=||ay+z||2 to obtain ||z||=0.

    2. (b) Derive a similar result for the equality ||x+y||=||x||+||y||, and generalize it to the case of n vectors.

    1. (a) Show that the vector space H with ,  defined on page 330 is an inner product space.

    2. (b) Let V=C([0, 1]), and define

      f, g=1/20f(t)g(t) dt.

      Is this an inner product on V?

  14. Let T be a linear operator on an inner product space V, and suppose that T(x)=x for all x. Prove that T is one-to-one.

  15. Let V be a vector space over F, where F=R or F=C, and let W be an inner product space over F with inner product , . If T:VW is linear, prove that x, y=T(x), T(y) defines an inner product on V if and only if T is one-to-one.

  16. Let V be an inner product space. Prove that

    1. (a) ||x±y||2=||x||2±2x, y+||y||2 for all x, yV, where x, y denotes the real part of the complex number x, y.

    2. (b) |||x||||y|||||xy|| for all x, yV.

  17. Let V be an inner product space over F. Prove the polar identities: For all x, yV,

    1. (a) x, y=14||x+y||214||xy||2 if F=R;

    2. (b) x, y=144k=1ik||x+iky||2 if F=C, where i2=1.

  18. Let A be an n×n matrix. Define

    A1=12(A+A*)andA2=12i(AA*).
    1. (a) Prove that A*1=A1, A*2=A2, and A=A1+iA2. Would it be reasonable to define A1 and A2 to be the real and imaginary parts, respectively, of the matrix A?

    2. (b) Let A be an n×n matrix. Prove that the representation in (a) is unique. That is, prove that if A=B1+iB2, where B*1=B1 and B*2=B2, then B1=A1 and B2=A2.

  19. Let V be a real or complex vector space (possibly infinite-dimensional), and let β be a basis for V. For x, yV there exist v1, v2, , vnβ such that

    x=ni=1aiviandy=ni=1bivi.

    Define

    x, y=ni=1aiˉbi.
    1. (a) Prove that ,  is an inner product on V and that β is an orthonormal basis for V. Thus every real or complex vector space may be regarded as an inner product space.

    2. (b) Prove that if V=Rn or V=Cn and β is the standard ordered basis, then the inner product defined above is the standard inner product.

  20. Let V=Fn, and let AMn×n(F).

    1. (a) Prove that x, Ay=A*x, y for all x, yV.

    2. (b) Suppose that for some BMn×n(F), we have x, Ay=Bx, y for all x, yV. Prove that B=A*.

    3. (c) Let α be the standard ordered basis for V. For any orthonormal basis β for V, let Q be the n×n matrix whose columns are the vectors in β . Prove that Q* = Q1.

    4. (d) Define linear operators T and U on V by T(x)=Ax and U(x)=A*x. Show that [U]β=[T]*β for any orthonormal basis β for V.

  21. Let V be a complex inner product space with an inner product , . Let [, ] be the real-valued function such that [x, y] is the real part of the complex number x, y for all x, yV. Prove that [, ] is an inner product for V, where V is regarded as a vector space over R. Prove, furthermore, that [x, ix]=0 for all xV.

  22. Let V be a vector space over C, and suppose that [, ] is a real inner product on V, where V is regarded as a vector space over R, such that [x, ix]=0 for all xV. Let ,  be the complex-valued function defined by

    x, y=[x, y]+i[x, iy]forx,yV.

    Prove that ,  is a complex inner product on V.

The following definition is used in Exercises 2630.

Definition.

Let V be a vector space over F, where F is either R or C. Regardless of whether V is or is not an inner product space, we may still define a norm ||||v as a real-valued function on V satisfying the following three conditions for all x, yV and aF:

  1. (1) ||x||v0, and ||x||v=0 if and only if x=0.

  2. (2) ||ax||v=|a|||x||v.

  3. (3) ||x+y||v||x||v+||y||v.

  1. Prove that the following are norms on the given vector spaces V.

    1. (a) V=R2;||(a, b)||v=|a|+|b| for all (a, b)V

    2. (b) V=C([0, 1]);||f||v=maxt[0, 1]|f(t)| for all fV

    3. (c) V=C([0, 1]);||f||v=10|f(t)| dt for all fV

    4. (d) V=Mm×n(F);||A||v=maxi, j|Aij| for all AV

  2. Use Exercise 11 to show that there is no inner product ,  on R2 such that ||x||2V=x, x for all xR2 if the norm is defined as in Exercise 26(a).

  3. Let ||||v be a norm on a vector space V, and define, for each ordered pair of vectors, the scalar d(x, y)=||xy||v, called the distance between x and y. Prove the following results for all x, y, zV.

    1. (a) d(x, y)0.

    2. (b) d(x, y)=d(y, x).

    3. (c) d(x, y)d(x, z)+d(z, y).

    4. (d) d(x, x)=0 if and only if x=0.

    5. (e) d(x, y)0 if xy.

  4. Let ||||v be a norm on a real vector space V satisfying the parallelogram law given in Exercise 11. Define

    x, y=14[||x+y||2v||xy||2v].

    Prove that ,  defines an inner product on V such that ||x||2v=x, x for all xV. Hints:

    1. (a) Prove x, 2y=2x, y for all x, yV.

    2. (b) Prove x+u, y=x, y+u, y for all x, u, yV.

    3. (c) Prove nx, y=nx, y for every positive integer n and every x, yV.

    4. (d) Prove m1mx, y=x, y for every positive integer m and every x, yV.

    5. (e) Prove rx, y=rx, y for every rational number r and every x, yV.

    6. (f) Prove |x, y|||x||v||y||v for every x, yV. Hint: Condition (3) in the definition of norm can be helpful.

    7. (g) Prove that for every cR, every rational number r, and every x, yV,

      |cx, ycx, y|=|(cr)x, y(cr)x, y|2|cr|||x||v||y||v.
    8. (h) Use the fact that for any cR, |cr| can be made arbitrarily small, where r varies over the set of rational numbers, to establish item (b) of the definition of inner product.

  5. Let ||||v be a norm (as defined on page 337) on a complex vector space V satisfying the parallelogram law given in Exercise 11. Prove that there is an inner product ,  on V such that ||x||2v=x, x for all xV. Hint: Apply Exercise 29 to V regarded as a vector space over R. Then apply Exercise 25.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.16.66.156