13.5 Gauss elimination

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

13.5 Gauss elimination

Gauss elimination is a structured process for the elimination of variables in one of the equations. It is easy to generalize to larger systems of equations and it is relatively numerically stable, making it suitable for use with a computer.

Gauss elimination is performed in stages. At Stage 1, we concentrate on the first column, the coefficients of x. The idea is to make the coefficient of the first equation 1 and eliminate the variable from the other equation(s) by using multiples of the first equation. Equation 1 is therefore the pivotal equation for Stage 1.

Example 13.25

Solve

$\begin{matrix} 3 x + 4 y = 7 \\ 5 x - 8 y = 8 \end{matrix}$

si165_e

using Gauss elimination.

Solution We can either write out the equation each time we perform a step or we can abbreviate the solution by expressing the equations in short hand as an augmented matrix

$(\begin{matrix} 3 & 4 & 7 \\ 5 & - 8 & 8 \end{matrix}) .$

si166_e

We shall present both notations at the same time. We shall refer to the elements in the augmented matrix by

$(\begin{matrix} a_{11} & a_{12} & b_{1} \\ a_{21} & a_{22} & b_{2} \end{matrix}) .$

si167_e

For the first step, the first equation is the pivotal equation:

$\begin{matrix} 3 x + 4 y = 7 \\ 5 x - 8 y = 8 \end{matrix} ? (\begin{matrix} 3 & 4 & 7 \\ 5 & - 8 & 8 \end{matrix}) .$

si168_e

Stage 1

Step 1: Divide the first equation by a₁₁:

$\begin{matrix} x + \frac{4}{3} y = \frac{7}{3} \\ 5 x - 8 y = 8 \end{matrix} ? (\begin{matrix} 1 & \frac{4}{3} & \frac{7}{3} \\ 5 & - 8 & 8 \end{matrix}) .$

si169_e

Step 2: Take 5 times the first equation away from the second equation in order to eliminate the term in x in the second equation:

$\begin{array}{l} x + \frac{4}{3} y = \frac{7}{3} \\ ? - 8 y - (\frac{4}{3} y \times 5) = 8 - \frac{7}{3} \times 5 \end{array}$

si170_e

which is the same as

$\begin{array}{l} x + \frac{4}{3} y = \frac{7}{3} \\ ? - \frac{44}{3} y = - \frac{11}{3} \end{array} (\begin{array}{l} 1 & \frac{4}{3} & \frac{7}{3} \\ 0 & - \frac{44}{3} & - \frac{11}{3} \end{array}) .$

si171_e

Stage 2:

Divide the second equation through by the coefficient of y(a₂₂).

$\begin{matrix} x + \frac{4}{3} y = \frac{7}{3} \\ y = \frac{1}{4} \end{matrix} (\begin{matrix} 1 & \frac{4}{3} & \frac{7}{3} \\ 0 & 1 & \frac{1}{4} \end{matrix}) .$

si172_e

We now already have the solution for y and we can obtain the solution for x by substitution into the first equation. This is called ‘back-substitution’.

$\begin{array}{l} x + (\frac{4}{3}) (\frac{1}{4}) = \frac{7}{3} \\ ? \Leftrightarrow ? x = \frac{7}{3} - \frac{1}{3} \\ ? \Leftrightarrow ? x = 2. \end{array}$

si173_e

Therefore, the solution is (2, 0.25).

We see that the point of the exercise is to write the system of equations so that the final equation contains only one variable and the last but one equation has up to two variables, etc. The matrix of coefficients should be in upper triangular form like:

$(\begin{matrix} 1 & \frac{4}{3} \\ 0 & 1 \end{matrix}) .$

si174_e

The augmented matrix is then like

which is said to be in echelon form. Once this form has been achieved, then back-substitution can be performed to find the value of the variables.

Example 13.26

Solve the system of equations

$\begin{matrix} 2 x + y - 2 z = - 1 \\ 2 x - 3 y + 2 z = 9 \\ - ? x + y - z = - 3.5. \end{matrix}$

si175_e

Solution

$\begin{matrix} 2 x + y - 2 z = - 1 \\ 2 x - 3 y + 2 z = 9 \\ - x + y - z = - 3.5 \end{matrix} (\begin{matrix} 2 & 1 & - 2 & - 1 \\ 2 & - 3 & 2 & 9 \\ - 1 & 1 & - 1 & - 3.5 \end{matrix}) .$

si176_e

Stage 1

Stage 1 concerns the first column. We use the first row (the pivotal row) to eliminate the elements below a₁₁.

Step 1: Divide the first equation by a₁₁.

$\begin{matrix} x + 0.5 y - z = - 0.5 \\ 2 x - 3 y + 2 z = 9 \\ - x + y - z = - 3.5 \end{matrix} (\begin{matrix} 1 & 0.5 & - 1 & - 0.5 \\ 2 & - 3 & 2 & 9 \\ - 1 & 1 & - 1 & - 3.5 \end{matrix}) .$

si177_e

Step 2: Eliminate x from the second and third equations by taking away multiples of equation 1. To do this we take Row 2–2 × (Row 1) and

Row 3 − (−1) × (Row 1).

$\begin{matrix} x + 0.5 y ? - ? z ? = ? - 0.5 \\ - 4 y ? + ? 4 z ? = ? 10 \\ 1.5 y ? - ? 2 z ? = ? - 4 \end{matrix} (\begin{matrix} 1 & 0.5 & - 1 & - 0.5 \\ 0 & - 4 & 0 & 10 \\ 0 & 1.5 & - 2 & - 4 \end{matrix})$

si178_e

The calculations can be done ‘in the margin’ and were:

Row 2–2 × (Row 1)

Stage 2

Stage 2 concerns the second column. We use the second row to eliminate the elements below a₂₂. Here Row 2 is the pivotal row.

Step 1: Divide the second equation by the coefficient a₂₂:

$\begin{matrix} x + 0.5 y - z = - 0.5 \\ y ? - ? z = - 2.5 \\ 1.5 y - 2 z = - 4 \end{matrix} (\begin{matrix} 1 & 0.5 & - 1 & - 0.5 \\ 0 & 1 & - 1 & - 2.5 \\ 0 & 1.5 & - 2 & - 4 \end{matrix}) .$

si179_e

Step 2: Eliminate y from the third equation by taking away multiples of the second equation.

$\begin{matrix} x + 0.5 y ? - ? z = - 0.5 \\ y ? - ? z = - 2.5 \\ - 0.5 z = - 0.25 \end{matrix} ? (\begin{matrix} 1 & 0.5 & - 1 & - 0.5 \\ 0 & 1 & - 1 & - 2.5 \\ 0 & 0 & - 0.5 & - 0.25 \end{matrix}) .$

si180_e

Here, the calculation was Row 3–1.5 × (Row 2) and the calculation was as follows

Stage 3

Divide the third equation by the coefficient of z.

$\begin{matrix} x + 0.5 y ? - ? z = - 0.5 \\ y ? - ? z = - 2.5 \\ z = 0.5 \end{matrix} ? (\begin{matrix} 1 & 0.5 & - 1 & - 0.5 \\ 0 & 1 & - 1 & - 2.5 \\ 0 & 0 & 0.5 & - 0.25 \end{matrix})$

si181_e

Back-substitution: We have now finished the elimination stage and we can easily solve the equations using back-substitution.

From the third equation, z =−0.5

Find y from the second equation

$\begin{array}{l} y = - 2.5 + z \Leftrightarrow y = - 2.5 + 0.5 \\ ? \Leftrightarrow ? y = - 2 \end{array}$

si182_e

Substitute into the first equation to find x

$\begin{array}{l} x + 0.5 (- 2) - 0.5 = - 0.5 \\ ? \Leftrightarrow ? x - 1.5 = - 0.5 \\ ? \Leftrightarrow ? x = 1 \end{array}$

si183_e

So the solution of the system of equations is (1, −2,0.5).

Check: To check, substitute x =1, y =−2, and z =0.5 into the original equations

$\begin{matrix} 2 x + y - 2 z = - 1 \\ 2 x - 3 y + 2 z = 9 \\ - x + y - z = - 3.5 \end{matrix}$

si184_e

giving

$\begin{matrix} 2 (1) + (- 2) - 2 (0.5) = - 1, ? w h i c h ? i s ? t r u e \\ 2 (1) - 3 (- 2) + 2 (0.5) = 9, ? w h i c h ? i s ? t r u e \\ - (1) + (- 2) - 0.5 = - 3.5, ? w h i c h ? i s ? t r u e . \end{matrix}$

si185_e

Now we can solve the system of equations for the electrical network, which was the introductory example of Section 13.4.

Example 13.27

Solve, using Gauss elimination, the system of equations

$\begin{matrix} I_{1} - I_{2} - I_{3} = 0 \\ 3 I_{2} - 2 I_{3} = 0 \\ 7 I_{1} + ? 2 I_{3} = 8 \end{matrix}$

si186_e

Solution We shall only show the augmented matrix in this example, so we begin with

$(\begin{matrix} 1 & - 1 & - 1 & 0 \\ 0 & 3 & - 2 & 0 \\ 7 & 0 & 2 & 8 \end{matrix})$

si187_e

Stage 1

Stage 1 concerns the first column. We use the first row to eliminate the elements below a₁₁.

Step 1: Divide the first equation by a_11. As this is already 1 we do not need to divide by it.

Step 2: Eliminate elements in the first column below a ₁₁ by taking away multiples of Row 1 from Rows 2 and 3. Row 2 already has no entry in the first column so we leave it alone. We take Row 3 – (7) × (Row 1).

$(\begin{matrix} 1 & - 1 & - 1 & 0 \\ 0 & 3 & - 2 & 0 \\ 0 & 7 & 9 & 8 \end{matrix})$

si188_e

The calculations performed here was: Row 2 – 7 × (Row 1)

Stage 2

Stage 2 concerns the second column. We use the second row to eliminate the elements below a₂₂.

Step 1: Divide the second equation by the coefficient of a₂₂.

$(\begin{matrix} 1 & - 1 & - 1 & 0 \\ 0 & 1 & - \frac{2}{3} & 0 \\ 0 & 7 & 9 & 8 \end{matrix})$

si189_e

Step 2: Eliminate the element in the second column below a₂₂ by taking away multiples of Row 2 from Row 3.

$(\begin{matrix} 1 & - 1 & - 1 & 0 \\ 0 & 1 & - \frac{2}{3} & 0 \\ 0 & 0 & \frac{41}{3} & 8 \end{matrix})$

si190_e

Here the calculation was Row 3 – (7) × (Row 2), and the calculation was as follows:

Stage 3

Divide the third equation by the coefficient of z:

$(\begin{matrix} 1 & - 1 & - 1 & 0 \\ 0 & 1 & - \frac{2}{3} & 0 \\ 0 & 0 & 1 & \frac{24}{41} \end{matrix})$

si191_e

Back-substitution: We have now finished the elimination stage and we can easily solve the equations using back substitution.

From the third equation, $I_{3} = \frac{24}{41}$ si192_e . Find I₂ from the second equation:

$\begin{matrix} I_{2} - \frac{2}{3} I_{3} = 0 \\ I_{2} - \frac{2}{3} \times \frac{24}{41} = 0 ? ? ? ? ? \Leftrightarrow ? ? ? ? ? I_{2} = \frac{16}{41} . \end{matrix}$

si193_e

Substitute into the first equation to find I₁:

$I_{1} - \frac{16}{41} - \frac{24}{41} = 0 ? ? ? ? \Leftrightarrow ? ? ? ? ? I_{1} = \frac{40}{41}$

si194_e

So the solution of the system of equations is $(\frac{40}{41}, ? \frac{16}{41}, ? \frac{24}{41}) .$ si195_e

Check: To check, substitute $I_{1} = \frac{40}{41}, ? I_{2} = \frac{16}{41}, ? a n d ? I_{3} = \frac{24}{41}$ into the original equations

$\begin{array}{l} I_{1} - I_{2} - I_{3} = 0 \\ 3 I_{2} - 2 I_{3} = 0 \\ 7 I_{1} + 2 I_{3} = 8 \end{array}$

si197_e

giving

$\begin{array}{l} \frac{40}{41} - \frac{16}{41} - \frac{24}{41} ? = ? 0, ? w h i c h ? i s ? t r u e \\ 3 (\frac{16}{41}) - 2 (\frac{24}{41}) ? = ? 0, ? w h i c h ? i s ? t r u e \\ 7 (\frac{40}{41}) + 2 (\frac{24}{41}) ? = ? 8, ? w h i c h ? i s ? t r u e . \end{array}$

si198_e

Indeterminacy and inconsistency

When we analysed systems of equations in Section 13.5, we saw that one of the equations reducing to 0 =0, indicates that we have an indeterminate system; that is that there will be many solutions. If, in the process of performing Gauss elimination, we find a row of zeros then we know that we have an indeterminate system. We can use the remaining equations to eliminate as many of the unknowns as possible, giving a solution which will still involve one or more of the variables. This will give a whole line or possibly (in three dimensions) a plane of solutions.

If we come across a row that is zero everywhere in the matrix of coefficients but has a non-zero constant term we have found an equation 0 = c, which is false. This is an inconsistent system and has no solutions.

Order of the equations

At the beginning of each stage in Gauss elimination, the order of the rows may be swapped (other than those already used in previous stages as the pivotal equation). We will have to swap the equations if the next ‘pivotal’ equation has a zero coefficient for the next variable to be eliminated. It is usual, although we have not illustrated this point, to always consider swapping the order of the equations, in order to choose the equation with the largest absolute value of the coefficient in the term to be used for eliminating, as the pivotal equation. This is called partial pivoting. This procedure as an attempt to avoid problems with equations that may become ill conditioned in the course if performing the elimination. The equations are ill conditioned when a small change in the coefficients of the equations causes a large change in the values of the solutions, and in such equations rounding errors can become large and cause significant inaccuracies in the solutions. We have not performed partial pivoting in these examples as they are only presented to give an idea of the method. It is assumed that for any real life problem a computer algorithm will be used to solve the system of equations, and such an algorithm will incorporate partial pivoting.

13.6 The inverse and determinant of a 3 × 3 matrix

Finding the inverse by elimination

To find the inverse using elimination, we write the matrix we need to invert on the left and the unit matrix on the right. We perform operations on both matrices at the same time. The method, called Gauss–Jordan elimination, begins in the same way as Gauss elimination. When we have upper triangular form for the matrix we metaphorically turn the problem upside down and eliminate the upper triangle also.

Example 13.28

Find the inverse of

$(\begin{matrix} 4 & 0 & - 4 \\ 3 & 4 & 2 \\ - 1 & - 1 & 1 \end{matrix}) .$

si199_e

Solution We start by writing the matrix along with the unit matrix

$(\begin{array}{l} 4 & 0 & - 4 & 1 & 0 & 0 \\ 3 & 4 & 2 & 0 & 1 & 0 \\ - 1 & - 1 & 1 & 0 & 0 & 1 \end{array}) .$

si200_e

Stage 1

Step 1: Divide the first row by a₁₁

$(\begin{array}{l} 1 & 0 & - 1 & 0.25 & 0 & 0 \\ 3 & 4 & 2 & 0 & 1 & 0 \\ - 1 & - 1 & 1 & 0 & 0 & 1 \end{array}) .$

si201_e

Step 2: Eliminate the first column below a₁₁ by subtracting multiples of the first row from the second and third rows

$(\begin{array}{l} 1 & 0 & - 1 & 0.25 & 0 & 0 \\ 0 & 4 & 5 & - 0.75 & 1 & 0 \\ 0 & - 1 & 0 & 0.25 & 0 & 1 \end{array}) .$

si202_e

The calculations were as follows:

Row 2–3 × Row 1

Row 3 – (−1) × Row 1

Stage 2

Step 1: Divide the second row by a₂₂(4)

$(\begin{array}{l} 1 & 0 & - 1 & 0.25 & 0 & 0 \\ 0 & 1 & 1.25 & - 0.1875 & 0.25 & 0 \\ 0 & - 1 & 0 & 0.25 & 0 & 1 \end{array}) .$

si203_e

Step 2: Eliminate the elements in the second column below a₂₂ by subtracting multiples of the second row from the third row

$(\begin{array}{l} 1 & 0 & - 1 & 0.25 & 0 & 0 \\ 0 & 1 & 1.25 & - 0.1875 & 0.25 & 0 \\ 0 & 0 & 1.25 & 0.0625 & 0.25 & 1 \end{array}) .$

si204_e

The calculations were as follows:

Row 3 – (−1) × Row 2

Stage 3

Step 1: Divide the third row by a₃₃ (1.25):

$(\begin{array}{l} 1 & 0 & - 1 & 0.25 & 0 & 0 \\ 0 & 1 & 1.25 & - 0.1875 & 0.25 & 0 \\ 0 & 0 & 1 & 0.05 & 0.2 & 0.08 \end{array})$

si205_e

Step 2: Turn the problem metaphorically upside down and use the third row to eliminate elements in the third column above a₃₃ by subtracting multiples of the third row from the first row and the second row

$(\begin{array}{l} 1 & 0 & 0 & 0.3 & 0.2 & 0.8 \\ 0 & 1 & 0 & - 0.25 & 0 & - 1 \\ 0 & 0 & 1 & 0.05 & 0.2 & 0.8 \end{array})$

si206_e

The calculations were as follows:

Row 1 – (−1) × Row 3

Row 2 – 1.25 × Row 3

The matrix on the right-hand side is now the inverse of the original matrix.

The inverse is

$(\begin{array}{l} 0.3 & 0.2 & 0.8 \\ - 0.25 & 0 & - 1 \\ 0.05 & 0.2 & 0.8 \end{array})$

si207_e

Check: Multiply the original matrix by its inverse

$\begin{array}{l} (\begin{array}{l} 4 & 0 & - 4 \\ 3 & 4 & 2 \\ - 1 & - 1 & 1 \end{array}) (\begin{array}{l} 0.3 & 0.2 & 0.8 \\ - 0.25 & 0 & - 1 \\ 0.05 & 0.2 & 0.8 \end{array}) \\ ? ? ? ? ? ? ? ? = (\begin{array}{l} 4 (0 .3) + 0 (- 0.25) - 4 (0.05) & 4 (0.2) + 0 (0) - 4 (0.2) & 4 (0.8) + 0 (- 1) - 4 (0.8) \\ 3 (0.3) + 4 (0.25) + 2 (0.05) & 3 (0.2) + 4 (0) + 2 (0.2) & 3 (0.8) + 4 (- 1) + 2 (0.8) \\ - 1 (0.3) - 1 (- 0.25) + 1 (0.05) & - 1 (0.2) - 1 (0) + 1 (0.2) & - 1 (0.8) - 1 (- 1) + 1 (0.8) \end{array}) \\ ? ? ? ? ? ? ? ? = (\begin{array}{l} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array}) \end{array}$

si208_e

Therefore we have correctly found the inverse of the matrix.

The determinant of a 3 × 3 matrix

The definition of the (2 × 2) determinant has been given as

$| \begin{array}{l} a_{1} & b_{1} \\ a_{2} & b_{2} \end{array} | = a_{1} b_{2} - a_{2} b_{1} .$

si209_e

Each of the terms on the right-hand side of this definition is of the form a_i, b_j where i and j are different choices of the numbers 1 and 2. We can define higher order determinants by using ideas of permutations. We notice that the term a₁b₂ above has a positive sign because the indices 1 and 2 appear in order, whereas the term a₂b₁ has a negative sign because the indices 2,1 are reversed.

To define

$| \begin{array}{l} a_{1} & b_{1} & c_{1} \\ a_{2} & b_{2} & c_{2} \\ a_{3} & b_{3} & c_{3} \end{array} |$

si210_e

we write down all terms of the form a_i b _jc_k and give each term a + sign or a – sign depending on whether the permutation ijk is even or odd. A permutation of 123 is even if it can be achieved by an even number of swaps of the numbers, beginning with the order 123. If it can only be obtained by an odd number of swaps then the permutation is odd. For example, 231 is even because we can reach it by first swapping 1 and 2 giving 213 and then swapping 1 and 3. Alternatively, we could have interchanged 2 and 3 giving 132 and 1 and 3 giving 312, 2 and 1 giving 321,3 and 2 giving 231. Whatever way we use to get to the order 231 involves an even number of steps. Similarly we say that a permutation of 123 is odd if it involves an odd number of adjacent interchanges.

This definition gives the determinant of a 3 × 3 array as

$| \begin{array}{l} a_{1} & b_{1} & c_{1} \\ a_{2} & b_{2} & c_{2} \\ a_{3} & b_{3} & c_{3} \end{array} | = a_{1} b_{2} c_{3} - a_{1} b_{3} c_{2} - a_{2} b_{1} c_{3} + a_{2} b_{3} c_{1} + a_{3} b_{1} c_{2} - a_{3} b_{2} c_{1}$

si211_e

This expression may be written in such a way that it involves 2 × 2 determinants as follows:

$\begin{array}{l} a_{1} b_{2} c_{3} - a_{1} b_{3} c_{2} - a_{2} b_{1} c_{3} + a_{3} b_{1} c_{2} + a_{2} b_{3} c_{1} - a_{3} b_{2} c_{1} \\ ? ? ? ? ? ? ? ? = a_{1} (b_{2} c_{3} - b_{3} c_{2}) - b_{1} (a_{2} c_{3} - a_{3} c_{1}) + c_{1} (a_{2} b_{3} - a_{3} b_{2}) \\ ? ? ? ? ? ? ? ? = a_{1} | \begin{array}{l} b_{2} & c_{2} \\ b_{3} & c_{3} \end{array} | - b_{1} | \begin{array}{l} a_{2} & c_{2} \\ a_{3} & c_{3} \end{array} | + c_{1} | \begin{array}{l} a_{2} & b_{2} \\ a_{3} & b_{3} \end{array} | . \end{array}$

si212_e

The 2 × 2 determinants that appear in this expression are called minors. This formula for the determinant is called the expansion by the first row, because the numbers a₁, b₁, c₁ which multiply the minors are from the first row of the matrix.

Note that the minor multiplying a₁ is the (2 × 2) determinant obtained from the original array by crossing out the row and column in which a_l appears, as follows

gives the minor of a₁as

$| \begin{array}{l} b_{2} & c_{2} \\ b_{3} & c_{3} \end{array} | .$

si213_e

Similarly, the number multiplying b₁ is the determinant found by crossing out the row and the column in which b₁ appears.

We could also find the determinant by expanding about the first column

$\begin{array}{l} a_{1} b_{2} c_{3} - a_{1} b_{3} c_{2} - a_{2} b_{1} c_{3} + a_{2} b_{3} c_{1} + a_{3} b_{1} c_{2} - a_{3} b_{2} c_{1} \\ ? ? ? ? ? ? ? ? = a_{1} (b_{2} c_{3} - b_{3} c_{2}) - a_{2} (b_{1} c_{3} - b_{3} c_{1}) + a_{3} (b_{1} c_{2} - b_{2} c_{1}) \\ ? ? ? ? ? ? ? ? = a_{1} | \begin{array}{l} b_{2} & c_{2} \\ b_{3} & c_{3} \end{array} | - a_{2} | \begin{array}{l} b_{1} & c_{1} \\ b_{3} & c_{3} \end{array} | + a_{3} | \begin{array}{l} b_{1} & c_{1} \\ b_{2} & c_{2} \end{array} | . \end{array}$

si214_e

Again we see that the minor of a₂, for instance, can be found by crossing out the row and column that a₂appears in from the original array.

To find the sign multiplying each term in the expansion for the determinant we can remember the following pattern

$\begin{array}{l} + & - & + \\ - & + & - \\ + & - & + \end{array}$

si215_e

To find the determinant we can expand about any row and column, multiplying each term a_ijby its respective minor and find the sign by multiplying by (−l)ⁱ⁺^j.

Example 13.29

Find the following determinant

$| \begin{array}{l} - 1 & 2 & 3 \\ 6 & - 1 & 2 \\ 4 & 0 & - 1 \end{array} |$

si216_e

Solution Expanding about the first row

$\begin{matrix} | \begin{array}{l} - 1 & 2 & 3 \\ 6 & - 1 & 2 \\ 4 & 0 & - 1 \end{array} | = - 1 | \begin{array}{l} - 1 & 2 \\ 0 & - 1 \end{array} | - 2 | \begin{array}{l} 6 & 2 \\ 4 & - 1 \end{array} | + 3 | \begin{array}{l} 6 & - 1 \\ 4 & 0 \end{array} | \\ = - 1 (1 - 0) - 2 (- 6 - 8) + 3 (0 + 4) \\ = - 1 + 28 + 12 = 39. \end{matrix}$

si217_e

Alternatively, expanding about the first column we get

$\begin{matrix} - 1 | \begin{array}{l} - 1 & 2 \\ 0 & - 1 \end{array} | - 6 | \begin{array}{l} 2 & 3 \\ 0 & - 1 \end{array} | + 4 | \begin{array}{l} 2 & 3 \\ - 1 & 2 \end{array} | \\ = - 1 (1 - 0) - 6 (- 2 - 0) + 4 (4 - (- 3)) \\ = - 1 + 12 + 28 = 39. \end{matrix}$

si218_e

The inverse of a matrix using (Adjoint(A))/|A|

We have already seen how to find the inverse of a matrix by using elimination. It is also possible to find the inverse by the following procedure:

(1) Find the matrix of minors.

(2) Multiply the minor for row i and column j by (−l)ⁱ⁺^j. This is then called the matrix of cofactors.

(3) Take the transpose of the matrix of cofactors to find the adjoint matrix.

(4) Divide by the determinant of the original matrix.

This procedure rarely needs to be used and only usually if we have a matrix which involves some unknown variables or expresses some formula and we would like to find the inverse formula. It would never be used as a numerical procedure, as it is both numerically unstable and also uses a very large number of operations (of the order of n! operations, where n is the dimension of the matrix, whereas elimination is only of the order of n³).

Example 13.30

Find the inverse of

$(\begin{matrix} 4 & 0 & - 4 \\ 3 & 4 & 2 \\ - 1 & - 1 & 1 \end{matrix})$

si219_e

using A⁻¹ =(Adjoint(A))/|A|.

Solution Find the matrix of minors for each term in the matrix. The minor for the ith row and jth column is found by crossing out that row and column and finding the determinant of the remaining elements.

This gives the matrix of minors as

$(\begin{matrix} | \begin{matrix} 4 & 2 \\ - 1 & 1 \end{matrix} | & | \begin{matrix} 3 & 2 \\ - 1 & 1 \end{matrix} | & | \begin{matrix} 3 & 4 \\ - 1 & - 1 \end{matrix} | \\ | \begin{matrix} 0 & - 4 \\ - 1 & 1 \end{matrix} | & | \begin{matrix} 4 & - 4 \\ - 1 & 1 \end{matrix} | & | \begin{matrix} 4 & 0 \\ - 1 & - 1 \end{matrix} | \\ | \begin{matrix} 0 & - 4 \\ 4 & 2 \end{matrix} | & | \begin{matrix} 4 & - 4 \\ 3 & 2 \end{matrix} | & | \begin{matrix} 4 & 0 \\ 3 & 4 \end{matrix} | \end{matrix}) = (\begin{matrix} 6 & 5 & 1 \\ - 4 & 0 & - 4 \\ 16 & 20 & 16 \end{matrix}) .$

si220_e

To find the matrix of cofactors we multiply by the pattern

$\begin{matrix} + & - & + \\ - & + & - \\ + & - & + \end{matrix}$

si221_e

giving

$(\begin{matrix} 6 & - 5 & 1 \\ 4 & 0 & 4 \\ 16 & - 20 & 16 \end{matrix}) .$

si222_e

To find the adjoint, we take the transpose of the above, giving

$(\begin{matrix} 6 & 4 & 16 \\ - 5 & 0 & - 20 \\ 1 & 4 & 16 \end{matrix}) .$

si223_e

Now we find the determinant – expanding about the first row, this gives

$4 (4 - (- 2)) - 0 (3 (1) - 2 (- 1)) - 4 (3 (- 1) - (- 1) (4)) = 20$

Finally, we divide the adjoint by the determinant to find the inverse giving

$\frac{1}{20} (\begin{matrix} 6 & 4 & 16 \\ - 5 & 0 & - 20 \\ 1 & 4 & 16 \end{matrix}) = (\begin{matrix} 0.3 & 0.2 & 0.8 \\ - 0.25 & 0 & - 1 \\ 0.05 & 0.2 & 0.8 \end{matrix}) .$

si225_e

Check: To check that the calculation is correct, we multiply the original matrix by the inverse. If we get the unit matrix as the result we can conclude that we have indeed found the inverse.

$(\begin{matrix} 4 & 0 & - 4 \\ 3 & 4 & 2 \\ - 1 & - 1 & 1 \end{matrix}) (\begin{matrix} 0.3 & 0.2 & 0.8 \\ - 0.25 & 0 & - 1 \\ 0.05 & 0.2 & 0.8 \end{matrix}) = (\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix})$

si226_e

which is correct.

13.7 Eigenvectors and eigenvalues

In Example 13.17, we looked at the problem of scaling along the line x =y and we saw that the matrix

$(\begin{matrix} \frac{3}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} \end{matrix})$

si227_e

represents a scaling along the line y = x and it leaves points along the line y = –x unchanged. This means that any vector in the direction (1,1) will simply be multiplied by 2 and any vector in the direction (–1,1) will remain unchanged after multiplication by this matrix. Other vectors will undergo a mixed effect.

Supposing we know that a matrix A represents a scaling but without knowing the direction of the scaling or by how much it scales. Is there any way we can find that direction and the scaling constant?

The problem then is to find a vector v which is simply scaled by some currently unknown amount λ when multiplied by A, and v must be such that

$A v ? = ? λ v$

If we manage to find values of λ and v we call these the eigenvalues and eigenvectors of the matrix A.

We shall solve this for

$(\begin{matrix} \frac{3}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} \end{matrix})$

si229_e

as we know the result that we expect to get.

Example 13.31

Find λ and v such that Av = λ v where

$A ? = ? (\begin{matrix} \frac{3}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} \end{matrix}) .$

si230_e

Solution Subtract λv from both sides of the equation

$A v ? = λ v ? ? ? ? ? ? ? \Leftrightarrow ? ? ? ? ? ? ? ? A v - λ v = 0 ? ? ? ? ? ? ? ? ? (\begin{matrix} \frac{3}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} \end{matrix}) v - λ v = 0$

si231_e

We put in the unit matrix as v = Iv and combine the terms.

$\begin{array}{l} (\begin{matrix} \frac{3}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} \end{matrix}) v - λ (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) v = (\begin{array}{l} 0 \\ 0 \end{array}) \\ ? ? ? ? ? ? ? ? ? ? ? (\begin{matrix} \frac{3}{2} - λ & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} - λ \end{matrix}) v = (\begin{array}{l} 0 \\ 0 \end{array}) . \end{array}$

si232_e

Now substitute

$v = (\begin{array}{l} x \\ y \end{array})$

si233_e

giving

$\begin{array}{l} (\frac{3}{2} - λ) x + \frac{1}{2} y = 0 \\ \frac{1}{2} x + (\frac{3}{2} - λ) y = 0. \end{array}$

si234_e

Unfortunately, the solution to this gives x = 0 and y = 0, which is not very enlightening (it is called the trivial solution).

However, we started by saying that we wanted to find the direction in which this matrix scaled any vector. That is, we want to find a whole line of solutions. We can use a result that we found from solving systems of equations. The equations may have a whole line of solutions if the determinant of the coefficients is 0.

Hence, we need to find λ such that

$| \begin{matrix} \frac{3}{2} - λ & \frac{1}{2} \\ \frac{1}{2} & \frac{3}{2} - λ \end{matrix} | = 0.$

si235_e

Expanding the determinant gives

$\begin{array}{l} (\frac{3}{2} - λ) (\frac{3}{2} - λ) - \frac{1}{4} = 0 \\ \frac{9}{4} - 3 λ + λ^{2} - \frac{1}{4} = 0 \Leftrightarrow λ^{2} - 3 λ + 2 = 0. \end{array}$

si236_e

This factorizes to

$(λ - 2) (λ - 1) = 0 ? ? ? ? \Leftrightarrow ? ? ? ? λ = 1 \lor λ = 1.$

Hence, the eigenvalues of the matrix A are 1 and 2. To find the vectors which go with each of these eigenvalues we substitute into the equations

$\begin{array}{l} (\frac{3}{2} - λ) x + \frac{1}{2} y = 0 \\ \frac{1}{2} x + (\frac{3}{2} - λ) y = 0. \end{array}$

si238_e

For λ = 2

$\begin{array}{l} - \frac{1}{2} x + \frac{1}{2} y = 0 \\ \frac{1}{2} x - \frac{1}{2} y = 0. \end{array}$

si239_e

We notice that these equations are dependent, which we would have expected as by setting the determinant = 0 we were looking for an undetermined system.

We have

$- \frac{1}{2} x + \frac{1}{2} y ? = 0 ? ? ? ? ? ? \Leftrightarrow ? ? ? ? ? ? x = y .$

si240_e

This means that any vector (x, y) where y = x will be scaled by 2 if multiplied by the matrix A. The eigenvector can be given as (1,1) as it is only necessary to indicate the direction.

The other eigenvalue, λ = 1, gives

$\begin{array}{l} \frac{1}{2} x + \frac{1}{2} y ? = 0 \\ \frac{1}{2} x + \frac{1}{2} y ? = 0. \end{array}$

si241_e

Again, the equations are dependent and we have x + y = 0. The eigenvector is any vector (x, y) where x = –y so this gives the direction (–1,1).

As not all matrices represent scaling, it is not always possible to find real eigenvalues. In particular, a rotation matrix has no real eigenvalues for angle of rotation, θ ≠ 0. Another point to note is that in this example the eigenvectors were at right angles to each other. This is only true for symmetric matrices (which we had in this case).

The method can be summarized as follows: To find the eigenvalues and eigenvectors of A

(1) Solve |A – λI| = 0 to find the eigenvalues. This is called the characteristic equation.

(2) For each value of λ found, substitute into (A – λI)v = 0 and find v. This will be an undetermined system so we shall find at least a whole line of solutions. Choose any vector lying in the direction of the line.

Example 13.32

Find the eigenvectors and eigenvalues of

$(\begin{matrix} 1 & 3 \\ 2 & - 4 \end{matrix}) .$

si242_e

Solution Solve |A – λI| = 0, which gives

$\begin{matrix} | \begin{matrix} 1 ? - λ & 3 \\ 2 & - 4 - λ \end{matrix} | & = 0 \\ \Leftrightarrow (1 - λ) (- 4 - λ) - 6 & = 0 \\ \Leftrightarrow λ^{2} + 3 λ - 10 & = 0 \\ \Leftrightarrow (λ + 5) (λ - 2) & = 0 \Leftrightarrow λ = - 5 ? o r ? λ = 2. \end{matrix}$

si243_e

For each value of λ solve

$(\begin{matrix} 1 ? - λ & 3 \\ 2 & - 4 - λ \end{matrix}) (\begin{array}{l} x \\ y \end{array}) = (\begin{array}{l} 0 \\ 0 \end{array}) .$

si244_e

For λ = −5, this gives

$(\begin{matrix} 6 & 3 \\ 2 & 1 \end{matrix}) (\begin{array}{l} x \\ y \end{array}) = (\begin{array}{l} 0 \\ 0 \end{array}) ? ? ? ? ? ? ? ? ? ? \Rightarrow ? ? ? ? ? ? ? ? ? ? \begin{matrix} 6 x + 3 y = 0 \\ 2 x + y = 0 \end{matrix}$

si245_e

We see that these equations are dependent. Solving the first one

$\begin{matrix} 6 x + 3 y = 0 \\ \begin{array}{l} \Leftrightarrow ? ? ? ? ? ? 2 x + y = 0 \\ y = - 2 x \end{array} \end{matrix}$

si246_e

The vector is therefore (x, y) where y = −2x, giving (x, −2x). We only need the direction of the vector so choose (1, −2) by substituting x = 1

For λ = 2 we get

$(\begin{matrix} - 1 & 3 \\ 2 & - 6 \end{matrix}) (\begin{array}{l} x \\ y \end{array}) = (\begin{array}{l} 0 \\ 0 \end{array}) ? \Rightarrow \begin{matrix} - x + 3 y = 0 \\ 2 x - 6 y = 0 \end{matrix}$

si247_e

Solving

$- x + 3 y = 0 ? ? ? ? ? \Leftrightarrow ? ? ? ? ? x = 3 y$

Hence, we have (x, y) where x =3 y giving (3 y, y). Substitute y = 1 giving the vector as (3,1).

We have shown that the matrix

$(\begin{matrix} 1 & 3 \\ 2 & - 4 \end{matrix})$

si249_e

has eigenvalue −5 with eigenvector (1, −2) and eigenvalue 2 with eigenvector (3,1).

13.8 Least squares data fitting

Matrix methods can be employed to the problem of finding the ‘best fit’ line through a set of data. In Chapter 2 we performed a fit by eye to a set of data points. We drew a scatter diagram of the data and if the data appeared, more or less, to fit on a line then we would draw the line by hand and then use any two points lying on the line, to find the equation of the line. We do not expect experimental data to be exact and that is the reason that the data points do not lie exactly on a line. We shall now look at the method of least squares, which can be used to compute the ‘best fit’ line. The method is called ‘least squares’ because it minimizes the squared error between the data points and the equation of the line found. The method is justified in the following example.

We start with two sets of data which we suspect are related linearly.

Example 13.33

A student was late for a particularly interesting engineering maths lecture and was therefore walking briskly toward the lecture hall in a straight line at approximately constant speed b. The student's position x (metres) at time t (seconds) is given by

We can plot these points on a scatter diagram, as in Figure 13.18.

f13-18-9780750658553 — Figure 13.18 A scatter diagram of the data for Example 13.33.

We can see that they lie on an approximate straight line. We need to decide which is the ‘best’ straight line to draw. One way is to fit the straight line, y = a + bx, to a set of data points (x_i, y_i) so that the sum of the squares of the vertical distances of the points from the straight line drawn is minimum. This is illustrated in Figure 13.19.

f13-19-9780750658553 — Figure 13.19 Vertical distances between the data points and the line represent the error for each point. The method of least squares minimises the sum of the squares of these errors.

In order to fit the line y = a +bx, we can vary the values of a and b until it satisfies our condition for minimum squared error. If the data points are (x_i, y_i), then the sum of the squares of the errors is given by

$E ? = ? \sum_{i} {(y_{i} - a - b x_{i})}^{2}$

We can vary a and b to make this a minimum. However, E is a function of two variables a and b. We know how to find the minima or maxima with respect to one variable, which we looked at in Chapter 11. We shall look in more detail at functions of two variables in Chapter 17. To minimize with respect to two variables we begin by differentiating with respect to each variable in turn keeping the other variable constant. This is called partial differentiation. The ideas used in finding stationary values are the same as that for one variable although the method of distinguishing between types of stationary values is slightly more involved because the function represents a two-dimensional surface drawn in three dimensions rather than a simple curve. A partial derivative is indicated by using a curly d, ∂, for the derivative and ∂ E/∂a is read as ‘partial dE by da’

$\begin{array}{l} \frac{\partial E}{\partial a} ? = ? - 2 \sum_{i} (y_{i} - a - b x_{i}) \\ \frac{\partial E}{\partial b} ? = ? - 2 \sum_{i} x_{i} (t_{i} - a - b x_{i}) \end{array}$

si251_e

For a minimum value we must have ∂E/∂a = 0 and ∂E/∂b = 0 giving

$\begin{matrix} - 2 \sum_{i} (y_{i} - a - b x_{i}) = 0 \Leftrightarrow \sum_{i} y_{i} - \sum_{i} a - b \sum_{i} x_{i} = 0 \\ - 2 \sum_{i} x_{i} (y_{i} - a - b x_{i}) = 0 \Leftrightarrow \sum_{i} x_{i} y_{i} - a \sum_{i} x_{i} - b \sum_{i} x_{i}^{2} = 0 \end{matrix}$

si252_e

Finally, we get the normal equations

$\begin{array}{l} a n + b \sum_{i} x_{i} = \sum_{i} y_{i} \\ a \sum_{i} x_{i} + b \sum_{i} x_{i}^{2} = \sum_{i} x_{i} y_{i} \end{array}$

si253_e

where n is the number of data points.

Here, we have not attempted to justify that this is actually a minimum point (we have only shown it to give a stationary point). We can now illustrate the method for finding the values of a and b which minimize the sum of the squared errors. At the start of this example we had a set of data which we wish to fit to a function x = a+bt where the dependent variable is x and the independent variable is t. We wish to find the values of a and b so that x = a + bt gives a least squares fit to the data. The number of data points is 6, so we have as the normal equations:

$\begin{array}{r} 6 a + b Σ_{i = 1}^{6} t_{i} = Σ_{i = 1}^{6} x_{i} \\ a Σ_{i = 1}^{6} t_{i} + b Σ_{i = 1}^{6} t_{i}^{2} = Σ_{i = 1}^{6} t_{i} x_{i} \end{array}$

si254_e

Make a table from the data, as in Table 13.1.

Table 13.1

A table made from the data of Example 13.32

t	x	t²	tx
0	100	0	0
5	111	25	555
10	119	100	1190
15	132	225	1980
20	151	625	3775
75	753	1375	10 300

cetable1

Then the normal equations become:

$\begin{matrix} 6 a + 75 b = 753 \\ 75 a + 1375 b = 10300 \end{matrix}$

si255_e

Solving

$\begin{matrix} 450 a + 5625 b = 56475 \\ 450 a + 8250 b = 61800 \\ 2625 b = 5325 \end{matrix}$

si256_e

$\begin{array}{l} \Rightarrow b \approx 2.03 \\ 6 a + 75 (2.03) = 754 \Rightarrow a \approx 100.14 \end{array}$

si257_e

We have a = 100.14 and b = 2.03. Hence, the line of best fit is

$x = 100.14 + 2.03 t .$

Curve fitting

The same method for fitting a straight line can be generalized to fit any polynomial. For example, it could appear that our data would be better fitted to a parabola.

$y = b_{0} + b_{1} x + b_{2} x^{2}$

The normal equations in this case are

$\begin{matrix} b_{0} n + b_{1} \sum_{i} x_{i} + b_{2} \sum_{i} x_{i}^{2} = \sum_{i} y_{i} \\ b_{0} \sum_{i} x_{i} + b_{1} \sum_{i} x_{i}^{2} + b_{2} \sum_{i} x_{i}^{3} = \sum_{i} x_{i} y_{i} \\ b_{0} \sum_{i} x_{i}^{2} + b_{1} \sum_{i} x_{i}^{3} + b_{2} \sum_{i} x_{i}^{4} = \sum_{i} x_{i}^{2} y_{i} \end{matrix}$

si260_e

We can solve these system of equations using Gaussian elimination.

Example 13.34

Find the best fit parabola by the method of least squares for (0,3) (1,1) (2,0) (4,1) (6,4).

Solution The data are given in Table 13.2. The normal equations are:

$\begin{matrix} 5 b_{0} + 13 b_{1} + 57 b_{2} = 9 \\ 13 b_{0} + 57 b_{1} + 289 b_{2} = 29 \\ 57 b_{0} + 289 b_{1} + 1569 b_{2} = 161 \end{matrix}$

si261_e

Table 13.2

Table made from the data of Example 13.34

x	y	x²	x³	x⁴	xy	x²y
0	3	0	0	0	0
1	1	1	1	1	1	1
2	0	4	8	16	0	0
4	1	16	64	256	4	16
6	4	36	216	1296	24	144
13	9	57	289	1569	29	161

cetable2

Solving these using Gauss elimination gives

$(\begin{matrix} 5 & 13 & 57 & 9 \\ 13 & 57 & 289 & 29 \\ 57 & 289 & 1569 & 161 \end{matrix})$

si262_e

Stage 1

$(\begin{matrix} 1 & 2.6 & 11.4 & 1.8 \\ 13 & 57 & 289 & 29 \\ 57 & 289 & 1569 & 161 \end{matrix})$

si263_e

$(\begin{matrix} 1 & 2.6 & 11.4 & 1.8 \\ 0 & 23.2 & 140.8 & 5.6 \\ 0 & 140.8 & 919.2 & 58.4 \end{matrix})$

si264_e

Stage 2

$\begin{array}{l} (\begin{matrix} 1 & 2.6 & 11.4 & 1.8 \\ 0 & 1 & 6.069 & 0.241 \\ 0 & 140.8 & 919.2 & 58.4 \end{matrix}) \\ (\begin{matrix} 1 & 2.6 & 11.4 & 1.8 \\ 0 & 1 & 6.069 & 0.241 \\ 0 & 0 & 64.685 & 24.467 \end{matrix}) \end{array}$

si265_e

Stage 3

$\begin{array}{l} (\begin{matrix} 1 & 2.6 & 11.4 & 1.8 \\ 0 & 1 & 6.069 & 0.241 \\ 0 & 0 & 1 & 0.378 \end{matrix}) \\ B a c k - s u b s t i t u t i o n \\ \begin{matrix} b_{0} + 2.6 b_{1} + 11.4 b_{2} = 1.8 \\ b_{1} + 6.069 b_{2} = 0.241 \\ b_{2} = 0.378 \end{matrix} \end{array}$

si266_e

gives b₂ = 0.378, b₁ = −2.054, and b₀= 2.831.

Hence, the best fit parabola is y = 2.831 – 2.054x + 0.378x².

13.9 Summary

1. Matrices are used to represent information in a way suitable for use by a computer. They can represent, among other things, systems of linear equations, transformations, and networks. A matrix is a rectangular array of numbers of dimension m × n where m is the number of rows and n is the number of columns.

2. To add or subtract matrices add or subtract each corresponding element. The matrices must be of exactly the same dimension. To multiply two matrices, C = AB, the number of columns in matrix A must equal the number of rows in matrix B. The i, jth element of C is found by multiplying the ith row of A by the jth column of B.

3. The unit matrix, I, leaves any matrix unchanged under multiplication.

$\begin{array}{l} I = ? (\begin{matrix} 1 & 0 \\ 1 & 0 \end{matrix}) (2 ? dim e n s i o n s) \\ I = ? (\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}) (3 ? dim e n s i o n s) \end{array}$

si267_e

IA = AI = A where A is any matrix.

4. The inverse of a matrix is represented by A⁻¹ and can be found for square, non-singular matrices. A matrix is singular if its determinant is 0.

5. The 2 × 2 determinant is defined by

$| \begin{matrix} a & b \\ c & d \end{matrix} | = a d - c b .$

si268_e

6. The inverse of a 2 × 2 non-singular matrix $(\begin{matrix} a & b \\ c & d \end{matrix})$ si269_e is

$\frac{1}{(a d - c b)} (\begin{matrix} d & - b \\ - c & a \end{matrix}) .$

si270_e

7. Transformations of the plane ² can be defined using vectors and matrices as in Section 13.13.

8. Systems of linear equations may be determined (a single solution), indeterminate (many solutions) or inconsistent (no solutions).

9. Systems of linear equations can be solved using Gaussian elimination.

10. The inverse of a matrix, if it exists, can be found using Gauss–Jordan elimination.

11. The determinant of a 3 × 3 matrix may be found by expanding about any row or column, where

$| \begin{matrix} a_{1} & b_{1} & c_{1} \\ a_{2} & b_{2} & c_{2} \\ a_{3} & b_{3} & c_{3} \end{matrix} | = a_{1} | \begin{matrix} b_{2} & c_{2} \\ b_{3} & c_{3} \end{matrix} | - b_{1} | \begin{matrix} a_{2} & c_{2} \\ a_{3} & c_{3} \end{matrix} | + c_{1} | \begin{matrix} a_{2} & b_{2} \\ a_{3} & b_{3} \end{matrix} |$

si271_e

gives the expansion about the first row.

12. The inverse of a non-singular matrix can be found using

$A^{- 1} = (1 / | A |) (A d j o int (A)) .$

13. The eigenvalues and eigenvectors of a matrix A are the values of and v such that Av = λv.

14. The method of least squares is used to fit a line or a curve through experimental data in such a way as the sum of the square of the errors is a minimum.

13.10 Exercises

13.1

$\begin{matrix} A = (\begin{matrix} 2 & 3 \\ 0 & - 4 \\ 0 & 2 \end{matrix}) ? ? ? ? ? ? ? B = (\begin{matrix} 2 & - 1 & 0 \\ 1 & 6 & 1 \\ 0 & 3 & 4 \end{matrix}) \\ C = (4 ? ? 0 ? ? - 1) ? ? ? ? ? ? ? D = (\begin{matrix} 0 & 3 \\ - 1 & 4 \end{matrix}) \\ E = ? (\begin{matrix} 3 & 2 \\ - 1 & \frac{- 2}{3} \end{matrix}) \end{matrix}$

si273_e

Find the following, where possible

(a) A

(b) A^TB

(d) BA

(e) C^T

(f) AC

(g) CA

(h) CB

(i) 3CA

(j) D+E

(k) 3D− $\frac{1}{2}$ E

(l) $\frac{1}{2}$ B+A

(m) B²

(n) E³

(o) AC²

(p) A^TBC^T.

13.2 Use

$A = (\begin{matrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{matrix}) ? ? B = (\begin{matrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{matrix}) ? ? C = (\begin{matrix} c_{11} & c_{12} \\ c_{21} & c_{22} \end{matrix})$

to justify the associative law for 2 × 2 matrices

$A (B C) ? = ? (A B) C$

13.3. Find a real square matrix which is both symmetric and skew symmetric.

13.4. A network as in Figure 13.20(a), (b), or (c) can be used to represent an electrical network, as system of one-way streets, or a communication system. An incidence matrix can be defined for a network in the following way (the lines are called arcs and the dots are called vertices).

a_ij = 1 if the arc j is leaving the vertex i

a_ij =–1 if the arc j is entering the vertex i

a_ij = 0 if the arc j does not touch the vertex i

For instance, Figure 13.20(a) has incidence matrix

$Vertex ? ? ? ? ? \begin{matrix} 1 \\ 2 \\ 3 \end{matrix} ? ? ? ? ? (\begin{matrix} a \\ - 1 \\ \begin{array}{l} 1 \\ 0 \end{array} \end{matrix} ? \begin{matrix} Arc \\ b \\ 0 \\ \begin{array}{l} - 1 \\ 1 \end{array} \end{matrix} ? \begin{matrix} c \\ - 1 \\ \begin{array}{l} 0 \\ 1 \end{array} \end{matrix})$

si278_e

(i) Find incidence matrices for the networks in Figure 13.20(b) and (c).

(b) Draw networks that have the following incidence matrices

(i)

$Vertex \begin{matrix} \begin{matrix} Arc \\ \begin{matrix} a & b & c \end{matrix} \end{matrix} \\ \begin{matrix} 1 \\ 2 \\ 3 \\ 4 \end{matrix} (\begin{matrix} - 1 & 1 & 1 \\ 0 & - 1 & 0 \\ 1 & 0 & - 1 \\ 0 & 0 & 0 \end{matrix}) \end{matrix}$

si279_e

(ii)

$Vertex \begin{matrix} \begin{matrix} Arc \\ \begin{matrix} a & b & c & d & e \end{matrix} \end{matrix} \\ \begin{matrix} 1 \\ 2 \\ 3 \end{matrix} (\begin{matrix} - 1 & 0 & - 1 & 1 & 0 \\ 1 & - 1 & 0 & - 1 & 1 \\ 0 & 1 & 1 & 0 & - 1 \end{matrix}) \end{matrix}$

si280_e

f13-20-9780750658553 — Figure 13.20 Networks for Exercise 13.4.

13.5. Represent the following transformations using matrices and vectors. In each case, apply the transformations to A(0,0), B(1,0), C(1,1), and D(0,1) to find their images A′, B′, C′ and D′ and draw your result:

(a) Rotation about the origin through 120º.

(b) Translation by (–4, 1).

(d) Scaling in the y-direction by 5.

(e) Rotation through 120º about the origin followed by a translation by (–2,3).

(f) Translation by (–1,3) followed by rotation through 120º about the origin.

(g) Rotation through 120º about the origin followed by scaling by 5 in the y-direction.

(h) Reflection in the line y = x.

(i) Scaling along a line at a 30º angle to the x-axis, by a factor of 4.

(k) Find inverse transformations for those given in parts (a),(b),(c),(e),(g), and (i) and check in each case that the inverse transformation returns A′, B′, C′, and D′ to A,B,C,D.

13.6. Sketch the following systems of equations and solve them using Gauss elimination. In each case, state whether the system is determined, indeterminate, or inconsistent.

(a) $\begin{array}{l} 4 x + y = 3 \\ - 2 x - y = - 3 \end{array}$ si281_e

(b) $\begin{array}{l} x + y = 3 \\ x - y = 7 \end{array}$

(d) $\begin{array}{l} - 5 x - y = 5 \\ 10 x + 2 y = - 10 \end{array}$

(e) $\begin{array}{l} 3 x + 2 y = - 17 \\ 10 x + y = 0 \end{array}$

(f) $\begin{array}{l} x + 6 y = 4 \\ 3 x + 18 y = 10 \end{array}$

(g) $\begin{array}{l} - 3 x + 2 y = 1 \\ 1.5 x - y = 0.5 \end{array}$

(h) $\begin{array}{l} - 7 x + 8 y = - 10 \\ - 2 x + y = - 8. \end{array}$

13.7. Solve the following using Gauss elimination. In each case state whether the system is determined, indeterminate or inconsistent.

(a) $\begin{array}{l} 2 x - 3 y - z = 4 \\ 5 x + 5 y + 2 z = - 23 \\ x ? ? ? + ? ? ? ? z = - 1 \end{array}$ si289_e

(b) $\begin{array}{l} 4 x - y - 2 z = 40 \\ 3 x + y + 9 z = 5 \\ x - y + z = - 55 \end{array}$ si290_e

(d) $\begin{array}{l} ? x - y + z = 3 \\ x + z = 3 \\ - 4 x - y - 4 z = - 10. \end{array}$ si292_e

13.8. Find inverses of the following matrices A, if they exist, and check that A⁻¹A = I

(a) $(\begin{matrix} 1 & - 2 \\ 0 & 1 \end{matrix})$

(b) $(\begin{matrix} 2 & 3 \\ - 1 & 6 \end{matrix})$

(d) $(\begin{matrix} 1 & - 1 & 0 \\ - 2 & 4 & 2 \\ - 3 & 1 & 2 \end{matrix})$ si296_e

(e) $(\begin{matrix} 0 & 5 & 1 \\ - 1 & 5 & 3 \\ 2 & 0 & 2 \end{matrix})$ si297_e

(f) $(\begin{matrix} 1 & - 1 & 2 \\ 6 & 1 & 3 \\ - 5 & - 2 & - 1 \end{matrix})$ si298_e

13.9. Find the following determinants:

(a) $| \begin{matrix} 1 & 6 \\ 2 & 3 \end{matrix} |$

(b) $| \begin{matrix} 5 & 1 \\ - 6 & 2 \end{matrix} |$ si300_e

(d) $| \begin{matrix} 6 & - 3 & - 2 \\ 1 & - 1 & 8 \\ 0 & - 1 & 0 \end{matrix} | .$ si302_e

13.10. The vector product of two three-dimensional vectors can be defined using a determinant as follows:

$(a_{1}, ? a_{2,} ? a_{3}) \times (b_{1}, b_{2}, b_{3}) = | \begin{matrix} i & j & k \\ a_{1} & a_{2} & a_{3} \\ b_{1} & b_{2} & b_{3} \end{matrix} |$

si303_e

where i, j, and k are the unit vectors in the x, y, and z directions, respectively. Use this definition to find the following:

(a) (1,3,6) × (–1,2,2)

(b) ( $\frac{1}{2}, \frac{1}{2}$ , −1) × (0,0,3).

13.11. The scalar triple produce of three vectors a, b, c, given by a. (b × c) can be found using a determinant as follows:

$\begin{array}{l} (a_{1}, ? a_{2,} ? a_{3}) \cdot ((b_{1}, b_{2}, b_{3}) \times (c_{1}, ? c_{2}, ? c_{3})) \\ ? ? ? ? ? ? ? = | \begin{matrix} a_{1} & a_{2} & a_{3} \\ b_{1} & b_{2} & b_{3} \\ c_{1} & c_{2} & c_{3} \end{matrix} | \end{array}$

si305_e

The absolute value of this can be interpreted as the volume of the parallelepiped which has a, b, c as its adjacent edges.

(a) Find the volume of the parallelepiped with adjacent edges given by the vectors

(i) (1,0, −3), (0,1,1), and (3,0,1)

(ii) (1, −2,2), (3,2, −1), and (2, 1, 1)

(b) Explain why in case (a), (ii) you could conclude that the three vectors lie in the same plane.

13.12 In a homogeneous, isotropic and linearly elastic material it is found that the strains on a section of the material, represented by ε_x, ε_y, and ε_z for the x-, y-, and z-directions, respectively, can be related to the stresses, (σ_x, σ_y, and σ_zby the following matrix equation.

$(\begin{array}{l} ɛ_{x} \\ ɛ_{y} \\ ɛ_{z} \end{array}) = \frac{1}{E} (\begin{matrix} 1 & - ν & - ν \\ - ν & 1 & - ν \\ - ν & - ν & 1 \end{matrix}) (\begin{array}{l} σ_{x} \\ σ_{y} \\ σ_{z} \end{array})$

si306_e

where E is the modulus of elasticity (also called Young's modulus) and v is Poisson's ratio which relates the lateral and axial strains. Find (σ_x, σ_y, σ_z, in terms of ε_x, ε_y, and ε_zand express the relationship in matrix form.

13.13. Find eigenvalues and eigenvectors of the following:

(a)

$(\begin{matrix} 6 & - 1 \\ 0 & 2 \end{matrix})$

si307_e

(b)

$(\begin{matrix} 1 & 5 \\ 1 & - 3 \end{matrix})$

si308_e

(c)

$(\begin{matrix} 3 & 2 \\ 1 & 2 \end{matrix}) .$

si309_e

13.14. A simple circuit comprising a variable voltage V_s, a diode and a resistor is shown in Figure 13.21. As V_s is varied, values of V_R and I are recorded. The values are given in Table 13.3.
Using the method of least squares, determine Z and V_D in the equation V_R = ZI + V_D.

f13-21-9780750658553 — Figure 13.21 A simple circuit with a variable voltage V_s (Exercise 13.14).

Table 13.3

Voltage (V_R) against current (I) for Exercise 13.14

V_R (V)	I (A)
2	0.18
4	0.58
6	0.98
10	1.81

13.5 In an attempt to measure the stiffness of a spring the length of the spring under different loads was measured and the data is given in Table 13.4

(a) Use the data to find an equation that could be used to find the length of the spring given the weight.

(b) From your equation estimate, if possible, the length of the spring when

(i) the load is 2.5 kg,

(ii) the load is 5 kg.

Table 13.4

Loads against spring length for Problem 13.15

Load (kg)	Length (cm)
0	10
0.5	10.8
1	11.5
2	14
3	15.5
4	17.5

13.16. The power dissipation of a n–p–n silicon transistor is thought to vary linearly with temperature. The data given in Table 13.5 were recorded experimentally
Plot these points on a scatter diagram and use the method of least squares to obtain a and b in the equation relating the power (P) to the temperature (T); P= a +bT.

Table 13.5

Power dissipation of a n–p–n silicon transistor recorded against temperature

Temperature (°C)	Power dissipation (W)
25	10
60	7.9
100	5.7
120	4.8
140	3.5

13.17. Fit parabolas to the following sets of data:

(a) (–1,0), (0, −1), (1,4), (2, 14) (3,32)

(b) (–1, 5.5), (0, 1.5), (0.5, 0.5), (2, 5).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 13.5 Gauss elimination

Create new playlist

Sign In

Sign Up

13.5 Gauss elimination

Example 13.25

Example 13.26

Example 13.27

Indeterminacy and inconsistency

Order of the equations

13.6 The inverse and determinant of a 3 × 3 matrix

Finding the inverse by elimination

Example 13.28

The determinant of a 3 × 3 matrix

Example 13.29

The inverse of a matrix using (Adjoint(A))/|A|

Example 13.30

13.7 Eigenvectors and eigenvalues

Example 13.31

Example 13.32

13.8 Least squares data fitting

Example 13.33

Curve fitting

Example 13.34

13.9 Summary

13.10 Exercises

Table of Contents for
13.5 Gauss elimination