22.4 Normal Distributions

  • Normal Distributions • Empirical Rule • z-Score • Standard Normal Distribution • Applications of Normal Distributions • Sampling Distribution of x ¯  • Standard Error

The distributions in the previous sections have been for a limited number of values. Let us now consider a very large population, such as the useable lifetime of all the AA batteries sold in the world in a year. It could have a large number of classes with many values within each class.

We would expect a histogram for this very large population to have its maximum frequency very near the mean and taper off to smaller frequencies on either side. It would probably be shaped something like the curve shown in Fig. 22.10.

A bell-shaped curve begins at the origin, rising to a maximum, then falling and approaching the x-axis.

Fig. 22.10

The smooth bell-shaped curve in Fig. 22.10 shows the normal distribution of a population large enough that the distribution is considered to be continuous. Using advanced methods, its equation is found to be

y = e − (x − μ)2 / 2σ2σ2π
(22.4)

Here, μ is the population mean and σ is the population standard deviation, and π and e are the familiar numbers first used in Chapters 2 and 12, respectively.

From Eq. (22.4), we can see that any particular normal distribution depends on the values of μ and σ. The horizontal location of the curve depends on μ ,  and the spread of the curve depends on σ ,  but the bell shape remains. The next example illustrates the shapes of different normal distributions.

EXAMPLE 1 Different normal distributions

In Fig. 22.11, for the left curve, μ = 10 and σ = 5 ,  whereas for the right curve, μ = 20 and σ = 10.

Two bell-shaped curves rise from the x-axis to a maximum, then fall toward the x-axis. One has a maximum at x = 10. The other has a maximum at x = 20.

Fig. 22.11

Although there are many possible normal distributions (depending on the mean and standard deviation), the total area under any normal curve is 1 (or 100%). Also, the percentages of the values that are within one, two, or three standard deviations of the mean are the same for all normal distributions. This fact is called the empirical rule and is stated below.

EXAMPLE 2 Empirical rule—achievement test scores

The scores on a certain achievement test are normally distributed with a mean of 500 and a standard deviation of 50. What percentage of the scores are (a) between 400 and 600, and (b) higher than 550?

  1. Since 400 is two standard deviations below the mean and 600 is two standard deviations above the mean, about 95% of all scores are between these values.

  2. The value 550 is one standard deviation above the mean. This means that about 34% (half of 68%) of the scores are between 500 and 550. Therefore, the percentage of scores that are higher than 550 is about 16%(50% − 34%) . 

NOTE

[Remember that the total area under any normal curve is 1 (or 100%). Because a normal curve is symmetric, the area on either side of the mean is 0.5 (or 50%).]

In order to find percentages of a normal distribution that lie between fractional increments of the standard deviation, we convert the given values to z-scores (or standard scores) and then use the standard normal distribution. It is calculated using the following formula:

NOTE

[The z-score of a particular value x tells us the number of standard deviations the given value of x is above or below the mean.]

z-score z = x − μσ
(22.5)

A positive z-score indicates the value is above the mean, whereas a negative z-score indicates it is below the mean.

EXAMPLE 3 z-scores—achievement test results

Using the mean and standard deviation given in Example 2, find and interpret the z-scores of achievement test results of (a) 610 and (b) 435.

  1. The test result of 610 has a z-score of z = 610 − 50050 = 2.2. This test result is 2.2 standard deviations above the mean. This is a very high test score, which is rarely obtained.

  2. The test result of 435 has a z-score of z = 435 − 50050 =  − 1.3. This test result is 1.3 standard deviations below the mean. This score is below the mean, but fairly typical.

By converting given values to z-scores, we can use the standard normal distribution to find areas (or percentages). Like any normal curve, the total area under the curve is 1. However, the horizontal axis is labeled with z-scores, which can be interpreted as the number of standard deviations above or below the mean. Figure 22.13 shows the standard normal curve. Note that almost all of the area is between z =  − 3 and z = 3 (99.7% according to the empirical rule). Standard normal tables, such as the one given in Table 22.1 on the next page, can be used to find areas under the standard normal curve.

NOTE

[The standard normal distribution has a mean of 0 and a standard deviation of 1.]

A normal distribution curve rises from the z-axis to a maximum at z = 0, then falls toward the z-axis. The curve is symmetrical about z = 0.

Fig. 22.13

NOTE

[The areas in Table 22.1 represent the area between the mean of 0 and the given z-score.]

Other types of areas can be found by adding or subtracting as needed.

Table 22.1 Standard Normal (z) Distribution

 A curve with standard normal distribution. The curve is bell-shaped and centered about the vertical axis. From the vertical axis, 0, to z, the curve is shaded.
z Area z Area z Area z Area z Area z Area
0.1 0.0398 0.6 0.2257 1.1 0.3643 1.6 0.4452 2.1 0.4821 2.6 0.4953
0.2 0.0793 0.7 0.2580 1.2 0.3849 1.7 0.4554 2.2 0.4861 2.7 0.4965
0.3 0.1179 0.8 0.2881 1.3 0.4032 1.8 0.4641 2.3 0.4893 2.8 0.4974
0.4 0.1554 0.9 0.3159 1.4 0.4192 1.9 0.4713 2.4 0.4918 2.9 0.4981
0.5 0.1915 1.0 0.3413 1.5 0.4332 2.0 0.4772 2.5 0.4938 3.0 0.4987

EXAMPLE 4 Finding area under the standard normal curve

Find the area under the standard normal curve between z = 0.8 and z = 2.4 as shown in Fig. 22.14.

A normal distribution curve with a section shaded from 0.8 to 2.4.

Fig. 22.14

Using Table 22.1, the area between z = 0 and z = 2.4 is 0.4918. The area between z = 0 and z = 0.8 is 0.2881. Therefore, the area between z = 0.8 and z = 2.4 is found by subtracting:

0.4918 − 0.2881 = 0.2037(20.37% of the area)

In applications involving normal distributions, areas are found by first converting the values to z-scores and then using the standard normal distribution. This is illustrated in the next example.

EXAMPLE 5 Application of normal distribution—battery lifetimes

The lifetimes of a certain type of watch battery are normally distributed with a mean of 400 days and a standard deviation of 50 days. What percentage of batteries will last (a) between 360 days and 460 days, (b) more than 320 days, and (c) less than 280 days?

  1. We first find the z-score of each value: z = 360 − 40050 =  − 0.8 and z = 460 − 40050 = 1.2 We wish to find the area between these two z-scores as shown in Fig. 22.15. Using Table 22.1, the area between z = 0 and z =  − 0.8 is 0.2881 and the area between z = 0 and z = 1.2 is 0.3849. Thus, the total area is found by adding: 0.2881 + 0.3849 = 0.6730. This means that 67.3% of the batteries will last between 360 and 460 days.

    A normal distribution curve over an x-axis in days. The curve is symmetric about x = 400. It is shaded from 360 to 460, or from z = negative 0.8 to z = 1.2.

    Fig. 22.15

  2. We begin by finding the z-score of 320 days: z = 320 − 40050 =  − 1.6

    We wish to find the area to the right of z =  − 1.6 ,  shown in Fig. 22.16. The area between z = 0 and z =  − 1.6 is 0.4452. Since the area to the right of z = 0 is 0.5, we add to get the total area: 0.4452 + 0.5 = 0.9452. This means that 94.52% of the batteries will last longer than 320 days.

    A normal distribution curve over x-axis in days. The curve is symmetric about x = 400. It is shaded to the right of 320, or z = negative 1.6.

    Fig. 22.16

  3. The z-score of 280 is z = 280 − 40050 =  − 2.4. The area to the left of this (see Fig. 22.17) is 0.5 − 0.4918 = 0.0082. Therefore, only 0.82% of the batteries will last fewer than 280 days.

A normal distribution curve over x-axis in days. It is symmetric about x = 400. It is shaded to the left of 280, or z = negative 2.4.

Fig. 22.17

Instead of using Table 22.1, a calculator can also be used to find areas under the standard normal curve. The normalcdf feature calculates the area between a specified lower and upper bound. When using z-scores in the normalcdf feature, a 10 can be used as an upper bound for areas that continue indefinitely to the right. This is because there is virtually no area more than 10 standard deviations away from the mean. Similarly, a  − 10 can be used as a lower bound for areas that continue to the left. Figure 22.18 shows the calculator’s evaluations of the areas in this example.

A calculator screen.

Fig. 22.18

Graphing calculator screenshot: bit.ly/2OP9f7R

THE SAMPLING DISTRIBUTION OF x̄

Suppose we selected all possible samples of size n from a population with mean μ and standard deviation σ ,  and we calculated the sample mean x ¯  for each sample. We would have a huge number of sample means. The way in which these sample means are distributed is called the sampling distribution of x ¯ . The following summarizes three very important facts about this sampling distribution.

By knowing the mean, standard deviation, and shape of the sampling distribution, it is possible to find the likelihood that the mean of a sample falls within certain limits. This is illustrated in the following example.

EXAMPLE 6 Sampling distribution of x̄ —airline passenger weights

The FAA standards assume that adult airline passengers and their carry-on bags have an average weight of 190 lb in the summer. If the standard deviation of these weights is 25 lb, find the likelihood that for a sample of 100 passengers, their mean weight is greater than 195 lb.

Since the sample size is n = 100 ,  the sampling distribution of x ¯  is approximately normally distributed with a mean of 190 and a standard deviation of 25100 = 2.5. Thus, the z-score of 195 is

z = 195 − 1902.5 = 2.0

We wish to find the area to the right of z = 2.0 ,  which is 0.5 − 0.4772 = 0.0228. In other words, 2.28% of samples of 100 passengers will have means over 195 lb.

EXERCISES 22.4

In Exercises 14, make the given changes in the indicated examples of this section and then solve the indicated problem.

  1.  In Example 1, change the second σ from 10 to 5 and then describe the curve that would result in terms of either or both curves shown in Fig. 22.11.

  2. In Example 2, what percentage of the test scores are between 350 and 650?

  3. In Example 3, find and interpret the z-score of an achievement test score of 630.

  4. In Example 5(b), change 320 to 360 and then find the resulting percentage of batteries.

In Exercises 58, use the following information. If the weights of cement bags are normally distributed with a mean of 60 lb and a standard deviation of 1 lb, use the empirical rule to find the percent of the bags that weigh the following:

  1. Between 58 lb and 62 lb

  2. Between 59 lb and 61 lb

  3. Less than 63 lb

  4. More than 62 lb

In Exercises 912, use the following information. A standardized math test has a mean score of 200 and a standard deviation of 15. Find and interpret the z-scores of the following math test scores.

  1. 218

  2. 179

  3. 164

  4. 233

In Exercises 1316, use the following data. It has been previously established that for a certain type of AA battery (when newly produced), the voltages are distributed normally with μ = 1.50 V and σ = 0.05 V.

  1. According to the empirical rule, what percent of the batteries have voltages between 1.45 V and 1.55 V?

  2. What percent of the batteries have voltages between 1.52 V and 1.58 V?

  3. What percent of the batteries have voltages below 1.54 V?

  4. What percent of the batteries have voltages above 1.64 V?

In Exercises 1724, use the following data. The lifetimes of a certain type of automobile tire have been found to be distributed normally with a mean lifetime of 100,000 km and a standard deviation of 10,000 km. Answer the following questions.

  1. What percent of the tires will last between 85,000 km and 100,000 km?

  2. What percent of the tires will last between 95,000 km and 115,000 km?

  3. In a sample of 5000 of these tires, how many can be expected to last more than 118,000 km?

  4. If the manufacturer guarantees to replace all tires that do not last 75,000 km, what percent of the tires may have to be replaced under this guarantee?

  5. In a sample of 100 of these tires, find the likelihood that the mean lifetime for the sample is less than 98,200.

  6.  What happens to the standard error of the mean as n increases? Use the formula for the standard error to help explain your answer.

  7. What percent of the samples of 100 of these tires should have a mean lifetime of more than 102,000 km?

  8. If 144 of the tires are randomly selected, find the percent chance that the mean lifetime is more than 102,000.

In Exercises 2530, solve the given problems,

  1. With 75.8% of the area under the normal curve to the right of z, find the z-value.

  2. With 21% of the area under the normal curve between z1 and z2 ,  to the right of z1 = 0.8 ,  find z2.

  3. With 59% of the area under the normal curve between z1 and z2 ,  to the left of z2 = 1.1 ,  find z1.

  4. With 5.8% of the area under the normal curve between z1 and z2 ,  to the left of z2 = 2.0 ,  find z1.

  5. The residents of a city suburb live at a mean distance of 16.0 km from the center of the city, with a standard deviation of 4.0 km. What percent of the residents live between 12.0 km and 18.0 km of the center of the city?

  6.  For the data on the number of Android apps in Exercise 19 of Section 22.1, find the percent of the data that lie within one, two, and three standard deviations of the mean. Compare these percentages with the ones in the empirical rule.

Answer to Practice Exercises

  1. 0.5328

  2. 99.7%

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.144.205.223