Chapter 6

Getting the Most from Minimal-Run Designs

The best carpenters make the fewest chips.
German Proverb

In the previous chapter, we demonstrated how to shave runs from a two-level factorial design by performing only a fraction of all possible combinations. In this chapter, we explore minimal designs with one fewer factor than the number of runs, e.g., seven factors in 8 runs. Statisticians consider such designs to be “saturated” with factors. These resolution III designs confound main effects with two-factor interactions—a major weakness. However, they may be the best option when time and other resources are limited. If you are lucky, nothing will be significant and any questions about aliasing become moot. However, if the results exhibit significance, you must make a big leap of faith to assume that the reported effects are correct. To be safe, do further experimentation (known as design augmentation) to de-alias the main effects and/or two-factor interactions. The most popular method of design augmentation is the foldover. We will illustrate this method with a case study.

Minimal-Resolution Design: The Dancing Raisin Experiment

The dancing raisin experiment provides a vivid demonstration of the power of interactions. It normally involves just two factors:

  1. Liquid: Tap water versus carbonated
  2. Solid: A peanut versus a raisin

Only one out of the four possible combinations produces an effect. Peanuts will generally float in water, and raisins usually sink. Peanuts are even more likely to float in carbonated liquid. However, when you drop in a handful of raisins, the results can be delightful. Most will drop to the bottom. There the raisins become coated with tiny bubbles, which lift some of them to the surface. At the surface, the bubbles pop, and some raisins drop to the bottom again. The up-and-down process can continue for some time, creating a “dancing” effect. Conduct this experiment with your family and friends and follow up by getting their ideas on the cause of this interaction of factors. Consider also why some raisins fail to dance, which may be due to their freshness, the specific brand of carbonated liquid, and so forth. For an alternative, try using a popcorn kernel rather than a raisin. These and other factors listed in Table 6.1 became the subject of a two-level factorial design. Note that the aging of the objects, factor G, was accelerated by baking them under a 100-watt lightbulb for 15 minutes.

Table 6.1

Factors for initial DOE on dancing objects

Factor

Name

Low Level (–)

High Level (+)

A

Material of container

Plastic

Glass

B

Size of container

Small

Large

C

Liquid

Club soda

Lemon lime

D

Temperature

Room

Ice cold

E

Cap on container

No

Yes

F

Type of object

Popcorn

Raisin

G

Age of object

Fresh

Stale

The full two-level factorial for seven factors requires 128 runs. This can be computed as 2 * 2 * 2 * 2 * 2 * 2 * 2 = 128 or, more conveniently, by exponential notation: 27. We chose the 1/16th fraction (2-4 in scientific notation) that requires only 8 runs (= 27 * 2-4 = 27-4 in exponential notation). This is a minimal design with resolution III. At each set of conditions, we rated the dancing performance of 10 objects on a scale of 1 to 5; the higher, the more delightful. Table 6.2 shows the results. The actual order was randomized within one block (Blk) of runs.

Table 6.2

Results from first dancing raisin experiment

Std

Blk

A

B

C

D

E

F

G

Dancing Rating

1

1

+

+

+

1.5

2

1

+

+

+

2.0

3

1

+

+

+

1.0

4

1

+

+

+

4.0

5

1

+

+

+

1.5

6

1

+

+

+

1.0

7

1

+

+

+

5.0

8

1

+

+

+

+

+

+

+

1.0

The half-normal plot of effects is shown in Figure 6.1.

Figure 6.1

Graph of Half-normal plot of effects

Half-normal plot of effects.

Three effects stood out: cap (E), age of object (G), and size of container (B). The analysis of variance (ANOVA) on the resulting model revealed highly significant statistics: An F-value of 27.78 with an associated probability of 0.0039, which falls far below the maximum threshold of 0.05. (Reminder: A probability value of 0.05 indicates a 5% risk of a false positive, i.e., saying something happened because of a specific effect when it actually occurred by chance. An outcome like this is commonly reported to be significant at the 95% confidence level.) The cube plot on Figure 6.2 shows the predicted responses for the three listed factors.

Figure 6.2

Image of Cube plot of predicted responses

Cube plot of predicted responses.

The worst rating (lowest) occurs at the upper-left, back corner of the cube; essentially no reaction at all. It reflects negative impacts of stale objects (G+) and capped liquid (E+), both of which make sense. However, the effect of container size (B) does not make much sense. Could this be an alias for the real culprit, perhaps an interaction? Take a look at the alias structure for this resolution III design, shown in Table 6.3. Each main effect in this experiment is actually aliased with 15 other effects, but we have simplified the table to list only two-factor interactions.

Table 6.3

Alias structure for 27-4 design (7 factors in 8 runs)

Labeled as

Actually

A

A + BD + CE + FG

B

B + AD + CF + EG

C

C + AE + BF + DG

D

D + AB + CG + EF

E

E + AC + BG + DF

F

F + AG + BC + DE

G

G + AF + BE + CD

Can you pick out the likely suspect from the lineup for B? Although the number of possibilities may seem overwhelming, they can be narrowed by assuming that the effects form a family.

The obvious alternative to B (size) is the interaction EG. However, this is only one of several feasible “hierarchical” models (ones that maintain family unity):

  • E, G, and EG (disguised as B)
  • B, E, and BE (disguised as G)
  • B, G, and BG (disguised as E)

Figure 6.3a–c illustrates these three alternative models.

Figure 6.3a–c

Graphs of Alternative interactions EG, BE, and BG

Alternative interactions EG, BE, and BG (left to right).

Notice that each figure predicts the same maximum outcome. However, the actual cause remains murky. The EG interaction seems far more plausible than the alternatives, but further experimentation is needed to verify this.

Complete Foldover of Resolution III Design

By adding a second block of runs with signs reversed on all factors, you can break the aliases between main effects and two-factor interactions. This procedure is called a complete foldover. It works on any resolution III factorial. It’s especially popular with Plackett–Burman designs, such as the 11 factors in 12-run choice. Table 6.4, which shows the second block of experiments with all signs reversed on the control factors, illustrates how the foldover method works on the dancing raisin experiment.

Table 6.4

Second block of runs after complete foldover (selected interactions included)

Std

Blk

A

B

E

C

D

F

G

AD

BE

BG

EG

Dancing Rating

1

1

+

+

+

+

1.5

2

1

+

+

+

+

2.0

3

1

+

+

+

+

+

+

+

1.0

4

1

+

+

+

+

+

4.0

5

1

+

+

+

+

1.5

6

1

+

+

+

+

1.0

7

1

+

+

+

+

+

5.0

8

1

+

+

+

+

+

+

+

+

+

+

+

1.0

9

2

+

+

+

+

+

1.2

10

2

+

+

+

+

+

0.9

11

2

+

+

+

+

+

+

+

+

4.6

12

2

+

+

+

+

+

+

1.4

13

2

+

+

+

+

+

0.6

14

2

+

+

+

+

+

1.3

15

2

+

+

+

+

+

+

1.2

16

2

+

+

+

+

4.5

Notice that the signs of the two-factor interactions do not change from block 1 to block 2. For example, in block 1 the signs of columns B and EG are identical, but in block 2 they differ, thus the combined design no longer aliases B with EG. If B really is the active effect, it should come out on the plot of effects for the combined design.

As you can see in Figure 6.4, factor B has disappeared and AD has taken its place. But, hold on. What happened to family unity? Neither of AD’s parents (A or D) appear in this chosen model.

Figure 6.4

Graph of Half-normal plot of effects after foldover

Half-normal plot of effects after foldover.

The problem is that complete foldover of a resolution III design does not break the aliasing of two-factor interactions, so AD remains aliased with EG as well as CF. The listing of the effect as AD—the interaction of container material with beverage temperature—is arbitrary, by alphabetical order. Figure 6.5 shows the AD interaction with all other factors set arbitrarily at their low levels (specified in Table 6.1). It makes no sense physically for the effect of material (A) to depend on temperature of the beverage (room temperature D– versus ice-cold D+).

Figure 6.5

Graph of Interaction plot for AD

Interaction plot for AD.

Discounting the CF interaction (liquid type versus object type) is not easy, but this new evidence clearly shows that the interaction between E and G is the most plausible, particularly since we now know that these two factors are present as main effects. Figure 6.6 shows the EG interaction.

Figure 6.6

Graph of Plot of interaction EG

Plot of interaction EG.

It appears that the effect of cap (E) depends on the age of the object (G). When the object is stale (the G+ line at the bottom of Figure 6.6), twisting on the bottle cap (going from E– at left to E+ at right) makes little difference. However, when the object is fresh (the G– line at the top), the bottle cap quenches the dancing reaction. More experiments are required to confirm this interaction. One obvious way to do this is to conduct a full factorial on E and G alone. Other ideas on de-aliasing are presented after the next sidebar.

Single-Factor Foldover

Another simple way to de-alias a resolution III design is the “single-factor foldover.” Like a complete foldover, this requires a second block of runs, but in this variation of the general method, you change signs on only one factor. This factor and all its two-factor interactions become clear of any other main effects or interactions. To see how this works, go back to the original resolution III design on the dancing raisins. It makes sense to focus on the biggest effect: E (refer to Figure 6.1). The end result of the foldover on factor E (only) is a design with 16 runs in two blocks of eight. The resulting alias structure (main effects and two-factor interactions only) is shown in Table 6.5.

Table 6.5

Alias structure for 27-4 design after foldover on factor E

Labeled as

Actually

A

A + BD + FG

B

B + AD + CF

C

C + BF + DG

D

D + AB + CG

E

E

F

F + AG + BC

G

G + AF + CD

AC

AC + BG + DF

AE, BE, CE, DE, EF, EG

AE, BE, CE, DE, EF, EG

The combined design remains at resolution III because, with the exception of E, all main effects remained aliased with 2 two-factor interactions. Factor E is a resolution V, because the main effect is clear of troublesome aliases (anything less than a four-factor interaction), and the two-factor interactions are aliased only with three-factor or higher-order interactions. In the case of the dancing raisins, the single-factor foldover would have revealed that B was actually EG, not AD. On the other hand, factor G remains aliased with 2 two-factor interactions. You can’t win either way.

Choose a High-Resolution Design to Reduce Aliasing Problems

The best way to reduce aliasing problems is to run a higher resolution design in the first place by selecting fewer factors and/or a bigger design. For example, in the dancing raisin experiment, we would have prevented much confusion by testing the 7 factors in 32 runs (27-2). This option was discussed in the previous chapter on fractional factorials. It is a resolution IV design, but all 7 main effects and 15 of the 21 two-factor interactions are clear of other two-factor interactions. The remaining 6 two-factor interactions are shown in Table 6.6.

Table 6.6

Problem aliases for 27-2 design (7 factors in 32 runs)

Labeled

Actually

DE

DE + FG

DF

DF + EG

DG

DG + EF

The trick is to label the likely interactors anything but D, E, F, and G. For example, knowing now that capping and age interact in the dancing raisin experiment, we would not label these factors E and G. If only we knew then what we know now.

Another option, one that requires no premonition, is to use a minimum-run resolution V (MR5) design (described in the appendix of this chapter). With just 30 runs, all main effects and two-factor interactions can be estimated.

Practice Problems

Problem 6.1

(Warning: The following story contains liberal doses of fantasy.) One of the authors became envious of the skating ability of his co-author. The “wannabe” skater secretly got together with a local manufacturer of in-line skates and borrowed the latest and greatest experimental gear. Bewildered by all the options, he decided to try various combinations at the local domed stadium, which opened its concourse to skaters when not in use for sporting events. The special skates would have to be returned fairly soon, so the wannabe skater set up a quick-and-dirty fractional factorial. If anything proved to be statistically significant, the result would be a faster time around the track (and, of course, some ego gratification). The experimenter knew that aliasing of effects would obscure a true picture of what really enhanced speed, but he anticipated that this could be figured out later through follow-up designs using the foldover method. Table 6.7 shows the initial design, a 27-4 resolution III fractional factorial, and the resulting times around the track.

Table 6.7

First experiment on in-line skates

Std

A: Pad

B: Bearing

C: Gloves

D: Helmet

E: Wheels

F: Covers

G: Neon

Time (sec)

1

Out

Old

On

Front

Soft

Off

Off

195

2

In

Old

On

Back

Hard

Off

On

192

3

Out

New

On

Back

Soft

On

On

200

4

In

New

On

Front

Hard

On

Off

165

5

Out

Old

Off

Front

Hard

On

On

190

6

In

Old

Off

Back

Soft

On

Off

195

7

Out

New

Off

Back

Hard

Off

Off

166

8

In

New

Off

Front

Soft

Off

On

201

Here is more background on the factors and levels to help you interpret the outcome:

  1. Pad goes inside skate to elevate the heel: Out (−), In (+)
  2. Bearing constructed either from old material (−) or new high-tech alloy (+)
  3. Gloves made specially for in-line skating to protect wrists: On (−), Off (+)
  4. Helmet fits with logo to back (−) or front (+): Can’t tell which is correct, so try both and ignore laughs when wrong
  5. Wheels can be made of hard (−) or soft (+) polymer
  6. Covers go on wheels to make them look faster: On (−), Off (+)
  7. Neon lighting (from generator on skates) for night-time use: Off (−), On (+)

Analyze the data to see if any of these factors appear to be significant. Do the results make sense? Could the real answer be disguised by an alias? (Suggestion: Refer to Table 6.3. Use the software provided with the book. Create a two-level factorial design for 7 factors in 8 runs and sort the resulting layout by standard order, then enter the time data from Table 6.7. Do the analysis as outlined in the factorial tutorial that comes with the program.)

Problem 6.2

This is a continuation of the skating saga from Problem 6.1. Rolling right along, the experimenter decided to do a complete foldover on the initial design. Table 6.8 shows the factor levels and results for this follow-up design.

Table 6.8

Follow-up design (foldover) to initial DOE on in-line skates

Std

A: Pad

B: Bearing

C: Gloves

D: Helmet

E: Wheels

F: Covers

G: Neon

Time (sec)

9

In

New

Off

Back

Hard

On

On

175

10

Out

New

Off

Front

Soft

On

Off

211

11

In

Old

Off

Front

Hard

Off

Off

202

12

Out

Old

Off

Back

Soft

Off

On

205

13

In

New

On

Back

Soft

Off

Off

212

14

Out

New

On

Front

Hard

Off

On

175

15

In

Old

On

Front

Soft

On

On

204

16

Out

Old

On

Back

Hard

On

Off

201

Add these data to the design from Problem 6.1 and analyze it as a second block of data. Do any of the significant model terms turn out to be interactions rather than main effects? Remember that the foldover upgrades the resolution III design to resolution IV, but interactions may still be aliased with other interactions. If interactions do appear, do they make more sense than the aliased alternatives? (Suggestion: Use the software provided with the book. Look for a data file called “6-P2 Skate2,” open it, then do the analysis. View the aliased interactions and try substitutions. Graphically compare the alternatives in a similar manner to that outlined for the dancing raisin case.)

Appendix: Minimum-Run Designs for Screening

In the presence of two-factor interactions, only designs of resolution IV (or higher) can ensure accurate screening. If you are limited by time, materials, or other experimental resources, in most cases (other than eight factors), modern “minimum run” resolution IV (MR4) designs offer savings in runs over the equivalent standard two-level fractional factorial design (2k-p). A nine-factor MR4 design, for example, requires only 18 runs, far fewer than the 32 runs needed for a standard 2k-p. In general, the MR4 designs require only two times the number of factors (2k). Table 6.9 details a minimum-run screening design on six factors that a biotechnologist ran on a fermentation process (from: Mark Anderson and Patrick Whitcomb, October 2006, Chemical Processing. Online at www.chemicalprocessing.com/articles/2006/166/).

Table 6.9

MR4 design done to screen a fermentation process

#

A: Temp. °C

B: pH

C: O2 %

D: Load Density

E: Complex g/min

F: Inducer g/min

Yield Density

1

39

6.8

40

6

4.8

4.8

70.39

2

35

6.8

40

20

4.8

7.2

101.40

3

39

6.8

40

20

3.2

4.8

86.88

4

39

6.8

5

20

4.8

7.2

79.71

5

39

6.8

40

6

3.2

7.2

61.68

6

35

6.8

5

6

3.2

4.8

50.2

7

35

7.4

5

20

3.2

7.2

97.23

8

39

7.4

5

6

3.2

4.8

73.33

9

35

7.4

5

6

4.8

7.2

67.51

10

35

7.4

40

6

3.2

4.8

71.71

11

35

7.4

5

20

4.8

4.8

111.20

12

39

7.4

40

20

4.8

7.2

74.92

What you do as a result of running such a screening design depends on the statistical analysis of the effects:

Scenario 1: Nothing significant. Look for other factors that affect your response(s).

Scenario 2: Only main effects significant. Change these factors to their best levels.

Scenario 3: Two-factor interaction(s) significant. De-alias by performing a semifoldover.

By following this strategy you will increase your odds of uncovering breakthrough main effects and interactions at a relatively minimal cost in experimental runs.

The development of minimum-run, two-level factorial designs has gone to the next level with resolution V “characterization” designs such as the MR5 (minimum-run resolution five) shown in Table 6.10 for six factors in only 22 runs. The following boxed text provides some background and a reference for more details on the new class of MR5 designs.

Table 6.10

MR5 layout for six factors

Row

A

B

C

D

E

F

1

+

+

+

2

+

+

3

+

4

+

+

+

+

5

+

+

+

6

+

+

+

+

7

+

+

+

8

+

+

+

9

+

+

10

+

+

+

+

+

11

+

+

+

12

+

+

+

+

+

13

+

+

+

14

+

+

+

15

+

+

+

16

+

+

+

17

+

+

18

+

19

+

20

+

+

+

+

+

21

+

+

+

+

22

+

+

+

Ruggedness Testing

Before transferring any system, such as a product, process, or test method, find out what the recipients will do to it. For example, a small U.S. medical device manufacturer redesigned the electronics in its 220-volt unit for European customers. It worked without fail in all but one country, where every single unit burned out. The country where this occurred was relatively undeveloped, and its power supply varied more than that of any other European country the device manufacturer supplied. One of the components in the new design could not tolerate such wide variation in voltage. After this fiasco, the engineers developed a standard fractional factorial design to test new designs against this variation and half a dozen or so other variables that could affect the units. This led to redesigning components and the overall system to make it more robust.

For a cookbook approach, see ASTM (American Society for Testing and Materials) Standard E1169 “Standard Practice for Conducting Ruggedness Tests,” available for download from American National Standards Institute (ANSI) eStandards Store at http://webstore.ansi.org.

A Very Scary Thought

Could a positive effect be cancelled by an “antieffect”?

If you use a resolution III design, be prepared for the possibility that a positive main effect may be wiped out by an aliased interaction of the same (but negative) magnitude. The opposite could happen as well. Therefore, if nothing significant emerges from a resolution III design, you cannot always be certain that there are no active effects. Two or more big effects may have cancelled each other out.

An Alias by Any Other Name Is Not Necessarily the Same

You might be surprised that aliased interactions, such as AD and EG, do not look alike. Their coefficients are identical, but the plots differ because they combine the interaction with their parent terms. To check this, construct the interaction plot for CF using the methods described in Chapter 3. This interaction also is aliased with AD. By comparing the graphs of aliased interactions (such as AD versus EG versus CF), you can hazard an educated guess about which interaction merits further investigation.

Foldover Doesn’t Always Add a New Wrinkle

The complete foldover of resolution IV designs may do nothing more than replicate the design so that it remains resolution IV. This would happen if you folded over the 16 runs in Table 6.4. By folding only certain columns of a resolution IV design, you might succeed in de-aliasing some of the two-factor interactions. Other than trying different combinations of columns to fold over, the only sure way to eliminate aliases is the single-factor foldover, which works on resolution IV the same as it would on resolution III designs: The main effect and all the two-factor interactions of the factor you choose will be cleared. If you are really adventurous (or possess design of experiments (DOE) software offering this design augmentation tool), try cutting the single-factor foldover in half to create a “semifoldover.” For details, see How to Save Runs, Yet Reveal Breakthrough Interactions, by Doing Only a Semifoldover on Medium-Resolution Screening Designs, a paper presented by Mark Anderson and Patrick Whitcomb at the 55th Annual Quality Congress of the American Society of Quality in Milwaukee, 2001 (online at www.statease.com).

“As for the Research Department, the Board feels you should try to find whatever you’re looking for the first time you search for it.”

Caption on cartoon in American Scientist 95 (MarchApril 2007): 120.

Definitive Screening Designs

Definitive screening designs (DSD) are recently invented near-minimal run (2K+1) with three (not just two) levels of each factor (Jones, B. and C. Nachtsheim. 2011. A Class of Three-Level Designs for Definitive Screening in the Presence of Second-Order Effects. Journal of Quality Technology, January). DSDs produce clean estimates of main effects. They also generate squared terms, but these are badly aliased with two-factor interactions and, thus, must be approached with caution. For screening continuous numeric factors that can be readily and precisely controlled, running three levels of each, as required by DSD, might be of interest due to it covering the ranges better than simply testing the extremes via more conventional two-level factorials. Normally, when screening commences, it falls on to a planar region, but if you suspect the region will be curvy, then give DSDs strong consideration.

Just in Case Runs Do Not Go as Planned

By choosing minimum-run resolution IV designs, you go to the brink of falling back to an experiment that aliases two-factor interactions with main effects: The loss of one result will push you over the edge. How sure are you that things will not break down somewhere along the way as the experiment progresses? How likely is it that a measurement will be lost? To be on the safe side, choose an MR4 with 2 runs addedan option presented in the software that accompanies this book. Using this option, the six-factor design laid out in Table 6.9 would be tested in 14 runs. Safer yet, and slightly more powerful, would be the 16-run standard resolution IV fraction noted in Table 5.6. When in doubt, build it stout.

PS: For more details on minimum-run resolution IV designs, see the 2004 talk by Anderson and Whitcomb on “Screening Process Factors in the Presence of Interactions,” online at www.statease.com.

Custom-Made Optimal Designs Made Possible with Aid of Computers

Most statistical software offering capabilities for design of experiments includes computer-generated test matrices that are custom built to the model you specify. For example, if you want to produce a resolution V design for 7 factors, specify a model with all 7 main effects, the 21 possible two-factor interactions, and the intercept (overall average of the response) for a total of 29 terms. The computer then performs a search via the algorithm programmed into the DOE software, which typically uses D-optimal criterion. (See Optimality from A to Z (or Maybe Just D) (Chapter 7) of RSM Simplified (Productivity Press, 2004).) The MR5 templates noted above were developed in a similar manner, but balanced off to create equal numbers of low and high levels, i.e., “equireplicated.(See Oehlert and Whitcomb’s 2002 talk on Small, Efficient, Equireplicated Resolution V Fractions of 2k Designs, online at www.statease.com.) With this in place, the 7-factor design lays out 30 runs15 each at the low and high levels. The straight D-optimal design will present some factors with 16 low and 14 high and others with 14 low and 16 high, which is figuratively and literally quite odd.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.141.192.120