Chapter 13
Experimental Designs

Package(s): BHH2, AlgDesign, granova, multcomp, car, agricolae, phia

Dataset(s): olson, tensile, girder, Hardness, reaction, rocket, rocket_Graeco, battery, bottling, SP, intensity

13.1 Introduction

Experimental Designs, also known as Design of Experiments (DOE), is one of the most important pillars of statistics.

Section 13.2 introduces the important principles of experimental design, beginning with an interesting real experiment. The first experimental design model is discussed in Section 13.3, and its extension to block designs is detailed in Section 13.4. Factorial designs, an effective extension and an important class of models, are taken up in Section 13.5.

13.2 Principles of Experimental Design

Salsburg (2001) gives a very interesting historical account in his book “The Lady Tasting Tea”. The first chapter, which shares its title with the book, tells the story of a lady who declared in a cafeteria that she could clearly distinguish between two variants of tea: (i) tea poured into milk, and (ii) milk poured into tea. As Salsburg puts it, “A thin, short man, with thick glasses and a Vandyke beard beginning to turn gray, pounced on the problem”, and a live experiment rolled on. The lady was served a sequence of cups, some prepared by pouring tea into milk and others by pouring milk into tea. For each cup, she would take one sip and declare which preparation she believed had been used, and the result would be noted down. She was not told after any cup whether her answer was correct. This experiment, performed by the Vandyke-bearded scientist, became very famous and laid the foundations of DOE.

The reader may be curious about two points: (i) Who was the Vandyke-bearded scientist? and (ii) What was the result of the experiment? The scientist was, of course, Sir Ronald Fisher, and he put randomization into action for the tea-tasting lady. It is important to note that the goal of the experiment was to test whether the lady's claim was correct. Randomization limits the influence of lucky guesses on the results. Had only the results of the experiment been publicized, people might have missed the concept of “randomization” and the fact that the experiment was about testing the lady's claim. The lady was correct with all her ten guesses, thanks to the fact that she was indeed an expert at distinguishing between the two methods of tea preparation. Salsburg (2001) tells this story in a far more engaging way, and the reader should consult the book for more details.

The three important concepts of DOE are (i) Randomization, (ii) Replication, and (iii) Blocking.

Randomization. An experimenter may, knowingly or not, bring biases into an experiment, and this influence needs to be removed before the experiment is run. For example, if Fisher had first sent five cups of tea poured into milk and then five cups of milk poured into tea, we would effectively have just two observations and not ten. Similarly, sending the two types of tea alternately makes the sequence predictable. Thus, it is necessary to mix things up, and sending the cups in a random order is the natural way to remove the experimenter's bias.

Replication. In “The Lady Tasting Tea” example, randomization alone does not help us if we were sending just two cups of tea to the lady. We need to ensure that the number of cups of “tea poured into milk” and “milk poured into tea” is large enough to support our randomization technique mentioned earlier.

Blocking. Consider an artificial example where we have to decide if the students learn better in classrooms with or without air conditioning (AC). For students from Classes I to IV, we have been careful enough to allocate enough numbers of students to the classrooms with and without AC. Despite our caution, suppose that we have accidentally put all the boys in the classrooms with AC and all the girls in the classrooms without AC. Assume that the result shows that boys perform better than girls. Is the result acceptable? Or assume that the results show that AC students get higher marks than non-AC students. Are the results still acceptable? As we know that the results are not acceptable, we must understand that there are some natural obstacles/restrictions in the nature of the experimental units. This variation in the experimental units cannot be allowed to ruin the results of the experiments. Thus, we need to form blocks which remove this kind of bias/error from the experiment.
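The randomization principle can be sketched in R for the tea-tasting setup. This is a minimal illustration, assuming ten cups with five of each preparation; the labels and the seed are hypothetical choices made here for reproducibility and are not part of the record of the original experiment.

```r
# Ten cups: five of each preparation (the labels are illustrative)
cups <- rep(c("tea_into_milk", "milk_into_tea"), each = 5)

set.seed(13)                    # fixed seed only so the sketch is reproducible
serving_order <- sample(cups)   # random permutation of the ten cups

# Position k of serving_order is the cup served at turn k
data.frame(turn = seq_along(serving_order), preparation = serving_order)
```

Replication shows up as the five cups per preparation, and randomization as the call to sample, which makes every ordering of the ten cups equally likely.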

We will now consider the simple experimental design, where the importance of randomization will play the central role, in the next section.

13.3 Completely Randomized Designs

Completely Randomized Designs (CRD) is one of the first steps in DOE, and it is a simple setup which involves replication and randomization. If the source of variation in the output is only due to the treatments, CRD is appropriate to deduce the more effective treatments.

13.3.1 The CRD Model

Let $y_{ij}$ denote the $j$-th experimental unit for the $i$-th treatment, with $i = 1, \ldots, v$, and $j = 1, \ldots, n$. We assume that the number of observations for each treatment is the same as for any other treatment. Suppose that the experimenter believes that the average yield due to treatment $i$ is $\mu_i$. The CRD model is then expressed by

$$y_{ij} = \mu_i + \epsilon_{ij}, \quad i = 1, \ldots, v, \; j = 1, \ldots, n, \tag{13.1}$$

where $\epsilon_{ij} \sim N(0, \sigma^2)$. This model 13.1 is known as the means model. The more general and useful mathematical model for the CRD is

$$y_{ij} = \mu + \tau_i + \epsilon_{ij}, \quad i = 1, \ldots, v, \; j = 1, \ldots, n. \tag{13.2}$$

The CRD model 13.2 in this form is known as the effects model. The effects model is more feasible from a practical point of view. In this form the mean $\mu$ is thought of as some guaranteed yield in the absence of any kind of treatment. This parameter is also known as the baseline or control treatment. The parameter values $\tau_i$ are a reflection of the effect due to the treatment $i$. We will consider the second format of the model throughout the rest of this section. The error component, or noise factor, $\epsilon_{ij}$ is associated with the $(i,j)$-th experimental unit. We will assume that the errors are iid as $N(0, \sigma^2)$, where the variance of the normal distribution is not known. Thus, the CRD model says that the probability distribution of the experimental units $y_{ij}$ is $N(\mu + \tau_i, \sigma^2)$. The means and effects models are also related in the sense of defining $\mu_i = \mu + \tau_i$.

In both models 13.1 and 13.2, each treatment receives an equal number of experimental units, that is, each treatment $i$ receives $n$ units. Such models are called balanced designs. In practical setups, it may not be feasible to allocate equal numbers of units, and we allow the $i$-th treatment to receive $n_i$ units, $i = 1, \ldots, v$. In this case, the model is called an unbalanced design. The inferential aspects of balanced and unbalanced models do not vary drastically from each other, at least for the CRD model, and the coverage for the balanced model is provided in the rest of this section.
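To make the effects model concrete, the following sketch simulates a small balanced CRD. All numerical values (the baseline mean, the treatment effects, and the error standard deviation) are hypothetical choices made for illustration, and the names mu, tau, and sigma are assumed here to stand for the baseline, the treatment effects, and the error scale.

```r
set.seed(1)
v <- 3; n <- 4          # number of treatments and replicates (assumed values)
mu <- 10                # baseline mean (hypothetical)
tau <- c(0, 2, -1)      # treatment effects (hypothetical)
sigma <- 1.5            # error standard deviation (hypothetical)

treatment <- factor(rep(paste0("T", 1:v), each = n))
# Response = baseline + treatment effect + iid normal noise
y <- mu + tau[as.integer(treatment)] + rnorm(v * n, mean = 0, sd = sigma)
crd_sim <- data.frame(treatment, y)

table(crd_sim$treatment)                     # balanced: n units per treatment
tapply(crd_sim$y, crd_sim$treatment, mean)   # sample means estimate mu + tau_i
```

An unbalanced version would only change the replicate counts, for example by replacing each = n with times = c(5, 3, 4) and adjusting the number of simulated errors.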

The movement to Design Matrix from Covariate Matrix. The covariates $x$'s are missing in the effects model 13.2! In fact, the $x$'s will not appear in the rest of this chapter either. Recollect that the median polish model 4.14 also did not have the covariates $x$'s. However, the covariates are very much present in these models and they have a well-defined format. Indeed, the covariates are designed to appear in a specific way, and this is the reason why we call the covariate matrix the Design Matrix. The models in this broad area completely determine the exact structure of the covariate matrix. In R, the function model.matrix generates the exact design matrix, and we will come to this function later.

Consider the effects in model 13.2. Suppose we have $v = 3$ treatments and $n = 4$ observations for each of the treatments. Let the treatment effect be denoted by $\tau_i$, and let $x_i$ take the value of 1 if treatment $i$ is assigned to the observation, and 0 otherwise, $i = 1, 2, 3$. The design matrix $X$ is then defined as in Table 13.1. Note that it is important to drop one of the $x_i$, because the three indicators always add up to the intercept column; in Table 13.1 it is $x_3$ that is dropped.

Table 13.1 Design Matrix of a CRD with $v = 3$ Treatments and $n = 4$ Observations

Observation	Intercept	$x_1$	$x_2$
1	1	1	0
2	1	1	0
3	1	1	0
4	1	1	0
5	1	0	1
6	1	0	1
7	1	0	1
8	1	0	1
9	1	0	0
10	1	0	0
11	1	0	0
12	1	0	0
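The layout of Table 13.1 can be reproduced with model.matrix. The sketch below uses a hypothetical factor trt with levels T1, T2, and T3; by default R drops the indicator of the first level, so the reference level is moved to T3 with relevel to match the table, in which the third indicator is dropped.

```r
# Hypothetical treatment factor: 3 treatments, 4 observations each
trt <- factor(rep(c("T1", "T2", "T3"), each = 4))

# Default treatment contrasts: the first level (T1) is the dropped indicator
X_default <- model.matrix(~ trt)

# Making T3 the reference level reproduces the layout of Table 13.1
trt_ref3 <- relevel(trt, ref = "T3")
X_table <- model.matrix(~ trt_ref3)
X_table
```

Both matrices span the same column space, so the fitted values and the ANOVA table do not depend on which indicator is dropped; only the interpretation of the coefficients changes.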

The next small sub-section will help in random allocation of the treatments to the experimental units.

13.3.2 Randomization in CRD

Suppose that we have $v$ treatments and that the $i$-th treatment, $i = 1, \ldots, v$, is allocated $n_i$ experimental units. Let $N = \sum_{i=1}^{v} n_i$ be the total number of available experimental units.

Example 13.3.1. Allocation of Experimental Units to Treatments

Randomization for CRD is really simple in R. Here, we have $v = 3$ and $n_i = 4$ and thus $N = 12$ units. Let the 12 units be labeled from 1 to 12. Now, these 12 units need to be randomly assigned to one of the three treatments, LETTERS[1:3]. We use the sample function to randomly assign the units to treatments.

> treatments <- LETTERS[1:3] # 3 treatments in action
> replicates <- c(4,4,4) # number of replicates
> total_units <- 1:sum(replicates)
> unsort_tr <- rep(treatments,replicates)
> unsort_numbers <- sample(total_units,length(total_units))
> cbind(unsort_tr,unsort_numbers)
      unsort_tr unsort_numbers
 [1,] "A"       "9"
 [2,] "A"       "6"
 [3,] "A"       "11"
 [4,] "A"       "8"
 [5,] "B"       "4"
 [6,] "B"       "10"
 [7,] "B"       "2"
 [8,] "B"       "12"
 [9,] "C"       "3"
[10,] "C"       "7"
[11,] "C"       "5"
[12,] "C"       "1"

The above allocation is to be interpreted as follows. Treatment A is assigned the experimental units 9, 6, 11, and 8, treatment B is assigned 4, 10, 2, and 12, whereas treatment C is allocated with units 3, 7, 5, and 1. Note that this allocation is done before conducting the experiment. The reader may get a different allocation based on running the code at the R terminal. □

The statistical inference for the CRD model will now be discussed.

13.3.3 Inference for the CRD Models

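A few standard notations are in order. The $i$-th treatment sample sum (mean), denoted by $y_{i.}$ ($\bar{y}_{i.}$), and the total sample sum (mean), $y_{..}$ ($\bar{y}_{..}$), are defined by

$$y_{i.} = \sum_{j=1}^{n} y_{ij}, \quad \bar{y}_{i.} = \frac{y_{i.}}{n}, \quad i = 1, \ldots, v, \tag{13.3}$$

$$y_{..} = \sum_{i=1}^{v} \sum_{j=1}^{n} y_{ij}, \quad \bar{y}_{..} = \frac{y_{..}}{vn}. \tag{13.4}$$

Define the total (corrected) sum of squares, denoted by $SS_T$, as

$$SS_T = \sum_{i=1}^{v} \sum_{j=1}^{n} (y_{ij} - \bar{y}_{..})^2. \tag{13.5}$$

The ANOVA technique partitions $SS_T$ as the sum of two components: (i) the sum of squares due to treatments $SS_{Tr}$, and (ii) the sum of squares due to error $SS_E$. Here, $SS_{Tr}$ and $SS_E$ are defined by

$$SS_{Tr} = n \sum_{i=1}^{v} (\bar{y}_{i.} - \bar{y}_{..})^2, \tag{13.6}$$

$$SS_E = \sum_{i=1}^{v} \sum_{j=1}^{n} (y_{ij} - \bar{y}_{i.})^2. \tag{13.7}$$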

Note that $SS_{Tr}$ accounts for the between-treatments effect, and $SS_E$ accounts for the within-treatments difference. Here, $SS_T$ has $vn - 1$ degrees of freedom, whereas $SS_{Tr}$ and $SS_E$ have respectively $v - 1$ and $v(n-1)$ degrees of freedom. It is easy to verify that

$$SS_T = SS_{Tr} + SS_E.$$

Define

$$S_i^2 = \frac{1}{n-1} \sum_{j=1}^{n} (y_{ij} - \bar{y}_{i.})^2, \quad i = 1, \ldots, v.$$

That is, $S_i^2$ is the sampling variance of the $i$-th treatment. We can pool these $v$ sampling variances and obtain the following:

$$\frac{(n-1)S_1^2 + \cdots + (n-1)S_v^2}{(n-1) + \cdots + (n-1)} = \frac{SS_E}{v(n-1)} = MS_E,$$

where $MS_E$ denotes the mean error sum of squares. Note that $S_i^2$ is an estimator of the variance $\sigma^2$ for the $i$-th treatment, and it may be seen that $MS_E$ is an estimator of the common variance within each of the treatments. Similarly, we can also use the variation of the treatment averages from the grand average, under the assumption that there is no difference among the treatment means, for estimation of $\sigma^2$. That is, the mean treatment sum of squares is given by

$$MS_{Tr} = \frac{SS_{Tr}}{v-1} = \frac{n \sum_{i=1}^{v} (\bar{y}_{i.} - \bar{y}_{..})^2}{v-1}.$$

An interesting hypothesis testing problem is about the equality of the treatment effects: $H: \tau_1 = \tau_2 = \cdots = \tau_v$ against the alternative $K: \tau_i \neq \tau_j$ for at least one pair $i \neq j$. The details can then be presented in the ANOVA Table 13.2.

Table 13.2 ANOVA for the CRD Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$F$-Statistic
Between Treatments	$SS_{Tr}$	$v-1$	$MS_{Tr}$	$MS_{Tr}/MS_E$
Error within Treatments	$SS_E$	$v(n-1)$	$MS_E$
Total	$SS_T$	$vn-1$

In light of Theorem 6.6.2, the sampling distributions of $SS_{Tr}/\sigma^2$ and $SS_E/\sigma^2$ may be seen as $\chi^2$-distributions with $v-1$ and $v(n-1)$ degrees of freedom respectively, the former under the hypothesis $H$. Finally, the sampling distribution of $MS_{Tr}/MS_E$ under $H$ is seen to be an $F$-distribution, see Theorem 6.3.4, with $(v-1, v(n-1))$ degrees of freedom.
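The decomposition behind the ANOVA table can be checked numerically. The sketch below works on simulated data (the group means, sample sizes, and seed are hypothetical), computes the total, treatment, and error sums of squares by hand, and compares the resulting F statistic with the one reported by aov.

```r
set.seed(42)
v <- 4; n <- 6                                   # assumed design size
g <- factor(rep(LETTERS[1:v], each = n))         # treatment labels
y <- rnorm(v * n, mean = rep(c(5, 5, 7, 8), each = n), sd = 1)  # simulated yields

grand_mean <- mean(y)
trt_means  <- tapply(y, g, mean)

ss_total <- sum((y - grand_mean)^2)               # total (corrected) sum of squares
ss_treat <- n * sum((trt_means - grand_mean)^2)   # between treatments
ss_error <- sum((y - ave(y, g))^2)                # within treatments

ms_treat <- ss_treat / (v - 1)
ms_error <- ss_error / (v * (n - 1))
f_stat   <- ms_treat / ms_error
p_value  <- pf(f_stat, df1 = v - 1, df2 = v * (n - 1), lower.tail = FALSE)

fit <- summary(aov(y ~ g))[[1]]                   # ANOVA table from R
all.equal(ss_total, ss_treat + ss_error)          # partition of the total
all.equal(f_stat, fit[1, 4])                      # same F statistic as aov
```

The function ave returns, for each observation, the mean of its own group, which avoids indexing the vector of group means by a factor.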

Before a formal illustration of the CRD model, let us take an EDA route for understanding ANOVA. The package granova has a very interesting graphical tool granova.1w, where 1w stands for “one-way layout”. The illustration through example(granova.1w) is first considered. The granova may be useful for outlier identification, skewness, etc.

Example 13.3.2. `granova` Tool for Weight Gain Problem

The weight change for 72 young female anorexia patients is available in the anorexia dataset of the MASS package. Understand more about the dataset with ?anorexia. The patients were randomly allocated to three different treatments: CBT, Cont, and FT. The aim is to find if the weight gain is the same across the three treatments. First, the weight gain is defined by anorexia[,3]-anorexia[,2] and stored in w.gain. The ANOVA technique with aov clearly shows that the weight gain differs significantly across the three treatments: aov(w.gain ~ anorexia[,1]).

> library(MASS)
> library(granova) # provides granova.1w
> data(anorexia)
> w.gain <- anorexia[,3]-anorexia[,2]
> summary(aov(w.gain ~ anorexia[,1]))
              Df Sum Sq Mean Sq F value   Pr(>F)
anorexia[, 1]  2  614.6 307.322  5.4223 0.006499 **
Residuals     69 3910.7  56.677
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> pdf("Anorexia_granova.pdf")
> granova.1w(w.gain,group=anorexia[,1])
$grandsum
    Grandmean        df.bet      df.with       MS.bet      MS.with
         2.76          2.00        69.00       307.32        56.68
       F.stat        F.prob SS.bet/SS.tot
         5.42          0.01          0.14
$stats
     Size Contrast Coef Wt'd Mean  Mean Trim'd Mean  Var. St. Dev.
Cont   26         -3.21     -0.49 -0.45       -1.16 63.82     7.99
CBT    29          0.24      3.63  3.01        1.80 53.41     7.31
FT     17          4.50      5.15  7.26        7.91 51.23     7.16
> dev.off()
null device
          1

At the center of the plot in Figure 13.1, we see the total sample mean of the observations as given in Equation 13.4. The blue box represents the within sum of squares given in Equation 13.7, and the red box the between sum of squares in Equation 13.6. Since we reject the hypothesis that the weight gains under the different treatments are equal, it may be seen from Figure 13.1 that the mean weight gain is largest for the FT treatment, at 7.2647. □

Figure 13.1 “Granova” Plot for the Anorexia Dataset

In the next example, we will find out how R handles the covariates for the CRD model!

Example 13.3.3. The Tensile Strength Experiment

Page 70 of Montgomery (2005). An engineer wants to find out if the cotton weight percentage in a synthetic fiber affects the tensile strength. Towards this, the cotton weight percentage is fixed at 5 different levels of 15, 20, 25, 30, and 35. Each level of the percentage is assigned five experimental units and the tensile strength is measured for each of them. The randomization is specified in the Run_Number column. The goal of the engineer is to investigate if $\tau_{15} = \tau_{20} = \tau_{25} = \tau_{30} = \tau_{35}$, where the subscript denotes the percentage level. Towards this goal, a CRD model is fitted and the ANOVA table is obtained with the next program.

> data(tensile)
> tensile$CWP <- as.factor(tensile$CWP)
> tensile_aov <- aov(Tensile_Strength~CWP, data=tensile)
> summary(tensile_aov)
            Df Sum Sq Mean Sq F value   Pr(>F)
CWP          4  475.8  118.94   14.76 9.13e-06 ***
Residuals   20  161.2    8.06
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> model.matrix(tensile_aov)
   (Intercept) CWP20 CWP25 CWP30 CWP35
1            1     1     0     0     0
2            1     0     0     1     0
3            1     1     0     0     0
4            1     0     0     0     1
12           1     0     0     0     0
13           1     0     1     0     0
14           1     1     0     0     0
24           1     0     0     1     0
25           1     0     0     0     0
attr(,"assign")
[1] 0 1 1 1 1
attr(,"contrasts")
attr(,"contrasts")$CWP
[1] "contr.treatment"

Note that since the cotton weight percentage is recorded as numbers, R reads it as a numeric vector. However, it is vital that the variable be read as a factor; otherwise aov fits a single slope for CWP with one degree of freedom, and the output will be drastically different and technically wrong for the CRD model. The $p$-value clearly indicates that the weight percentage is significant for determining the tensile strength of the fiber. The R function model.matrix shows that the design matrix for the CRD model is prepared along the lines of Table 13.1; only selected rows of its output are displayed above. □
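The effect of forgetting the factor conversion can be seen on a small simulated dataset. The numbers below are hypothetical and are not the tensile data; only the structure, five numerically coded levels with five replicates each, mimics the example.

```r
set.seed(7)
level <- rep(c(15, 20, 25, 30, 35), each = 5)   # numeric codes for 5 levels
resp  <- rnorm(25, mean = rep(c(10, 15, 18, 22, 11), each = 5), sd = 3)  # simulated

fit_numeric <- aov(resp ~ level)           # level treated as one numeric covariate
fit_factor  <- aov(resp ~ factor(level))   # one-way layout with 5 groups

df_numeric <- summary(fit_numeric)[[1]][1, 1]   # 1 degree of freedom
df_factor  <- summary(fit_factor)[[1]][1, 1]    # 4 degrees of freedom
c(numeric = df_numeric, factor = df_factor)
```

With the numeric coding, R estimates a straight-line trend across the levels, which is a different model from the CRD effects model with one parameter per treatment.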

We will consider one more example for the CRD model 13.2 from Dean and Voss (1999).

Example 13.3.4. The Olson Heart Lung Dataset

We need to determine the effect of the number of revolutions per minute (rpm) of the rotary pump head of an Olson heart–lung pump on the fluid flow rate Liters_minute. The rpm's are replicated at 50, 75, 100, 125, and 150 levels with respective frequencies 5, 3, 5, 2, and 5. The fluid flow rate is measured in liters per minute.

> data(olson)
> par(mfrow=c(2,2))
> plot(olson$rpm,olson$Liters_minute,xlim=c(25,175),xlab="RPM", ylab="Flow Rate",main="Scatter Plot")
> boxplot(Liters_minute~rpm,data=olson,main="Box Plots")
> aggregate(olson$Liters_minute,by=list(olson$rpm),mean)
  Group.1      x
1      50 1.1352
2      75 1.7220
3     100 2.3268
4     125 2.9250
5     150 3.5292
> olson_crd <- aov(Liters_minute ~ as.factor(rpm), data=olson)
> olson_crd
Call:
   aov(formula = Liters_minute ~ as.factor(rpm), data = olson)
Terms:
                as.factor(rpm) Residuals
Sum of Squares        16.12551   0.02084
Deg. of Freedom              4        15
Residual standard error: 0.03727412
Estimated effects may be unbalanced
> summary(olson_crd)
               Df Sum Sq Mean Sq F value Pr(>F)
as.factor(rpm)  4 16.126   4.031    2902 <2e-16 ***
Residuals      15  0.021   0.001
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> confint(olson_crd)
                      2.5 %    97.5 %
(Intercept)       1.0996698 1.1707302
as.factor(rpm)75  0.5287795 0.6448205
as.factor(rpm)100 1.1413527 1.2418473
as.factor(rpm)125 1.7233291 1.8562709
as.factor(rpm)150 2.3437527 2.4442473
> library(BHH2) # provides anovaPlot
> anovaPlot(olson_crd,main="Box-Hunter-Hunter ANOVA Plot")
> granova.1w(olson$Liters_minute,group=olson$rpm, main="Graphical ANOVA")
$grandsum
    Grandmean        df.bet       df.with        MS.bet       MS.with
         2.30          4.00         15.00          4.03          0.00
       F.stat        F.prob SS.bet/SS.tot
      2901.61          0.00          1.00
$stats
    Size Contrast Coef Wt'd Mean Mean Trim'd Mean Var. St. Dev.
50     5         -1.16      1.42 1.14        1.13 0.00     0.01
75     3         -0.58      1.29 1.72        1.72 0.00     0.03
100    5          0.03      2.91 2.33        2.33 0.00     0.02
125    2          0.63      1.46 2.92        2.92 0.01     0.08
150    5          1.23      4.41 3.53        3.52 0.00     0.05

The scatter plot in Figure 13.2 clearly indicates that as the RPM increases, the flow rate increases too. This is further confirmed by the box plots, which show that the central levels (medians, strictly speaking, rather than averages) are different, and since the boxes do not overlap, the differences also appear to be significant. The ANOVA table may be visualized as displayed by the output of granova.1w, and the Box-Hunter-Hunter ANOVA plot is left to the reader for interpretation.

Figure 13.2 Box Plots for the Olson Data

Since the $p$-value, in Pr(>F), is practically zero, we reject the hypothesis that the treatment effects are equal. The 95% confidence intervals for the four treatment effects, each measured relative to the baseline level of 50 rpm, do not contain 0, and hence we can conclude that each of these rpm levels has a non-zero effect on the flow rate. □

Validation of the model assumptions is considered next.

13.3.4 Validation of Model Assumptions

The CRD model is a linear regression model, as in 12.1. The assumptions for the model are the same as detailed in Section 12.2, that is, the linearity, independence, and normality assumptions. For the model under consideration, the plots, as in Section 12.2.6, give us all that is required. The demonstration is carried out for Example 13.3.3.

Example 13.3.5. The Tensile Strength Experiment

Contd. The following plots are obtained for testing the model adequacy of the CRD model for the tensile strength experiment:

Plot of Residuals vs Predictor Variable
Plot of Absolute Residual Values vs Predictor Variable
Normal Q-Q Plot
Plot of Residuals vs Fitted Values
Sequence Plot of the Residuals
Box Plot of the Residuals

The next program gives the required plots.

> tensile_resid <- residuals(tensile_aov)
> pdf("Tensile_Model_Assumptions.pdf")
> par(mfrow=c(2,3))
> plot(tensile$CWP,tensile_resid,
+ main="Plot of Residuals Vs Predictor Variable",
+ ylab="Residuals",xlab="Predictor Variable")
> plot(tensile$CWP,abs(tensile_resid),
+ main="Plot of Absolute Residual Values \n Vs Predictor Variable",
+ ylab="Absolute Residuals", xlab="Predictor Variable")
> qqnorm(tensile_resid)
> qqline(tensile_resid)
> plot(tensile_aov$fitted.values,tensile_resid,
+ main="Plot of Residuals Vs Fitted Values",
+ ylab="Residuals", xlab="Fitted Values")
> plot.ts(tensile_resid, main="Sequence Plot of the Residuals")
> boxplot(tensile_resid,main="Box Plot of the Residuals")
> dev.off()
null device
          1

The randomness in the plots, see Figure 13.3, “Plot of Residuals vs Predictor Variable”, “Plot of Absolute Residual Values vs Predictor Variable”, “Plot of Residuals vs Fitted Values”, and “Sequence Plot of the Residuals” clearly indicate that the assumptions of linearity and independence are appropriate for the tensile dataset. The box plot “Box Plot of the Residuals” shows that there are no outliers, and finally the “Normal Q-Q Plot” justifies the normality assumption for the error term. □

Figure 13.3 Model Adequacy Plots for the Tensile Strength Experiment

13.3.5 Contrasts and Multiple Testing for the CRD Model

Let us begin with a definition.

The statistical interest is in the problem of testing the hypothesis $c13-math-0114$ against the alternative $c13-math-0115$ . An estimate of the contrast $c13-math-0116$ is given by

The test procedure is to reject $c13-math-0118$ if $c13-math-0119$ , where

The test statistic $c13-math-0121$ is rewritten as

where $c13-math-0123$ and $c13-math-0124$ is defined by
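For orientation, the quantities involved in testing a contrast can be sketched in one common textbook notation. The symbols here are assumptions made for this sketch rather than the chapter's typeset formulas: $v$ treatments with $n$ replicates each, contrast coefficients $c_i$, treatment totals $y_{i.}$, and the mean error sum of squares $MS_E$. The sum of squares and the $F$ ratio are the same quantities that the program in the next example computes as ssc and f0.

```latex
\begin{align*}
\Gamma &= \sum_{i=1}^{v} c_i \tau_i, \qquad \sum_{i=1}^{v} c_i = 0, \\
H &: \Gamma = 0 \quad \text{against} \quad K : \Gamma \neq 0, \\
C &= \sum_{i=1}^{v} c_i \, y_{i.}, \\
SS_C &= \frac{\left( \sum_{i=1}^{v} c_i \, y_{i.} \right)^2}{n \sum_{i=1}^{v} c_i^2}, \\
F_0 &= \frac{SS_C / 1}{MS_E}, \qquad \text{reject } H \text{ if } F_0 > F_{\alpha,\, 1,\, v(n-1)}.
\end{align*}
```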

Some contrasts are considered in the next example.

Example 13.3.7. Contrast for the Tensile Strength Experiment

The four contrasts considered are given in the following:

The next program gives the necessary computations.

> tensileisum <- aggregate(tensile$Tensile_Strength,by= list(tensile$CWP),FUN=sum)$x
> mse <- summary(tensile_aov)[[1]][2,3]
> # H : t4 = t5
> ssc <- ((tensileisum[4]-tensileisum[5])^2)/(5*2)
> (f0 <- ssc/mse)
[1] 36.17866
> 1-pf(f0,df1=1,df2=20)
[1] 7.011202e-06
> # H : t1 + t3 = t4 + t5
> ssc <- ((tensileisum[1]+tensileisum[3]-tensileisum[4]-tensileisum[5])^2)/(5*4)
> (f0 <- ssc/mse)
[1] 3.877171
> 1-pf(f0,df1=1,df2=20)
[1] 0.06295952
> # H : t1 = t3
> ssc <- ((tensileisum[1]-tensileisum[3])^2)/(5*2)
> (f0 <- ssc/mse)
[1] 18.87097
> 1-pf(f0,df1=1,df2=20)
[1] 0.0003147387
> # H : 4t2 = t1 + t3 + t4 + t5
> ssc <- ((4*tensileisum[2]-tensileisum[1]-tensileisum[3]-
+ tensileisum[4]-tensileisum[5])^2)/(5*20)
> (f0 <- ssc/mse)
[1] 0.1004963
> 1-pf(f0,df1=1,df2=20)
[1] 0.7545203

It may thus be noted that at the 5% significance level, only the two contrasts $\tau_4 - \tau_5$ and $\tau_1 - \tau_3$ are significant, with $p$-values of about $7 \times 10^{-6}$ and $0.0003$ respectively. □
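The four computations above repeat the same steps, so they can be wrapped in a small helper. In this sketch contrast_test is a hypothetical function written for illustration. The treatment totals are an assumption: they are the values from Montgomery's published table and are used because they reproduce the F statistics printed above. The mean square error 8.06 and its 20 degrees of freedom are read from the ANOVA table of Example 13.3.3.

```r
# Assumed treatment totals for cotton levels 15, 20, 25, 30, 35
totals   <- c(49, 77, 88, 108, 54)
n        <- 5       # replicates per treatment
mse      <- 8.06    # mean square error (161.2 / 20)
df_error <- 20

# F test for a single contrast with coefficients 'coef' (must sum to zero)
contrast_test <- function(coef, totals, n, mse, df_error) {
  stopifnot(length(coef) == length(totals), abs(sum(coef)) < 1e-12)
  ssc <- sum(coef * totals)^2 / (n * sum(coef^2))   # contrast sum of squares
  f0  <- ssc / mse
  c(F = f0, p.value = pf(f0, 1, df_error, lower.tail = FALSE))
}

contrasts_list <- list(
  "t4 = t5"                  = c(0, 0, 0, 1, -1),
  "t1 + t3 = t4 + t5"        = c(1, 0, 1, -1, -1),
  "t1 = t3"                  = c(1, 0, -1, 0, 0),
  "4 t2 = t1 + t3 + t4 + t5" = c(-1, 4, -1, -1, -1)
)
results <- t(sapply(contrasts_list, contrast_test,
                    totals = totals, n = n, mse = mse, df_error = df_error))
results
```

The check inside the helper enforces that the coefficients add up to zero, which is what distinguishes a contrast from an arbitrary linear combination.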

The problem of multiple testing was dealt with in Section 17.15. Here, the results are specialized to the CRD model.

If the hypothesis $c13-math-0129$ is rejected, we know that at least two treatment levels have significantly different effects. This calls for techniques which will help the reader to identify which treatments are significantly different. The details of the multiple comparison problem may be found in Miller (1981), Hsu (1996), and Bretz, et al. (2011).

For the multiple testing problem, we are already familiar with Bonferroni's method and Holm's method. Tukey's honest significant difference (HSD) and Dunnett's procedures are detailed next.

In a CRD model with $c13-math-0130$ treatment levels, the interest is in comparison of the equality for each possible combination of the levels, that is, there are $c13-math-0131$ possible hypotheses. Tukey's procedure uses the distribution of the Studentized range statistic:

13.9

where $c13-math-0133$ and $c13-math-0134$ are the largest and smallest sample means out of the $c13-math-0135$ possible means. Tukey's HSD procedure declares two means to be significantly different if the absolute value of their differences exceeds

where $c13-math-0137$ is the upper $c13-math-0138$ percentage point of $c13-math-0139$ and $c13-math-0140$ is the number of degrees of freedom associated with the $c13-math-0141$ . Thus, a $c13-math-0142$ % confidence interval for all possible pairs of means is given by

Suppose that one of the $c13-math-0144$ treatment levels is a control level and comparisons are required with respect to this control level. The Dunnett's procedure offers a solution in this case. The hypotheses are then $c13-math-0145$ , which then need to be tested against the set of hypotheses $c13-math-0146$ . The Dunnett's procedure begins with computation of the differences:

The test procedure is to reject the hypothesis $c13-math-0148$ at size $c13-math-0149$ if

where $c13-math-0151$ corresponds to Dunnett's distribution.
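The quantities used by the two procedures can be sketched in one common textbook notation. The symbols below are assumptions made for this sketch rather than the chapter's typeset formulas: $v$ treatment levels with $n$ replicates each, so that there are $v(v-1)/2$ pairwise comparisons; $f$ the error degrees of freedom; $MS_E$ the mean error sum of squares; $q_\alpha(v, f)$ the upper $\alpha$ point of the Studentized range distribution; and $d_\alpha(v-1, f)$ the critical value of Dunnett's distribution, with level $c$ as the control.

```latex
\begin{align*}
q &= \frac{\bar{y}_{\max} - \bar{y}_{\min}}{\sqrt{MS_E / n}}, \\
T_\alpha &= q_\alpha(v, f) \sqrt{\frac{MS_E}{n}}, \\
\bar{y}_{i.} - \bar{y}_{j.} &\pm q_\alpha(v, f) \sqrt{\frac{MS_E}{n}}, \qquad i \neq j, \\
\left| \bar{y}_{i.} - \bar{y}_{c.} \right| &> d_\alpha(v-1, f) \sqrt{\frac{2\, MS_E}{n}}, \qquad i \neq c.
\end{align*}
```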

The four methods of multiple testing are next illustrated for the tensile strength experiment.

Example 13.3.8. Multiple Testing for the Tensile Strength Experiment

The R package multcomp is useful for multiple testing. For more details about the package, refer to Bretz, et al. (2011). R codes are in action now.

> library(multcomp)
> tensile.mc <- glht(tensile_aov, linfct = mcp(CWP="Dunnett"),
+ alternative="two.sided")
> summary(tensile.mc)
  Simultaneous Tests for General Linear Hypotheses
Multiple Comparisons of Means: Dunnett Contrasts
Fit: aov(formula = Tensile_Strength ∼ CWP, data = tensile)
Linear Hypotheses:
             Estimate Std. Error t value Pr(>|t|)
20 - 15 == 0    5.600      1.796   3.119  0.01867 *
25 - 15 == 0    7.800      1.796   4.344  0.00115 **
30 - 15 == 0   11.800      1.796   6.572  < 0.001 ***
35 - 15 == 0    1.000      1.796   0.557  0.94693
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
(Adjusted p values reported -- single-step method)
> summary(tensile.mc,test=adjusted(type="bonferroni"))
  Simultaneous Tests for General Linear Hypotheses
Multiple Comparisons of Means: Dunnett Contrasts
Fit: aov(formula = Tensile_Strength ∼ CWP, data = tensile)
Linear Hypotheses:
             Estimate Std. Error t value Pr(>|t|)
20 - 15 == 0    5.600      1.796   3.119  0.02164 *
25 - 15 == 0    7.800      1.796   4.344  0.00126 **
30 - 15 == 0   11.800      1.796   6.572 8.43e-06 ***
35 - 15 == 0    1.000      1.796   0.557  1.00000
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
(Adjusted p values reported -- bonferroni method)
> summary(tensile.mc,test=adjusted(type="holm"))
  Simultaneous Tests for General Linear Hypotheses
Multiple Comparisons of Means: Dunnett Contrasts
Fit: aov(formula = Tensile_Strength ∼ CWP, data = tensile)
Linear Hypotheses:
             Estimate Std. Error t value Pr(>|t|)
20 - 15 == 0    5.600      1.796   3.119 0.010818 *
25 - 15 == 0    7.800      1.796   4.344 0.000944 ***
30 - 15 == 0   11.800      1.796   6.572 8.43e-06 ***
35 - 15 == 0    1.000      1.796   0.557 0.583753
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
(Adjusted p values reported -- holm method)
> TukeyHSD(tensile_aov)
  Tukey multiple comparisons of means
    95% family-wise confidence level
Fit: aov(formula = Tensile_Strength ∼ CWP, data = tensile)
$CWP
       diff         lwr        upr     p adj
20-15   5.6   0.2270417 10.9729583 0.0385024
25-15   7.8   2.4270417 13.1729583 0.0025948
30-15  11.8   6.4270417 17.1729583 0.0000190
35-15   1.0  -4.3729583  6.3729583 0.9797709
25-20   2.2  -3.1729583  7.5729583 0.7372438
30-20   6.2   0.8270417 11.5729583 0.0188936
35-20  -4.6  -9.9729583  0.7729583 0.1162970
30-25   4.0  -1.3729583  9.3729583 0.2101089
35-25  -6.8 -12.1729583 -1.4270417 0.0090646
35-30 -10.8 -16.1729583 -5.4270417 0.0000624

The function name glht stands for General Linear Hypotheses, following the description in the multcomp package. In the Dunnett, Bonferroni, and Holm multiple testing procedures, the treatment level 15 is taken as the base for the purposes of comparison. The significance of the difference between treatment levels is shown through the $p$-values. In all three procedures, the levels 20, 25, and 30 differ significantly from level 15 at the 5% level, whereas level 35 does not. □
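The half-width of the Tukey intervals reported above can be recovered directly from the Studentized range distribution. This is a minimal sketch that takes the mean square error 8.06 and its 20 degrees of freedom from the ANOVA table of Example 13.3.3; the variable names are chosen here for illustration.

```r
v <- 5; n <- 5                  # treatment levels and replicates in the tensile design
mse <- 8.06; df_error <- 20     # from the ANOVA table of the tensile experiment

# Upper 5% point of the Studentized range for v means and df_error degrees of freedom
q_crit <- qtukey(0.95, nmeans = v, df = df_error)

# Two means are declared different if their absolute difference exceeds hsd
hsd <- q_crit * sqrt(mse / n)
hsd
```

The value agrees with the TukeyHSD output, where each interval is the observed difference plus or minus about 5.373, for instance 5.6 - 0.2270417 for the comparison 20-15.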

The tensile strength experiment is an example of a balanced design. For an unbalanced design, fortunately, the only change is the use of $n_i$ instead of $n$, and all the results continue to hold. In R there are no changes in the structure of the commands and functionality. Thus, the user can easily solve the multiple testing problem for the Olson dataset, which is an example of an unbalanced design.

Example 13.3.9. The Olson Heart Lung Dataset

Contd. For the sake of brevity, only the R codes are given here.

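olson$rpmf <- as.factor(olson$rpm) # mcp() needs a named factor in the model
olson_crd <- aov(Liters_minute ~ rpmf, data=olson) # refit with the named factor
olson.mc <- glht(olson_crd, linfct = mcp(rpmf="Dunnett"),
alternative="two.sided")
summary(olson.mc)
summary(olson.mc,test=adjusted(type="bonferroni"))
summary(olson.mc,test=adjusted(type="holm"))
TukeyHSD(olson_crd)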

The reader should complete the exercise of running the program and analyze the results. □

$c13-math-0155$

13.4 Block Designs

In the CRD model, the source of variation arises due to the different treatment levels. It is also assumed that the nuisance factor is completely unknown and uncontrollable. In the case of the nuisance factor being known and controllable, the experiment can be designed to account for such a source of error through blocking.

13.4.1 Randomization and Analysis of Balanced Block Designs

In a block model, a comparison needs to be carried out for the different treatment levels and blocks. In a balanced block design, each block will have one observation per treatment level. It is to be noted that randomization is only carried out on the treatments within a block. The effects model for a balanced block design model is specified by

$$y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}, \quad i = 1, \ldots, v, \; j = 1, \ldots, b, \tag{13.10}$$

where $\mu$ is the overall mean, $\tau_i$ is the effect of the $i$-th treatment, and $\beta_j$ is the effect of the $j$-th block. Also, define $c13-math-0162$ . As previously, we assume that the error term $\epsilon_{ij}$ follows a Gaussian distribution $N(0, \sigma^2)$.

The randomization for the above design 13.10 is illustrated next.

Example 13.4.1. Randomization for a Balanced Block Design

Suppose that we have $c13-math-0165$ blocks and $c13-math-0166$ treatment levels. Thus, we need $c13-math-0167$ experimental units. The problem is that of allocating the units in such a way that meets the requirements of the model. The next R program achieves the randomization.

> b <- 4; v <- 5
> eunits <- 1:(b*v)
> b_alloc <- rep(1:b,each=v)
> tlevel_alloc <- NULL
> for(i in 1:b) tlevel_alloc[((i-1)*v+1):(i*v)] <-
+ sample(1:v,rep=FALSE)
> cbind(sample(1:(b*v),rep=FALSE),b_alloc,tlevel_alloc)
         b_alloc tlevel_alloc
 [1,] 13       1            2
 [2,] 18       1            4
 [3,]  9       1            3
 [4,]  5       1            1
 [5,] 15       1            5
 [6,]  3       2            2
[19,]  7       4            4
[20,] 12       4            2

Note the slight programming difference between the randomization for block designs and that for a CRD model: the treatments are permuted separately within each block. □
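The same randomization can be written without an explicit loop. The following is only a sketch: the seed, the data frame name plan, and its column names are illustrative choices. It also checks the defining property of the design, namely that every block receives each treatment level exactly once.

```r
set.seed(1)            # arbitrary seed, only to make the sketch reproducible
b <- 4; v <- 5         # blocks and treatment levels, as in the example

# One independent permutation of the v treatments per block
tlevel_alloc <- as.vector(replicate(b, sample(v)))
b_alloc <- rep(1:b, each = v)

# Randomly assign the b*v experimental units to the (block, treatment) slots
plan <- data.frame(eunit = sample(b * v), block = b_alloc,
                   treatment = tlevel_alloc)

# Every block should contain each treatment level exactly once
table(plan$block, plan$treatment)
```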

Towards inference for the randomized block design, define the following quantities:

The decomposition of the total sum of squares $c13-math-0169$ is given below:

13.11

where

Observe that $c13-math-0172$ has $c13-math-0173$ degrees of freedom, $c13-math-0174$ has $c13-math-0175$ df, $c13-math-0176$ has $c13-math-0177$ df, and finally, $c13-math-0178$ has $c13-math-0179$ df. Thus, the ANOVA table is then set up as given in Table 13.3. The balanced block design will be illustrated now.

Table 13.3 ANOVA for the Randomized Balanced Block Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$c13-math-0180$ -Statistic
Between Treatments	$c13-math-0181$	$c13-math-0182$	$c13-math-0183$	$c13-math-0184$
Between Blocks	$c13-math-0185$	$c13-math-0186$	$c13-math-0187$	$c13-math-0188$
Error	$c13-math-0189$	$c13-math-0190$	$c13-math-0191$
Total	$c13-math-0192$	$c13-math-0193$
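To see how the entries of the ANOVA table arise, the sums of squares can be computed by hand and compared with the output of aov. The sketch below uses simulated data; the numbers of treatments and blocks, the effect sizes, and all object names are assumptions made for illustration only.

```r
set.seed(13)                       # arbitrary seed for the simulated data
v <- 4; b <- 6                     # hypothetical numbers of treatments and blocks
dat <- expand.grid(trt = factor(1:v), blk = factor(1:b))
dat$y <- 10 + as.numeric(dat$trt) + 0.5 * as.numeric(dat$blk) +
  rnorm(v * b, sd = 0.3)

grand <- mean(dat$y)
trt_means <- tapply(dat$y, dat$trt, mean)
blk_means <- tapply(dat$y, dat$blk, mean)

ss_total <- sum((dat$y - grand)^2)
ss_trt   <- b * sum((trt_means - grand)^2)   # each treatment appears b times
ss_blk   <- v * sum((blk_means - grand)^2)   # each block holds v observations
ss_err   <- ss_total - ss_trt - ss_blk

# The same decomposition as reported by aov
fit <- aov(y ~ trt + blk, data = dat)
tab <- summary(fit)[[1]]
cbind(by_hand = c(ss_trt, ss_blk, ss_err), aov = tab[["Sum Sq"]])
```

The degrees of freedom in tab follow the same pattern as the table: one less than the number of treatments, one less than the number of blocks, and their product for the error.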

Example 13.4.2. Strength Dataset of a Girder Experiment

Consider the girder experiment, which has 9 blocks and 4 treatments. The aim of the experiment is to find if the shear strength depends on the blocks, varying from S1.1 to S4.2, or on the method of preparation: Aarau, Karlsruhe, Lehigh, and Cardiff. A bit of data preparation is needed from the csv file, and after the preparation, the ANOVA table is obtained using the aov function in R. Recall the analysis of this dataset in Example 4.5.3 of Section 4.5 through the median polish algorithm. The data arrangement is slightly different in the block design model.

> data(girder)
> names(girder)[2:5] <- c("Aar","Kar","Leh","Car")
> gf <- as.character(rep(girder$Girder,each=4))
> mf <- as.character(rep(colnames(girder)[2:5],9))
> ss <- NULL # Shear Strength
> for(i in 1:nrow(girder)) ss <- c(ss, as.numeric(girder[i,2:5]))
> girdernew <- data.frame(mf, gf, ss)
> ssaov <- aov(ss~gf + mf, data=girdernew)
> summary(ssaov)
            Df Sum Sq Mean Sq F value  Pr(>F)
gf           8 0.0895  0.0112   1.619   0.172
mf           3 1.5138  0.5046  73.027 3.3e-12 ***
Residuals   24 0.1658  0.0069
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> model.matrix(ssaov)
   (Intercept) gfS1.2  mfCar mfKar mfLeh
1            1      0      0     0     0
2            1      0      0     1     0
3            1      0      0     0     1
4            1      0      1     0     0
5            1      0      0     0     0
6            1      0      0     1     0
7            1      0      0     0     1
8            1      0      1     0     0
33           1      0      0     0     0
34           1      0      0     1     0
35           1      0      0     0     1
36           1      0      1     0     0

After reading the data, the names of the variables have been shortened so that the output of model.matrix displays all the variables on a single line. The codes rep(girder$Girder,each=4) and rep(colnames(girder)[2:5],9), wrapped in as.character, create the block and method labels, which aov then treats as factors. The vector ss now contains the shear strength for each combination of the method of preparation and the girder. Finally, aov(ss ~ gf + mf, data=girdernew) sets up the required block design in R. The interpretation is as follows. The large $c13-math-0194$ -value of 0.172 for gf suggests that the blocks have an insignificant effect on the output. However, the method of preparation is found to have a significant effect on the strength of the material. □
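The wide-to-long rearrangement needed for a block design can also be done with the base R function stack. The sketch below uses a small made-up table laid out like girder (one row per block, one column per method); the object names wide and long and all values are hypothetical.

```r
# Hypothetical wide table in the same layout as girder: one row per block,
# one column per method (values are made up)
wide <- data.frame(Girder = c("G1", "G2", "G3"),
                   Aar = c(0.77, 1.02, 1.10),
                   Kar = c(1.19, 1.20, 1.23),
                   Leh = c(1.06, 0.99, 1.05),
                   Car = c(0.87, 0.96, 0.97))

# stack() puts the method columns on top of each other and records the
# column of origin in 'ind'
long <- stack(wide[, -1])
names(long) <- c("ss", "mf")
long$gf <- factor(rep(wide$Girder, times = ncol(wide) - 1))

str(long)
summary(aov(ss ~ gf + mf, data = long))
```

Because stack works column by column, the block labels have to be repeated once per method column, which is what the rep call does.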

As seen in Models 13.2, 13.10, etc., these linear models need to be investigated for model assumptions and other regression diagnostics too. Can it be said without loss of generality that leverages are not a concern in DOE since all the covariate values are fixed? Now, we consider an example from Montgomery (2005), wherein we will investigate issues related to model adequacy.

Example 13.4.3. Hardness Example

Four types of tip are used which form the blocks in this experiment. The variable of interest is the hardness which further depends on the type of metal coupon. For each type of tip, the hardness is observed for four different types of metal coupon. In the next R segment, an ANOVA is first obtained to check if the tip type and coupon type affect the hardness of the coupon. The assumption of normality for the model error is validated through the “Q-Q” plot. Diagnostic measures, leverage points, etc., are obtained using the functions as seen earlier for the linear regression model and the CRD model, Sections 12.5 and 13.3.

> data(hardness)
> hardness$Tip_Type <- as.factor(hardness$Tip_Type)
> hardness$Test_Coupon <- as.factor(hardness$Test_Coupon)
> hardness_aov <- aov(Hardness~Tip_Type+Test_Coupon,data=hardness)
> summary(hardness_aov)
            Df Sum Sq Mean Sq F value   Pr(>F)
Tip_Type     3  0.385 0.12833   14.44 0.000871 ***
Test_Coupon  3  0.825 0.27500   30.94 4.52e-05 ***
Residuals    9  0.080 0.00889
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> p <- 7
> diagnostics_matrix <- matrix(nrow=nrow(hardness),ncol=8)
> colnames(diagnostics_matrix) <- c("Obs No.","y","Pred y",
+ "Residual","Leverage","S.Residual","Cooks.Dist","Outlier")
> diagnostics_matrix[,1] <- 1:nrow(hardness)
> diagnostics_matrix[,2] <- hardness$Hardness
> diagnostics_matrix[,3] <- hardness_aov$fitted.values
> diagnostics_matrix[,4] <- round(hardness_aov$residuals,3)
> diagnostics_matrix[,5] <- influence(hardness_aov)$hat
> diagnostics_matrix[,6] <- round(rstandard(hardness_aov),3)
> diagnostics_matrix[,7] <- round(cooks.distance(hardness_aov),3)
> diagnostics_matrix[,8] <- diagnostics_matrix[,6]*sqrt((16-p-1)/
+ (16-p-diagnostics_matrix[,6]^2))
> diagnostics_matrix
      Obs No.    y  Cooks.Dist Outlier
 [1,]       1  9.3       0.056  -0.686
 [2,]       2  9.4       0.014   0.336
 [3,]       3  9.6       0.125  -1.069
 [4,]       4 10.0       0.222   1.512
[15,]      15 10.0       0.014   0.336
[16,]      16 10.2       0.000   0.000
> par(mfrow=c(2,3)); plot(hardness_aov,which=1:6) # Output suppressed

The R lm complementary functions influence, rstandard, and cooks.distance have again been used for the diagnostics of the fitted hardness_aov object. The “Q-Q” plot, see Figure 13.4, shows that the normality assumption is appropriate for the current dataset. Since Cook's distance does not exceed 1 for any of the observations, we do not have any influential observation. □

Figure 13.4 A qq-Plot for the Hardness Data
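The "Outlier" column computed above from the standardized residuals is the externally studentized residual, which base R returns directly through rstudent. The following sketch checks this on simulated data for a 4 by 4 additive layout; the data and all object names are assumptions and are not the hardness data. It also shows that in such a balanced layout every observation has the same leverage.

```r
set.seed(4)                               # arbitrary seed for the simulated data
dat <- expand.grid(tip = factor(1:4), coupon = factor(1:4))
dat$y <- 9.5 + 0.1 * as.numeric(dat$tip) + 0.2 * as.numeric(dat$coupon) +
  rnorm(16, sd = 0.1)                     # made-up response, not the hardness data
fit <- aov(y ~ tip + coupon, data = dat)

n <- nrow(dat)
p <- length(coef(fit))                    # 1 + 3 + 3 = 7 parameters
r <- rstandard(fit)                       # internally studentized residuals

by_formula <- r * sqrt((n - p - 1) / (n - p - r^2))
built_in   <- rstudent(fit)               # externally studentized residuals

# In a balanced two-way layout every observation has the same leverage p/n
diag_df <- data.frame(leverage = hatvalues(fit), by_formula, built_in)
head(diag_df)
```

Constant leverage is a consequence of the balance of the design; with missing cells or unequal replication the leverages would differ and remain worth inspecting.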

13.4.2 Incomplete Block Designs

The constraints of the real world may not allow each block to have an experimental unit for every treatment level. The restriction may force allocation of only $c13-math-0195$ units within each block. Such models are called incomplete block designs. If, in an incomplete block design, every pair of treatments appears together in the same block an equal number of times across the blocks, the design is called a balanced incomplete block design, abbreviated as BIBD. It may be seen that a BIBD may be constructed in $c13-math-0196$ different ways.

The setup is now recollected again. We have $c13-math-0197$ treatments, $c13-math-0198$ distinct blocks, $c13-math-0199$ is the number of units appearing in a block, and $c13-math-0200$ is the total number of observations, where $c13-math-0201$ is the number of times a treatment appears in the design. Then, the number of times each pair of treatments appears in the same block is given by

13.12
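The counting relations of a BIBD can be verified numerically. The sketch below assumes the usual notation (v treatments, b blocks of size k, r replications per treatment, and lambda for the number of blocks shared by any pair of treatments) and the standard identity lambda = r(k - 1)/(v - 1); the particular design is a hypothetical one chosen for illustration.

```r
# A hypothetical BIBD with v = 4 treatments in b = 4 blocks of size k = 3:
# each block omits exactly one treatment
blocks <- list(c(1, 2, 3), c(1, 2, 4), c(1, 3, 4), c(2, 3, 4))
v <- 4; b <- length(blocks); k <- 3

# Treatment-by-block incidence matrix
inc <- sapply(blocks, function(blk) as.integer(1:v %in% blk))

r <- unique(rowSums(inc))            # replications per treatment
lambda <- r * (k - 1) / (v - 1)      # pairs-per-block count from the formula

# Off-diagonal entries of inc %*% t(inc) count how often two treatments
# share a block; in a BIBD they are all equal to lambda
concurrence <- inc %*% t(inc)
concurrence
```

If the off-diagonal entries of the concurrence matrix are not all equal, the incomplete block design is not balanced.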

The statistical model for BIBD is given by

13.13

where $c13-math-0204$ is the $c13-math-0205$ observation in the $c13-math-0206$ block, $c13-math-0207$ is the overall mean, $c13-math-0208$ is the effect of the $c13-math-0209$ treatment, and $c13-math-0210$ is the effect of the $c13-math-0211$ block. The error term $c13-math-0212$ is assumed to follow $c13-math-0213$ . The total variability needs to be handled in a slightly different manner. Define

13.14

where

Using the $c13-math-0216$ 's, the adjusted treatment sum of squares is defined by

Thus, the total variability is partitioned by

13.15

where

The ANOVA table for BIBD model is given in Table 13.4.

Table 13.4 ANOVA for the BIBD Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$c13-math-0220$ -Statistic
Between Treatments	$c13-math-0221$	$c13-math-0222$	$c13-math-0223$	$c13-math-0224$
Between Blocks	$c13-math-0225$	$c13-math-0226$	$c13-math-0227$	$c13-math-0228$
Error	$c13-math-0229$	$c13-math-0230$	$c13-math-0231$
Total	$c13-math-0232$	$c13-math-0233$

The function BIB.test from the agricolae package is useful to fit a BIBD model.

Example 13.4.4. Chemical Reaction Experiment

For a chemical reaction experiment, the blocks arise due to the Batch number, Catalyst of different types form the treatments, and the reaction time is the output. Due to a restriction, all the catalysts cannot be analysed within each batch and hence we need to look at the BIBD model. Using the BIB.test function from the agricolae package, a summary of the ANOVA model related to the BIBD model is obtained.

> library(agricolae)
> data(reaction)
> attach(reaction)
> BIB.test(block=Batch,trt=Catalyst,y=Reaction)
ANALYSIS BIB:  Reaction
Class level information
Block:  1 2 3 4
Trt  :  1 2 3 4
Number of observations:  16
Analysis of Variance Table
Response: Reaction
            Df Sum Sq Mean Sq F value   Pr(>F)
block.unadj  3  55.00 18.3333  28.205 0.001468 **
trt.adj      3  22.75  7.5833  11.667 0.010739 *
Residuals    5   3.25  0.6500
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
coefficient of variation: 1.1 %
Reaction Means: 72.5

The computed ANOVA table shows that both the catalyst and the batch have a significant effect on the reaction of the units. □

Two very important variations of the BIBD will be considered in the rest of the section.

13.4.3 Latin Square Design

In each of the Examples 13.4.1 to 13.4.4, and the blocking models 13.2 and 13.10, we had a single known and controllable source of variation, and we used the blocking principle to overcome it. Suppose now that there are two sources of variation which may be controlled through blocking. Consider the following example from Montgomery (2001), page 144.

In the above example, we need to ensure that each type of formulation occurs exactly once for each type of raw material and also that each formulation is prepared exactly once by each of the five operators. This type of problem is handled by the Latin Square Design, abbreviated as LSD, and we use the Latin letters to represent the type of formulations. A simple technique is to set up an LSD for say $c13-math-0238$ formulations, with the first row of the design as 1, 2, …, $c13-math-0239$ , the second row as 2, 3, …, $c13-math-0240$ , 1, the third row as 3, 4, …, $c13-math-0241$ , 1, 2, and so forth, each row being a cyclic shift of the previous one by one position, until the last row, which begins with $c13-math-0242$ and continues with 1, 2, 3, …. The design.lsd function from the agricolae package randomizes such a square, and its kinds argument selects the random number generator used for the randomization, for instance Wichmann-Hill, Marsaglia-Multicarry, Mersenne-Twister, or Super-Duper. A couple of illustrations are given next using the design.lsd function.
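The cyclic construction can be sketched in a few lines of base R. The helper names cyclic_lsd and is_latin are hypothetical and are not part of agricolae; the second helper simply checks that every row and every column is a permutation of the treatment labels.

```r
# Build a p x p cyclic Latin square: row i is row 1 shifted left by i - 1
cyclic_lsd <- function(p) {
  outer(1:p, 1:p, function(i, j) (i + j - 2) %% p + 1)
}

# A square is Latin when every row and every column is a permutation of 1:p
is_latin <- function(sq) {
  p <- nrow(sq)
  ok <- function(x) all(sort(x) == 1:p)
  all(apply(sq, 1, ok)) && all(apply(sq, 2, ok))
}

sq <- cyclic_lsd(5)
sq
is_latin(sq)

# Permuting the rows and columns at random keeps the Latin property
set.seed(7)                          # arbitrary seed
sq_rand <- sq[sample(5), sample(5)]
is_latin(sq_rand)
```

Randomly permuting the rows and columns of the cyclic square is one simple way of randomizing an LSD while preserving its structure.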

Example 13.4.6. Setting up LSDs

A simple 3-treatment LSD is first set up, and then a 5-treatment LSD is produced. First, we show how the entire LSD setup looks for two examples, and then follow it up with the simple rectangular displays of LSD.

> design.lsd(letters[1:3],kinds="Super-Duper")
  plots row col letters[1:3]
1     1   1   1            a
2     2   1   2            b
3     3   1   3            c
4     4   2   1            c
5     5   2   2            a
6     6   2   3            b
7     7   3   1            b
8     8   3   2            c
9     9   3   3            a
> design.lsd(letters[1:3],kinds="Super-Duper")
  plots row col letters[1:3]
1     1   1   1            a
2     2   1   2            b
3     3   1   3            c
4     4   2   1            b
5     5   2   2            c
6     6   2   3            a
7     7   3   1            c
8     8   3   2            a
9     9   3   3            b
> matrix(design.lsd(LETTERS[1:3])[,4],nrow=3)
     [,1] [,2] [,3]
[1,] "A"  "C"  "B"
[2,] "B"  "A"  "C"
[3,] "C"  "B"  "A"
> Formulations <- paste("F",1:5,sep="")
> matrix(design.lsd(Formulations,kinds="Marsaglia-Multicarry") [,4],nrow=5)
     [,1] [,2] [,3] [,4] [,5]
[1,] "F1" "F5" "F2" "F3" "F4"
[2,] "F2" "F1" "F3" "F4" "F5"
[3,] "F3" "F2" "F4" "F5" "F1"
[4,] "F4" "F3" "F5" "F1" "F2"
[5,] "F5" "F4" "F1" "F2" "F3"

Thus, we have used the design.lsd function to set up LSDs. □

The LSD model and its statistical analysis are now detailed. The LSD statistical model is specified by

13.16

Here, $c13-math-0245$ will continue to represent the effect of the $c13-math-0246$ -th treatment (or formulation), $c13-math-0247$ for the row (block) effect (raw material), and $c13-math-0248$ for the column (block) effect (operator). The errors are assumed to follow normal distribution $c13-math-0249$ . The ANOVA decomposition of the sum of squares for the LSD model 13.16 will be as follows:

13.17

where

The ANOVA table for the LSD is given in Table 13.5. For Example 13.4.5, the R method is illustrated next.

Table 13.5 ANOVA for the LSD Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$c13-math-0252$ -Statistic
Between Treatments	$c13-math-0253$	$c13-math-0254$	$c13-math-0255$	$c13-math-0256$
Between Rows	$c13-math-0257$	$c13-math-0258$	$c13-math-0259$	$c13-math-0260$
Between Columns	$c13-math-0261$	$c13-math-0262$	$c13-math-0263$	$c13-math-0264$
Error	$c13-math-0265$	$c13-math-0266$	$c13-math-0267$
Total	$c13-math-0268$	$c13-math-0269$

Example 13.4.7. Rocket Propellant Example

The data from Table 4.9 of Montgomery (2002) is available in the rocket data frame. After reading the data in R, we check for the LSD arrangement with matrix(rocket$treat,nrow=5). Since each of the formulations, raw materials, and operators is supposed to be a factor variable, the plot of the burning rate (y in the data frame) against these variables will give us a box plot. The graphical output (the figure is suppressed here) clearly indicates that the burning rate y depends on each of the three factors. The aov function is again used for analyzing the model here.

> data(rocket)
> matrix(rocket$treat,nrow=5)
     [,1] [,2] [,3] [,4] [,5]
[1,] "A"  "B"  "C"  "D"  "E"
[2,] "B"  "C"  "D"  "E"  "A"
[3,] "C"  "D"  "E"  "A"  "B"
[4,] "D"  "E"  "A"  "B"  "C"
[5,] "E"  "A"  "B"  "C"  "D"
> par(mfrow=c(1,3)) # Graphical output suppressed again
> plot(y~op+batch+treat,rocket)
> rocket.lm <- lm(y~factor(op)+factor(batch)+treat,rocket)
> anova(rocket.lm)
Analysis of Variance Table
Response: y
              Df Sum Sq Mean Sq F value   Pr(>F)
factor(op)     4    150  37.500  3.5156 0.040373 *
factor(batch)  4     68  17.000  1.5937 0.239059
treat          4    330  82.500  7.7344 0.002537 **
Residuals     12    128  10.667
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The analysis output shows that the formulation level (treat) and the type of operator (op) influence the burning rate (y) at the 5% level, whereas the batch of raw material (batch) does not show a significant effect. □

There are many other useful variants of LSD and the reader may refer to Chapter 10 of Hinkelmann and Kempthorne (2008) for the same. An extension of the LSD is considered in the next topic.

13.4.4 Graeco Latin Square Design

Suppose that there is an extra source of randomness for the Rocket Propellant problem in Example 13.4.5 in test assemblies, which forms an additional type of treatment. Now, it is obvious that this is a second treatment which may be again addressed by another LSD. Let us denote the second treatment by Greek letters, say $c13-math-0270$ , $c13-math-0271$ , $c13-math-0272$ , $c13-math-0273$ , and $c13-math-0274$ . There is symbolic confusion in the sense that the Greek letters of $c13-math-0275$ , $c13-math-0276$ , and $c13-math-0277$ may be confused with the notations used in Model 13.16. However, we will use the notations as elements of the LSD matrix. It should be clear from the context whether the notations are as in the Greek letters required or in the statistical model. Now, the LSD matrices for the two treatments are superimposed on each other to obtain the Graeco-Latin Square Design model. Table 13.6 shows how superimposing is done for two simple LSDs to obtain the GLSD.

Table 13.6 The GLSD Model

LSD			GSD			GLSD
A	B	C	$c13-math-0278$	$c13-math-0279$	$c13-math-0280$	A $c13-math-0281$	B $c13-math-0282$	C $c13-math-0283$
B	C	A	$c13-math-0284$	$c13-math-0285$	$c13-math-0286$	B $c13-math-0287$	C $c13-math-0288$	A $c13-math-0289$
C	A	B	$c13-math-0290$	$c13-math-0291$	$c13-math-0292$	C $c13-math-0293$	A $c13-math-0294$	B $c13-math-0295$

In more formal terms, the Graeco-Latin square design model, abbreviated as GLSD, is specified by

13.18

where $c13-math-0297$ now denotes the effect of the additional treatment. A couple of GLSDs will be first set up.

Example 13.4.8. Setting up the GLSDs

The function design.graeco from the agricolae package helps in setting up a GLSD. In general, LSDs and GLSDs are also nice to visualize and this example will visualize the latter through the R program. Essentially, we would like to produce a GLSD of any order, which can be displayed in the form of Table 13.6. If we were to skip the Greek letters and obtain the GLSD as two LSDs superimposed on each other, the task is fairly simple. First, we create two LSDs using design.lsd and use the paste function to get the desired result. However, the Greek letters can be displayed in a plot and hence the GLSD can be visualized.

> LSD1 <- matrix(design.lsd(LETTERS[1:3])[,4],nrow=3)
> LSD2 <- matrix(design.lsd(LETTERS[4:6])[,4],nrow=3)
> GLSD <- matrix(paste(LSD1,LSD2,sep=""),nrow=3)
> LSD1; LSD2; GLSD
     [,1] [,2] [,3]
[1,] "A"  "C"  "B"
[2,] "B"  "A"  "C"
[3,] "C"  "B"  "A"
     [,1] [,2] [,3]
[1,] "D"  "F"  "E"
[2,] "E"  "D"  "F"
[3,] "F"  "E"  "D"
     [,1] [,2] [,3]
[1,] "AD" "CF" "BE"
[2,] "BE" "AD" "CF"
[3,] "CF" "BE" "AD"
> GLSD_Length <- function(n){
+ Greek <- c("alpha","beta","gamma","delta","epsilon","zeta","eta",
+ "theta","iota","kappa","lambda","mu","nu","xi","omicron","pi",
+ "rho","sigma","tau","upsilon","phi","chi","psi","omega")
+ Latin <- LETTERS[1:n]
+ greek <- 1:n; latin <- 1:n
+ My_Graeco <- design.graeco(latin,greek)
+ My_Graeco
+ Latin_Matrix <- matrix(as.numeric(My_Graeco[,4]),nrow=n) # Latin
+ Greek_Matrix <- matrix(as.numeric(My_Graeco[,5]),nrow=n) # Greek
+ plotSquareMatrix <- function(X,Y){
+ n <- nrow(X)
+ reverseIndex <- n:1
+ plot(0:(n+1),0:(n+1),type="n",axes=FALSE,xlab="",ylab="")
+ title("A Graeco-Latin Square Design")
+ for(i in 1:n){
+ for(j in 1:n){
+ text(j,reverseIndex[i], as.expression(substitute(A (B),
+  list(A = as.name(Latin[X[i,j]]),B = as.name(Greek[Y[i,j]])))))
+ }
+ }
+ }
+ plotSquareMatrix(Latin_Matrix,Greek_Matrix)
+ }
> GLSD_Length(10)

The first three lines of the above program simply superimpose two LSDs. Note that such a superimposition is a valid GLSD only when the two squares are orthogonal, that is, when every Latin-Greek pair occurs exactly once; two independently generated LSDs need not satisfy this, and in the output above only the pairs AD, BE, and CF occur. Next, we define the function GLSD_Length, which will help generate a GLSD of order $c13-math-0298$ . The design.graeco function can create a GLSD of order 14, although we have considered more Greek letters. Thus, the objects Greek and Latin store the Greek and Latin letters. A GLSD is obtained using the design.graeco function. The matrices Latin_Matrix and Greek_Matrix constitute two LSDs, which need to be superimposed upon each other. The function plotSquareMatrix is defined within the GLSD_Length function, which helps to visualize a square matrix, as seen at the console on a graphical device. The R code in the loop for(j in 1:n) is motivated from http://stackoverflow.com/questions/13169318/r-creating-vectors-of-latin-greek-expression-for-plot-titles-axis-labels-or-l. The resulting display is given in Figure 13.5. □

Figure 13.5 Graeco-Latin Square Design

A Graeco–Latin Square Design
H( $c13-math-0299$ )	B( $c13-math-0300$ )	I( $c13-math-0301$ )	G( $c13-math-0302$ )	F( $c13-math-0303$ )	A( $c13-math-0304$ )	C( $c13-math-0305$ )	D( $c13-math-0306$ )	J( $c13-math-0307$ )	E( $c13-math-0308$ )
E( $c13-math-0309$ )	D( $c13-math-0310$ )	B( $c13-math-0311$ )	I( $c13-math-0312$ )	A( $c13-math-0313$ )	F( $c13-math-0314$ )	H( $c13-math-0315$ )	J( $c13-math-0316$ )	C( $c13-math-0317$ )	G( $c13-math-0318$ )
D( $c13-math-0319$ )	G( $c13-math-0320$ )	J( $c13-math-0321$ )	B( $c13-math-0322$ )	I( $c13-math-0323$ )	H( $c13-math-0324$ )	F( $c13-math-0325$ )	C( $c13-math-0326$ )	E( $c13-math-0327$ )	A( $c13-math-0328$ )
F( $c13-math-0329$ )	J( $c13-math-0330$ )	A( $c13-math-0331$ )	C( $c13-math-0332$ )	B( $c13-math-0333$ )	I( $c13-math-0334$ )	D( $c13-math-0335$ )	E( $c13-math-0336$ )	G( $c13-math-0337$ )	H( $c13-math-0338$ )
J( $c13-math-0339$ )	F( $c13-math-0340$ )	C( $c13-math-0341$ )	H( $c13-math-0342$ )	E( $c13-math-0343$ )	B( $c13-math-0344$ )	I( $c13-math-0345$ )	G( $c13-math-0346$ )	A( $c13-math-0347$ )	D( $c13-math-0348$ )
I( $c13-math-0349$ )	C( $c13-math-0350$ )	F( $c13-math-0351$ )	E( $c13-math-0352$ )	D( $c13-math-0353$ )	G( $c13-math-0354$ )	B( $c13-math-0355$ )	A( $c13-math-0356$ )	H( $c13-math-0357$ )	J( $c13-math-0358$ )
B( $c13-math-0359$ )	I( $c13-math-0360$ )	E( $c13-math-0361$ )	F( $c13-math-0362$ )	G( $c13-math-0363$ )	J( $c13-math-0364$ )	A( $c13-math-0365$ )	H( $c13-math-0366$ )	D( $c13-math-0367$ )	C( $c13-math-0368$ )
C( $c13-math-0369$ )	E( $c13-math-0370$ )	G( $c13-math-0371$ )	A( $c13-math-0372$ )	H( $c13-math-0373$ )	D( $c13-math-0374$ )	J( $c13-math-0375$ )	F( $c13-math-0376$ )	B( $c13-math-0377$ )	I( $c13-math-0378$ )
A( $c13-math-0379$ )	H( $c13-math-0380$ )	D( $c13-math-0381$ )	J( $c13-math-0382$ )	C( $c13-math-0383$ )	E( $c13-math-0384$ )	G( $c13-math-0385$ )	B( $c13-math-0386$ )	I( $c13-math-0387$ )	F( $c13-math-0388$ )
G( $c13-math-0389$ )	A( $c13-math-0390$ )	H( $c13-math-0391$ )	D( $c13-math-0392$ )	J( $c13-math-0393$ )	C( $c13-math-0394$ )	E( $c13-math-0395$ )	I( $c13-math-0396$ )	F( $c13-math-0397$ )	B( $c13-math-0398$ )
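Whether two superimposed Latin squares actually form a Graeco-Latin square can be checked by counting the distinct pairs. The sketch below builds two squares of an odd prime order by modular arithmetic, a classical construction used here in place of design.graeco; the helper is_orthogonal and all object names are hypothetical.

```r
p <- 5                                   # an odd prime order, chosen for illustration
idx <- 0:(p - 1)

# Two cyclic Latin squares with different step sizes in the column index
latin_sq <- outer(idx, idx, function(i, j) (i + j) %% p) + 1
greek_sq <- outer(idx, idx, function(i, j) (i + 2 * j) %% p) + 1

# Orthogonal means: the p^2 superimposed (latin, greek) pairs are all distinct
is_orthogonal <- function(A, B) {
  length(unique(paste(A, B))) == length(A)
}

is_orthogonal(latin_sq, greek_sq)        # a valid Graeco-Latin square
is_orthogonal(latin_sq, latin_sq)        # a square is never orthogonal to itself

matrix(paste0(LETTERS[latin_sq], letters[greek_sq]), nrow = p)
```

In the printed matrix the lower-case letters stand in for the Greek symbols, and each of the 25 upper-case and lower-case pairs appears exactly once.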

The ANOVA decomposition of the sum of squares for the GLSD model 13.18 will be as follows:

13.19

where

The ANOVA table for the GLSD is given in Table 13.7. For Example 13.4.5, the R method is illustrated next.

Table 13.7 ANOVA for the GLSD Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$c13-math-0401$ -Statistic
Between Latin Treatments	$c13-math-0402$	$c13-math-0403$	$c13-math-0404$	$c13-math-0405$
Between Greek Treatments	$c13-math-0406$	$c13-math-0407$	$c13-math-0408$	$c13-math-0409$
Between Rows	$c13-math-0410$	$c13-math-0411$	$c13-math-0412$	$c13-math-0413$
Between Columns	$c13-math-0414$	$c13-math-0415$	$c13-math-0416$	$c13-math-0417$
Error	$c13-math-0418$	$c13-math-0419$	$c13-math-0420$
Total	$c13-math-0421$	$c13-math-0422$

Example 13.4.9. Extending the Rocket Example

In continuation of Example 13.4.7 of the Rocket Propellant data, we now have the added blocking factor in test assemblies. This dataset is available in the rocket_Graeco data frame from the companion package. Using assembly as the additional blocking variable, we continue to use the lm function and then the anova function to obtain the results.

> data(rocket_Graeco)
> par(mfrow=c(2,2))
> plot(y~op+batch+treat+assembly,rocket_Graeco) # output suppressed
> rocket.glsd.lm <- lm(y~factor(op)+factor(batch)+treat+assembly, rocket_Graeco)
> anova(rocket.glsd.lm)
Analysis of Variance Table
Response: y
              Df Sum Sq Mean Sq F value   Pr(>F)
factor(op)     4    150   37.50  4.5455 0.032930 *
factor(batch)  4     68   17.00  2.0606 0.178311
treat          4    330   82.50 10.0000 0.003344 **
assembly       4     62   15.50  1.8788 0.207641
Residuals      8     66    8.25
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The result, in comparison with the ANOVA table in Example 13.4.7, shows that using the assembly as an additional variable reduces the residual sum of squares from 128 to 66, although the residual degrees of freedom are also reduced, from 12 to 8. □
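The trade-off between a smaller residual sum of squares and fewer residual degrees of freedom can be reproduced on simulated data. The sketch below assumes a 5 by 5 layout built from two orthogonal squares and a made-up response; none of the values come from the rocket data.

```r
p <- 5
idx <- 0:(p - 1)
design <- expand.grid(row = idx, col = idx)
design$latin <- factor((design$row + design$col) %% p)
design$greek <- factor((design$row + 2 * design$col) %% p)
design$row <- factor(design$row)
design$col <- factor(design$col)

set.seed(21)                              # arbitrary seed; the response is simulated
design$y <- 20 + as.numeric(design$latin) + rnorm(p^2)

lsd_fit  <- lm(y ~ row + col + latin, data = design)
glsd_fit <- lm(y ~ row + col + latin + greek, data = design)

# Residual degrees of freedom: (p - 1)(p - 2) for the LSD, (p - 1)(p - 3) for the GLSD
c(LSD = df.residual(lsd_fit), GLSD = df.residual(glsd_fit))

# The residual sum of squares can only decrease when a term is added
c(LSD = deviance(lsd_fit), GLSD = deviance(glsd_fit))
```

Since adding any term lowers the residual sum of squares, the more informative comparison is between the residual mean squares, which are 10.667 and 8.25 in the two rocket ANOVA tables.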

We now move to more important and complex experimental designs.

13.5 Factorial Designs

Factorial designs¹ are useful in experiments involving two or more factor variables. Suppose that there are two factor variables, with variable 1 having $c13-math-0424$ levels and variable 2 having $c13-math-0425$ levels. A factorial design investigates all possible combinations of the levels of the factors. In simple words, if factor $c13-math-0426$ is at $c13-math-0427$ levels, factor $c13-math-0428$ at $c13-math-0429$ levels, and factor $c13-math-0430$ at $c13-math-0431$ levels, a factorial design will investigate all possible $c13-math-0432$ level combinations. It is then common practice to say that factors are crossed, implying that each factor variable level also has a corresponding observation among the levels of other factors. In factorial designs, the interest is more often to determine if there is an interaction between some combination of the factor levels. What does interaction really mean? In general, the main effects of a factor are observed to be the changes in the regressand due to the changes of the factor levels. However, if the changes in such expected values of the regressand also depend on the levels of other factor variables, we believe that there is an interaction effect between the factor variables. In the previously discussed designs in Sections 13.3 and 13.4, the factors were implicitly assumed to not have any kind of interaction among their different levels. Recall that in the three-dimensional scatter plot in Figure 12.6 and the contour plot in Figure 12.7, the model 12.31 consisted of straight lines or planes only. However, for the models 12.31 and Figure 13.6, the three-dimensional plots and contour plots had curvilinear shapes in them, which is then an indication of the presence of interaction among the variables.

Figure 13.6 Design and Interaction Plots for 2-Factorial Design. Panel A: Design of the battery factorial experiment (Mean of L against Factors). Panel B: Interaction effect of the battery factorial experiment (Average life against Temperature).

Figure 13.7 Understanding Interactions for the Bottling Experiment. Four interaction plots under the heading "Understanding height of bottling − interaction plots"; the top-left plot shows Mean of deviation against the Carbonation, Pressure, and Speed factors.

Sir R.A. Fisher advocated the use of a complex design like factorial designs with “No aphorism is more frequently repeated in connection with field trials, than that we must ask Nature a few questions, or, ideally, one question, at a time. The writer is convinced that this view is wholly mistaken.” To understand this advantage, consider two factorial variables at two levels each. Suppose that two factor variables, $c13-math-0433$ and $c13-math-0434$ , are both available at levels high $c13-math-0435$ and low $c13-math-0436$ . Then the four possible combinations of the two treatments are $c13-math-0437$ , $c13-math-0438$ , $c13-math-0439$ , and $c13-math-0440$ . If low refers to the absence of factor levels, it is also a common practice to denote these respective combination levels with $c13-math-0441$ , $c13-math-0442$ , $c13-math-0443$ , and $c13-math-0444$ . Suppose we are interested in finding the effect of changing the factors $c13-math-0445$ and $c13-math-0446$ . A rule of thumb is to take two observations at each combination level of the factors in the presence of the experimental error. Now, the effect of treatment factor $c13-math-0447$ is found by the difference in the combination level of $c13-math-0448$ , while that of $c13-math-0449$ is with $c13-math-0450$ . Thus, we require data on the three combination levels $c13-math-0451$ , $c13-math-0452$ , and $c13-math-0453$ , or equivalently six observations only. In a full factorial experiment, we would also have two more data points at the combination level $c13-math-0454$ . Now, we can obtain two main effects for $c13-math-0455$ with $c13-math-0456$ and $c13-math-0457$ , and two main effects for $c13-math-0458$ too with $c13-math-0459$ and $c13-math-0460$ .
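The arithmetic behind the main effects and the interaction in a two-level, two-factor experiment can be sketched as follows. The cell means are made up, and the convention used for the interaction (half the difference between the two simple effects of one factor) is the usual textbook definition, assumed here.

```r
# Hypothetical cell means for a 2 x 2 factorial (values are made up)
cell <- matrix(c(20, 30,      # B low:  A low, A high
                 40, 52),     # B high: A low, A high
               nrow = 2,
               dimnames = list(A = c("low", "high"), B = c("low", "high")))

# Effect of A at each level of B, and of B at each level of A
effect_A_at_B <- cell["high", ] - cell["low", ]
effect_B_at_A <- cell[, "high"] - cell[, "low"]

main_A <- mean(effect_A_at_B)        # average of the two simple effects of A
main_B <- mean(effect_B_at_A)

# Interaction: half the difference between the two simple effects of A
interaction_AB <- diff(effect_A_at_B) / 2

c(A = main_A, B = main_B, AB = unname(interaction_AB))
```

With only the three one-factor-at-a-time combinations, each effect would rest on a single difference and the interaction could not be estimated at all; the fourth combination makes both possible.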

In the remainder of this section, we will consider some useful factorial designs. The focus will be only on fixed effects models.

13.5.1 Two Factorial Experiment

Consider the case of two factor variables. Suppose that factor $c13-math-0461$ is at $c13-math-0462$ levels, and $c13-math-0463$ at $c13-math-0464$ levels. The two factorial design model is given by

13.20

Here, $c13-math-0466$ is the overall mean effect, $c13-math-0467$ is the effect of the $c13-math-0468$ -th factor level of $c13-math-0469$ , $c13-math-0470$ is the $c13-math-0471$ -th effect of the factor $c13-math-0472$ , and $c13-math-0473$ is the interaction effect between the variables. The ANOVA decomposition of the sum of squares for the two factorial experiments is given by

13.21

where

The computation of $c13-math-0476$ involves two stages. Since such issues do not exist when we use R, we skip this detail. The ANOVA table for the two-factorial model is given in Table 13.8. The hypothesis testing problems of interest here are the following:

1. Testing for Factor $c13-math-0477$ :
2. Testing for Factor $c13-math-0479$ :
3. Testing for the Interaction of factors:

Table 13.8 ANOVA for the Two Factorial Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$c13-math-0482$ -Statistic
Treatment A	$c13-math-0483$	$c13-math-0484$	$c13-math-0485$	$c13-math-0486$
Treatment B	$c13-math-0487$	$c13-math-0488$	$c13-math-0489$	$c13-math-0490$
Interaction	$c13-math-0491$	$c13-math-0492$	$c13-math-0493$	$c13-math-0494$
Error	$c13-math-0495$	$c13-math-0496$	$c13-math-0497$
Total	$c13-math-0498$	$c13-math-0499$
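The structure of the two-factor ANOVA table, and the reading of the interaction test as a comparison of nested models, can be illustrated on simulated data. Everything in the sketch below (the numbers of levels and replicates, the effect sizes, and the object names) is an assumption made for illustration.

```r
set.seed(5)                                  # arbitrary seed; the data are simulated
a <- 3; b <- 3; n <- 4                       # levels of A, levels of B, replicates
dat <- expand.grid(rep = 1:n, A = factor(1:a), B = factor(1:b))

# Made-up response with main effects and an A:B interaction
dat$y <- 100 + 5 * as.numeric(dat$A) - 8 * as.numeric(dat$B) +
  3 * (dat$A == "3") * (dat$B == "3") + rnorm(nrow(dat), sd = 2)

fit_full <- aov(y ~ A * B, data = dat)       # A + B + A:B
fit_add  <- aov(y ~ A + B, data = dat)       # additive model, no interaction

summary(fit_full)

# F test for the interaction as a comparison of the two nested models
anova(fit_add, fit_full)
```

The F statistic reported by anova for the second model coincides with the interaction row of the summary table, since both compare the additive model with the full one.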

The two-factorial experiment is now illustrated with an example from Montgomery (2005).

Example 13.5.1. Two Factorial Experiment for Battery Data

An engineer designs a battery to be used in a device, which will be subject to extreme variations in temperature, and as a blocking variable he has some control on the type of plate material among three different choices. The engineer has no control on the temperature variations once the device leaves the factory. Thus, the task of the engineer is to investigate two major problems: (i) The effect of material type and temperature on the life of the device, and (ii) Finding the type of material which has least variation among the varying temperature levels. The temperature in the study is at three levels of 15, 70, and $c13-math-0500$ Fahrenheit, while there are three different types of material. For each combination of the temperature and material, four replications of the life of the battery are tested. This dataset is available in the battery.txt file. Thus, the two-factorial experiment for this problem is given by

The reader may refer to Example 5.3.1 of Montgomery (2005) for more details of this experiment.

> data(battery)
> names(battery) <- c("L","M","T")
> battery$M <- as.factor(battery$M)
> battery$T <- as.factor(battery$T)
> battery.aov <- aov(L~M*T,data=battery)
> model.matrix(battery.aov)
   (Intercept) M2 M3 T70 T125 M2:T70 M3:T70 M2:T125 M3:T125
1            1  0  0   0    0      0      0       0       0
2            1  0  0   0    0      0      0       0       0
3            1  0  0   0    0      0      0       0       0
4            1  0  0   0    0      0      0       0       0
5            1  1  0   0    0      0      0       0       0
32           1  1  0   0    1      0      0       1       0
33           1  0  1   0    1      0      0       0       1
34           1  0  1   0    1      0      0       0       1
35           1  0  1   0    1      0      0       0       1
36           1  0  1   0    1      0      0       0       1
attr(,"assign")
[1] 0 1 1 2 2 3 3 3 3
attr(,"contrasts")
attr(,"contrasts")$M
[1] "contr.treatment"
attr(,"contrasts")$T
[1] "contr.treatment"
> summary(battery.aov)
            Df Sum Sq Mean Sq F value   Pr(>F)
M            2  10684    5342   7.911  0.00198 **
T            2  39119   19559  28.968 1.91e-07 ***
M:T          4   9614    2403   3.560  0.01861 *
Residuals   27  18231     675
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
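The structure of the design matrix printed above can be explored on a small stand-in layout. This is a sketch under the assumption of a 3 x 3 layout with four replicates and the default treatment contrasts; the data frame toy is hypothetical and carries no response values.

```r
# Hypothetical 3 x 3 layout with 4 replicates, mirroring the shape of the battery data
toy <- expand.grid(rep = 1:4, M = factor(1:3), T = factor(c(15, 70, 125)))
X <- model.matrix(~ M * T, data = toy)

print(colnames(X))  # intercept, 2 + 2 main-effect dummies, 4 interaction dummies
print(dim(X))       # 36 rows, 9 columns

# Rows at the baseline levels M = 1, T = 15 have only the intercept switched on
base_rows <- X[toy$M == "1" & toy$T == "15", ]
print(base_rows)

# A row for M = 2, T = 125 switches on M2, T125, and their product M2:T125
one_row <- X[toy$M == "2" & toy$T == "125", ][1, ]
print(one_row)
```

Each interaction column is simply the product of the two corresponding main-effect dummy columns, which is why it is nonzero only when both factors are away from their baseline levels.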

The reader is advised to interpret the working design matrix as given by model.matrix(battery.aov). The factor variables have been renamed for simplicity's sake only. Using the aov function, the linear model for this factorial experiment is set up. The code summary(battery.aov) shows that material, temperature, and their interaction are all significant for explaining the life of the device. Now that the interaction effect is known to be significant, the natural question is what it looks like. Using the two functions plot.design and interaction.plot, the average life at the different factor levels is visualized. The multiple comparisons are obtained using the glht function from the multcomp package.

> par(mfrow=c(1,2))
> plot.design(L~M+T,data=battery)
> interaction.plot(battery$T,battery$M,battery$L,type="b",pch=19,
+ fixed=TRUE,xlab="Temperature",ylab="Average life")
> glht(battery.aov,linfct=mcp(M="Dunnett"),alternative="two.sided", interaction=TRUE)
         General Linear Hypotheses
Multiple Comparisons of Means: Dunnett Contrasts
Linear Hypotheses:
           Estimate
2 - 1 == 0    21.00
3 - 1 == 0     9.25
Warning message:
In mcp2matrix(model, linfct = linfct) :
  covariate interactions found -- default contrast might be inappropriate
> glht(battery.aov,linfct=mcp(M="Tukey"),alternative="two.sided", interaction=TRUE)
         General Linear Hypotheses
Multiple Comparisons of Means: Tukey Contrasts
Linear Hypotheses:
           Estimate
2 - 1 == 0    21.00
3 - 1 == 0     9.25
3 - 2 == 0   -11.75
Warning message:
In mcp2matrix(model, linfct = linfct) :
  covariate interactions found -- default contrast might be inappropriate
> glht(battery.aov,linfct=mcp(T="Tukey"),alternative="two.sided", interaction=TRUE)
         General Linear Hypotheses
Multiple Comparisons of Means: Tukey Contrasts
Linear Hypotheses:
              Estimate
70 - 15 == 0    -77.50
125 - 15 == 0   -77.25
125 - 70 == 0     0.25
Warning message:
In mcp2matrix(model, linfct = linfct) :
  covariate interactions found -- default contrast might be inappropriate

The use of the “simpler” plot.design function in Part A of Figure 13.6 shows that the average life increases across the material type (from 1 to 3), whereas it decreases as the temperature increases. However, the interaction plot, as given in Part B of Figure 13.6, clearly shows that at temperature $T = 15^{\circ}$ Fahrenheit, the average life for material type 2 is longer than that for material type 3, which points to an interaction effect between the temperature and material type. To verify whether this interaction is statistically significant, we may use testInteractions from the package phia.

> testInteractions(battery.aov,fixed="T",across="M")
F Test:
P-value adjustment method: holm
              M1     M2 Df Sum of Sq       F    Pr(>F)
 15        -9.25  11.75  2     886.2  0.6562 0.5268904
 70       -88.50 -26.00  2   16552.7 12.2574 0.0004892 ***
125       -28.00 -36.00  2    2858.7  2.1169 0.2799107
Residuals               27   18230.8
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The effect of material at Temperature=70 is clearly seen to be significant in the above output, whereas it is not significant at the temperatures 15 and 125. With the main and interaction effects as understood by us, it is now important to focus on the model assumptions. Since the two-factorial experiment is again an example of a linear regression model, the techniques of model validation from Chapter 12 carry over and are used for the battery.aov lm object.

> n <- nrow(battery)  # number of observations, 36 here
> p <- 9              # number of columns of the design matrix
> diagnostics_matrix <- matrix(nrow=n,ncol=8)
> colnames(diagnostics_matrix) <- c("Obs No.","y","Pred y","Residual",
+ "Leverage","S.Residual","Cooks.Dist","Outlier")
> diagnostics_matrix[,1] <- 1:n
> diagnostics_matrix[,2] <- battery$L
> diagnostics_matrix[,3] <- battery.aov$fitted.values
> diagnostics_matrix[,4] <- round(battery.aov$residuals,3)
> diagnostics_matrix[,5] <- influence(battery.aov)$hat
> diagnostics_matrix[,6] <- round(rstudent(battery.aov),3)
> diagnostics_matrix[,7] <- round(cooks.distance(battery.aov),3)
> # use the sample size n, not a constant carried over from another example
> diagnostics_matrix[,8] <- diagnostics_matrix[,6]*sqrt((n-p-1)/
+ (n-p-diagnostics_matrix[,6]^2))
> round(diagnostics_matrix,3)
      Obs No.   y Pred y Residual Leverage S.Residual Cooks.Dist
 [1,]       1 130 134.75    -4.75     0.25     -0.207      0.002
 [2,]       2  74 134.75   -60.75     0.25     -3.100      0.270
 [3,]       3 155 134.75    20.25     0.25      0.897      0.030
 [4,]       4 180 134.75    45.25     0.25      2.140      0.150
 [5,]       5 150 155.75    -5.75     0.25     -0.251      0.002
[32,]      32  45  49.50    -4.50     0.25     -0.196      0.001
[33,]      33  96  85.50    10.50     0.25      0.460      0.008
[34,]      34  82  85.50    -3.50     0.25     -0.153      0.001
[35,]      35 104  85.50    18.50     0.25      0.817      0.025
[36,]      36  60  85.50   -25.50     0.25     -1.139      0.048

The interpretation of the output of the R program is left as an exercise for the reader. □
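As a starting point for that exercise, the same diagnostic quantities can be obtained from base R extractor functions. The sketch below uses simulated data in a 3 x 3 layout with four replicates (the object names toy, fit, and diag_tab are hypothetical). It also checks two facts that help in reading the table above: in a balanced design every leverage equals $p/n$, and the externally studentized residual from rstudent is linked to the standardized residual from rstandard by the conversion formula used for the outlier column.

```r
# Simulated stand-in for a balanced 3 x 3 factorial with 4 replicates
set.seed(42)
toy <- expand.grid(rep = 1:4, M = factor(1:3), T = factor(c(15, 70, 125)))
toy$L <- 100 - 20 * as.integer(toy$T) + rnorm(nrow(toy), sd = 25)
fit <- aov(L ~ M * T, data = toy)

n <- nrow(toy)
p <- length(coef(fit))      # 9 columns in the design matrix
h <- hatvalues(fit)         # leverages
r <- rstandard(fit)         # standardized (internally studentized) residuals
t_ext <- rstudent(fit)      # externally studentized residuals

# Conversion from standardized to externally studentized residuals
t_from_r <- r * sqrt((n - p - 1) / (n - p - r^2))

diag_tab <- data.frame(fitted = fitted(fit), resid = resid(fit), leverage = h,
                       rstandard = r, rstudent = t_ext,
                       cooks = cooks.distance(fit))
print(round(head(diag_tab), 3))
```

Note that the conversion is defined for the standardized residual, so applying it to values that are already studentized would transform them twice.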

The extension of the two-factorial experiment is discussed in the next subsection.

13.5.2 Three-Factorial Experiment

A natural extension of the two-factorial experiment is the three-factorial experiment. Let the three factors be denoted by $A$, $B$, and $C$, at $a$, $b$, and $c$ factor levels respectively. Consequently, there are now three more interaction terms to be modeled, namely $AC$, $BC$, and $ABC$. For a completely crossed model, each replicate (of index $l$) must cover all possible factor combinations. The three-way factorial model is then given by

13.22

The total sum of squares decomposition equation is then

13.23

where

The ANOVA table for the three-factorial design model is given in Table 13.9.

Table 13.9 ANOVA for the Three-Factorial Model

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$F$-Statistic
Treatment A	$SS_A$	$a-1$	$MS_A = SS_A/(a-1)$	$F_A = MS_A/MS_E$
Treatment B	$SS_B$	$b-1$	$MS_B = SS_B/(b-1)$	$F_B = MS_B/MS_E$
Treatment C	$SS_C$	$c-1$	$MS_C = SS_C/(c-1)$	$F_C = MS_C/MS_E$
Interaction (AB)	$SS_{AB}$	$(a-1)(b-1)$	$MS_{AB} = SS_{AB}/((a-1)(b-1))$	$F_{AB} = MS_{AB}/MS_E$
Interaction (AC)	$SS_{AC}$	$(a-1)(c-1)$	$MS_{AC} = SS_{AC}/((a-1)(c-1))$	$F_{AC} = MS_{AC}/MS_E$
Interaction (BC)	$SS_{BC}$	$(b-1)(c-1)$	$MS_{BC} = SS_{BC}/((b-1)(c-1))$	$F_{BC} = MS_{BC}/MS_E$
Interaction (ABC)	$SS_{ABC}$	$(a-1)(b-1)(c-1)$	$MS_{ABC} = SS_{ABC}/((a-1)(b-1)(c-1))$	$F_{ABC} = MS_{ABC}/MS_E$
Error	$SS_E$	$abc(n-1)$	$MS_E = SS_E/(abc(n-1))$
Total	$SS_T$	$abcn-1$
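The three-way model and its sum of squares decomposition may be sketched in one common textbook notation. The effect symbols $\tau$, $\beta$, $\gamma$, the replicate index $l$, and the replicate count $n$ below are assumptions borrowed from the usual fixed-effects presentation and may differ from the symbols used elsewhere in this chapter.

```latex
\begin{align*}
y_{ijkl} &= \mu + \tau_i + \beta_j + \gamma_k
          + (\tau\beta)_{ij} + (\tau\gamma)_{ik} + (\beta\gamma)_{jk}
          + (\tau\beta\gamma)_{ijk} + \epsilon_{ijkl}, \\
         &\qquad i = 1,\dots,a,\quad j = 1,\dots,b,\quad k = 1,\dots,c,\quad l = 1,\dots,n, \\
SS_T &= SS_A + SS_B + SS_C + SS_{AB} + SS_{AC} + SS_{BC} + SS_{ABC} + SS_E, \\
abcn - 1 &= (a-1) + (b-1) + (c-1) + (a-1)(b-1) + (a-1)(c-1) + (b-1)(c-1) \\
         &\quad + (a-1)(b-1)(c-1) + abc(n-1).
\end{align*}
```

For a layout with $a = 3$, $b = 2$, $c = 2$, and $n = 2$, the last identity reads $23 = 2 + 1 + 1 + 2 + 2 + 1 + 2 + 12$.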

Since the details of the two-factorial experiment extend in a fairly straightforward manner to the current model, the model will be illustrated with two examples from Montgomery (2005).

Example 13.5.2. A Three-Factorial Experiment for Bottling Data

Example 5.3 of Montgomery (2005) is discussed here through the R program, and the current example also benefits from Lalanne (2006), the R companion of Montgomery's book. The height of the filling of a soft drinks bottle is required to be as consistent as possible and is controlled through three factors: (i) the percent carbonation of the drink, (ii) the operating pressure in the filler, and (iii) the line speed, which is the number of bottles filled per minute. The first factor variable, the percent carbonation, is available at three levels of 10, 12, and 14, the operating pressure is at 25 and 30 psi units, while the line speeds are at 200 and 250 bottles per minute. Two complete replicates are available for each combination of the three factor levels, giving 24 observations in total. In this experiment, the deviation from the required height level is measured.

The R formula for a three-factorial experiment may be set up using aov(y~a+b+c+(a*b)+(a*c)+(b*c)+(a*b*c),data), or simply with aov(y~.^3,data). It is nice to know both the options! In the case of the two-factorial experiment, the investigation requires us only to find out whether there is an interaction effect between the two factor variables in the model, and the interaction.plot function has been useful for that purpose. However, in the three-factorial experiment, there are four interaction terms overall. A simple extension of the interaction.plot function is to obtain such plots while holding the third factor variable at a specified level. Thus, an extension of the interaction.plot function is first defined and then the various plots are obtained.

> data(bottling)
> # # Preliminary investigation for interaction effect
> windows(height=15,width=20)
> par(mfrow=c(2,2))
> plot.design(Deviation~.^3,data=bottling)
> IP_subset <- function(ab,colIndex,ss_char){
+ # reorder columns: two factors to plot, the factor held fixed, the response
+ abcd <- ab[,colIndex]
+ # keep only the rows where the third factor equals ss_char
+ abcd <- abcd[abcd[,3]==ss_char,]
+ vnames <- names(abcd)
+ interaction.plot(x.factor=abcd[,1],trace.factor=abcd[,2], response=abcd[,4],
+ type="b",xlab=vnames[1],ylab=vnames[4],trace.label=vnames[2])}
> IP_subset(bottling,c(3,4,2,1),"10")
> IP_subset(bottling,c(3,4,2,1),"12")
> IP_subset(bottling,c(3,4,2,1),"14")
> title("Understanding Height of Bottling - Interaction Plots",outer=TRUE,line=-1)
> par(mfrow=c(1,2))
> IP_subset(bottling,c(2,3,4,1),"200")
> IP_subset(bottling,c(2,3,4,1),"250")
> par(mfrow=c(1,2))
> IP_subset(bottling,c(2,4,3,1),"25")
> IP_subset(bottling,c(2,4,3,1),"30")

The newly-defined function IP_subset needs an explanation. Recollect that the plot function interaction.plot is essentially a “Two-way Interaction Plot”. Thus, there is a need for a function which checks for the interaction between two factor variables when the third variable is held constant. The function IP_subset takes three arguments: the data.frame ab, the vector colIndex of four column numbers, and the level ss_char. The first two entries of colIndex identify the two factor variables to be investigated for the interaction effect, the third entry identifies the factor variable to be held constant at the level ss_char, and the fourth entry identifies the output values. Note that the function is generic with respect to the data frame, so the two factor variables, the variable to be held constant, and the output all need to be clearly specified in colIndex. The plot.design, as in the north-east block of Figure 13.7, clearly indicates linearity across each of the input (factor) variables. The remaining three plots of Figure 13.7 rule out an interaction effect between Pressure and Speed. The interpretation of the plots for the remaining two combinations of the factor variables, as produced by the above R module, is left to the reader. Now that the interaction effect is understood, it is time to analyze the data.
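The numbers behind such a conditional interaction plot are just the cell means of the response for two factors at a fixed level of the third. The following sketch computes them directly on simulated data; the helper cell_means and the data frame toy, with columns C, P, S, and y, are hypothetical and are not part of the bottling dataset.

```r
# Simulated 3 x 2 x 2 layout with two replicates (illustration only)
set.seed(1)
toy <- expand.grid(rep = 1:2, C = factor(c(10, 12, 14)),
                   P = factor(c(25, 30)), S = factor(c(200, 250)))
toy$y <- rnorm(nrow(toy))

# Cell means of 'resp' over factors f1 x f2, with factor 'hold' fixed at 'level'
cell_means <- function(dat, f1, f2, hold, level, resp) {
  sub <- dat[dat[[hold]] == level, ]
  tapply(sub[[resp]], list(sub[[f1]], sub[[f2]]), mean)
}

m10 <- cell_means(toy, "P", "S", "C", "10", "y")
print(m10)
```

Each column of the resulting matrix corresponds to one trace line of the plot. Roughly parallel profiles across the columns suggest that there is no interaction at the fixed level.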

> summary(bottling.aov <- aov(Deviation~.^3,bottling))
                           Df Sum Sq Mean Sq F value   Pr(>F)
Carbonation                 2 252.75  126.38 178.412 1.19e-09 ***
Pressure                    1  45.37   45.37  64.059 3.74e-06 ***
Speed                       1  22.04   22.04  31.118  0.00012 ***
Carbonation:Pressure        2   5.25    2.63   3.706  0.05581 .
Carbonation:Speed           2   0.58    0.29   0.412  0.67149
Pressure:Speed              1   1.04    1.04   1.471  0.24859
Carbonation:Pressure:Speed  2   1.08    0.54   0.765  0.48687
Residuals                  12   8.50    0.71
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> summary(aov(Deviation~ Carbonation + Pressure + Speed+ (Carbonation*Pressure)
+ +(Carbonation*Speed)+(Pressure*Speed)+ (Carbonation*Speed*Pressure),data=bottling))
+ # Equivalent way and hence output is not provided

The use of aov and summary above gives us the desired output. Clearly, the ANOVA table for the bottling deviation, in the format of Table 13.9, shows that while the main effects are significant, the $p$-values for the interaction effects all exceed 0.05 (the Carbonation:Pressure interaction only marginally so, at 0.056), and hence it is appropriate to conclude that the interaction effects are not significant at the 5% level. □
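The equivalence of the two formula specifications used in this example can be checked without fitting anything, by expanding each formula into its term labels. The sketch below does so on a simulated data frame; the names toy and labels_of are hypothetical.

```r
# Simulated data frame holding only the response and three factors
toy <- expand.grid(a = factor(1:3), b = factor(1:2), c = factor(1:2), rep = 1:2)
toy$rep <- NULL
set.seed(7)
toy$y <- rnorm(nrow(toy))

# Expand a formula into its term labels, resolving '.' against the columns of toy
labels_of <- function(f) attr(terms(f, data = toy), "term.labels")

long_form  <- labels_of(y ~ a + b + c + (a * b) + (a * c) + (b * c) + (a * b * c))
star_form  <- labels_of(y ~ a * b * c)
power_form <- labels_of(y ~ .^3)

print(long_form)
print(setequal(long_form, power_form))

# The fitted models therefore share the same residual sum of squares
dev_long  <- deviance(aov(y ~ a + b + c + (a * b) + (a * c) + (b * c) + (a * b * c), data = toy))
dev_power <- deviance(aov(y ~ .^3, data = toy))
```

One caveat about the shorter form: the dot stands for every column other than the response, so y~.^3 gives the three-factorial model only when the data frame contains nothing besides the response and the three factors.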

Example 13.5.3. Understanding Strength of Paper with a Three-Factorial Experiment

The strength of a paper depends on three variables: (i) the percentage of hardwood concentration in the raw pulp, (ii) the vat pressure, and (iii) the cooking time of the pulp. The hardwood concentration is tested at three levels of 2, 4, and 8 percent, the vat pressure at 400, 500, and 650, while the cooking time is at 3 and 4 hours. For each combination of these three factor variables, 2 observations are available, and thus there is a total of $3 \times 3 \times 2 \times 2 = 36$ observations. The goal of the study is to investigate the impact of the three factor variables on the strength of the paper, and the presence of the interaction effect, if any. The dataset is available in the Strength_Paper.csv file and the source is Problem 5.16 of Montgomery (2005).

> data(SP)
> plot.design(Strength~.^3,data=SP)
> windows(width=10,height=5)
> par(mfrow=c(1,3))
> IP_subset(SP,c(2,3,1,4),"2")
> IP_subset(SP,c(2,3,1,4),"4")
> IP_subset(SP,c(2,3,1,4),"8")
> par(mfrow=c(1,2))
> IP_subset(SP,c(1,2,3,4),"3")
> IP_subset(SP,c(1,2,3,4),"4")
> par(mfrow=c(1,3))
> IP_subset(SP,c(1,3,2,4),"400")
> IP_subset(SP,c(1,3,2,4),"500")
> IP_subset(SP,c(1,3,2,4),"650")
> summary(SP.aov <- aov(Strength~.^3,SP))
                               Df Sum Sq Mean Sq F value   Pr(>F)
Hardwood                        2  3.764   1.882   2.458 0.113792
Pressure                        2 24.041  12.020  15.701 0.000113 ***
Cooking_Time                    1 14.694  14.694  19.194 0.000360 ***
Hardwood:Pressure               4 14.224   3.556   4.645 0.009422 **
Hardwood:Cooking_Time           2  0.304   0.152   0.198 0.821750
Pressure:Cooking_Time           2  4.951   2.475   3.233 0.063137 .
Hardwood:Pressure:Cooking_Time  4  1.751   0.438   0.572 0.686497
Residuals                      18 13.780   0.766
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The presence of the interaction effects is clearly brought out by the IP_subset plots. The summary of the fitted aov object supports this, with the Hardwood:Pressure interaction significant at the 1% level. The question now is at which factor levels the effects are statistically significant. The answer is again provided by the testInteractions function from the phia package.

> testInteractions(SP.aov,fixed="Hardwood",across="Pressure")
F Test:
P-value adjustment method: holm
          Pressure1 Pressure2 Df Sum of Sq       F    Pr(>F)
2            -3.775     -1.75  2   28.5517 18.6477 0.0001231 ***
4            -0.825     -1.25  2    3.2317  2.1107 0.1501486
8            -0.925     -1.80  2    6.4817  4.2333 0.0622582 .
Residuals                     18   13.7800
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> testInteractions(SP.aov,fixed="Cooking_Time",across="Pressure")
F Test:
P-value adjustment method: holm
          Pressure1 Pressure2 Df Sum of Sq       F   Pr(>F)
3           -1.3833     -2.05  2    13.121  8.5697 0.002428 **
4           -2.3000     -1.15  2    15.870 10.3650 0.002023 **
Residuals                     18    13.780
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Thus, the significance of the interaction effects has also been established here. □

The principle of blocking in the context of factorial experiments is taken up in the next subsection.

13.5.3 Blocking in Factorial Experiments

The two- and three-factorial experiments discussed above are extensions of the basic CRD model. In Section 13.4, we saw that replication alone does not eliminate the source of error and that the principle of blocking helps improve the efficiency of the experiment. Similarly, in the case of factorial experiments, practical considerations sometimes dictate that the replication principle cannot be implemented uniformly and that the factorial experiments may be improved through the introduction of blocking. The blocking model for a two-factorial experiment is stated next:

13.24

The total sum of squares decomposition equation for the model 13.24 is then

13.25

where

The corresponding ANOVA table for the blocking factorial experiment 13.24 is given in Table 13.10.

Table 13.10 ANOVA for Factorial Models with Blocking

Source of Variation	Sum of Squares	Degrees of Freedom	Mean Square	$F$-Statistic
Blocks	$SS_{Blocks}$	$n-1$	$MS_{Blocks} = SS_{Blocks}/(n-1)$	$F_{Blocks} = MS_{Blocks}/MS_E$
Treatment A	$SS_A$	$a-1$	$MS_A = SS_A/(a-1)$	$F_A = MS_A/MS_E$
Treatment B	$SS_B$	$b-1$	$MS_B = SS_B/(b-1)$	$F_B = MS_B/MS_E$
Interaction (AB)	$SS_{AB}$	$(a-1)(b-1)$	$MS_{AB} = SS_{AB}/((a-1)(b-1))$	$F_{AB} = MS_{AB}/MS_E$
Error	$SS_E$	$(ab-1)(n-1)$	$MS_E = SS_E/((ab-1)(n-1))$
Total	$SS_T$	$abn-1$
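The blocking model and its decomposition may be sketched in a common textbook notation. The symbols below are assumptions that may differ from those used elsewhere in this chapter: $\tau_i$ and $\beta_j$ are the effects of the two factors, $\delta_k$ is the effect of block $k$, and each of the $n$ blocks contains one complete replicate of the $ab$ treatment combinations.

```latex
\begin{align*}
y_{ijk} &= \mu + \tau_i + \beta_j + (\tau\beta)_{ij} + \delta_k + \epsilon_{ijk},
          \qquad i = 1,\dots,a,\quad j = 1,\dots,b,\quad k = 1,\dots,n, \\
SS_T &= SS_{Blocks} + SS_A + SS_B + SS_{AB} + SS_E, \\
abn - 1 &= (n-1) + (a-1) + (b-1) + (a-1)(b-1) + (ab-1)(n-1).
\end{align*}
```

With $a = 3$, $b = 2$, and $n = 4$ blocks, the degrees of freedom identity reads $23 = 3 + 2 + 1 + 2 + 15$.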

Example 13.5.4. Blocking for Intensity Dataset

The intent of this experiment is to help the engineer improve the ability to detect targets on a radar system. The two variables chosen, which are believed to have the most impact on the detection ability of the radar system, are the amount of background noise and the type of filter on the screen. The background noise Ground is classified into one of three levels: low, medium, and high. Two types of filter are available. However, there are different operators who use the radar system and this information forms the blocks. The dataset is available in the intensity.txt file. The formula for the aov object needs to be specialized for the factorial experiment with blocking. The R program next delivers the required computation.

> data(intensity)
> intensity.aov <- aov(Intensity~ Ground*Filter+Error(Operator),intensity)
> summary(intensity.aov)
Error: Operator
          Df Sum Sq Mean Sq F value Pr(>F)
Residuals  3 402.17  134.06
Error: Within
              Df  Sum Sq Mean Sq F value    Pr(>F)
Ground         2  335.58  167.79 15.1315 0.0002527 ***
Filter         1 1066.67 1066.67 96.1924 6.447e-08 ***
Ground:Filter  2   77.08   38.54  3.4757 0.0575066 .
Residuals     15  166.33   11.09
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> intensity.aov
Call:
aov(formula = Intensity ~ Ground * Filter + Error(Operator),
    data = intensity)
Grand Mean: 94.91667
Stratum 1: Operator
Terms:
                Residuals
Sum of Squares   402.1667
Deg. of Freedom         3
Residual standard error: 11.57824
Stratum 2: Within
Terms:
                   Ground    Filter Ground:Filter Residuals
Sum of Squares   335.5833 1066.6667       77.0833  166.3333
Deg. of Freedom         2         1             2        15
Residual standard error: 3.329998
Estimated effects may be unbalanced

The reader is asked to attempt an interpretation of the results here and then to compare it with Montgomery (2005) and Lalanne (2006) for more details. □
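To see what the Error(Operator) term does, the blocked fit may be compared with a fit in which the blocks enter as an ordinary additive term. The following is a minimal sketch on simulated data with four operators, three noise levels, and two filters; the data values, the operator shifts op_shift, and the object names are hypothetical and unrelated to the intensity dataset.

```r
# Simulated blocked 3 x 2 factorial: every operator sees all six combinations once
set.seed(2024)
toy <- expand.grid(Operator = factor(1:4),
                   Ground = factor(c("low", "medium", "high"),
                                   levels = c("low", "medium", "high")),
                   Filter = factor(1:2))
op_shift <- c(-6, -2, 3, 5)  # hypothetical operator-to-operator differences
toy$Intensity <- 95 + op_shift[as.integer(toy$Operator)] +
  4 * as.integer(toy$Ground) - 7 * as.integer(toy$Filter) +
  rnorm(nrow(toy), sd = 3)

# Blocks as a separate error stratum
fit_err <- aov(Intensity ~ Ground * Filter + Error(Operator), data = toy)
print(summary(fit_err))

# Blocks as an additive term in a single-stratum model
fit_fixed <- aov(Intensity ~ Operator + Ground * Filter, data = toy)
tab_fixed <- summary(fit_fixed)[[1]]
print(tab_fixed)
```

In a balanced layout such as this one, the treatment rows of the two printed tables are expected to agree. The difference lies in presentation: the Error() form reports the operator variation as its own stratum, while the additive form lists it as one more row with 3 degrees of freedom, leaving 15 for the residuals.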

The theory and applications of Experimental Designs extend far beyond the models discussed here; the purpose of the current chapter has been to consider some of the important models among them.

13.6 Further Reading

Kempthorne (1952), Cochran and Cox (1958), Box, Hunter, and Hunter (2005), Federer (1955), and of course, Fisher (1971) are some of the earliest treatises in this area.

Montgomery (2005), Wu and Hamada (2000–9), and Dean and Voss (1999) are some of the modern accounts in DOE. Casella (2008) is a very useful source with emphasis on computations using R. The web link http://cran.r-project.org/web/views/ExperimentalDesign.html is very useful for the reader interested in almost exhaustive options for DOE analysis using R.

13.7 Complements, Problems, and Programs

Problem 13.1 Identify which of the design models studied in this chapter are appropriate for the datasets available in the BHH2 package. The list of datasets available in the package may be found with try(data(package="BHH2")). The exercise may also be repeated for design-related packages such as agricolae, AlgDesign, and granova.
Problem 13.2 Carry out the diagnostic tests for the olson_crd fitted model in Example 13.3.4. Repeat a similar exercise for the ANOVA model fitted in Example 13.3.2.
Problem 13.3 Multiple comparison tests of Dunnett, Tukey, Holm, and Bonferroni have been explored in Example 13.3.8. The confidence intervals are reported only for TukeyHSD. The reader should obtain the confidence intervals for the contrasts of the remaining multiple comparison tests.
Problem 13.4 Explore the use of the functions design.crd and design.rcbd from the agricolae package for setting up CRD and block designs.
Problem 13.5 The function granova.2w may be applied on the girdernew dataset with a slight modification. Create a new data frame girdernew2 <- girdernew[,c(3,1,2)]. Note that an additional R package rgl will be required though. Test the code granova.2w(girdernew2,ss~gf+mf) and make an attempt to interpret the output.
Problem 13.6 In the fitted models ssaov, hardness_aov, rocket_aov, rocket.aov, and rocket.glsd.aov discussed in Section 13.4, investigate the presence of an interaction effect through the use of the interaction.plot graphical function.
Problem 13.7 Perform the diagnostic tests on the BIBD model in Example 13.4.4.
Problem 13.8 Investigate the presence of outliers and influential measures for the fitted models rocket.lm and rocket.glsd.lm in the respective Examples 13.4.7 and 13.4.9.
Problem 13.9 Obtain the confidence intervals for the contrasts of the fitted model battery.aov in Example 13.5.1.
Problem 13.10 Carry out the diagnostic tests for the fitted models bottling.aov, SP.aov, and intensity.aov.
