Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

10
Multiple Decision Problems

10.1 Introduction

We will make statements about a ≥ 2 populations (distributions). We first discuss selection procedures and later multiple comparisons of means. Gupta and Huang (1981) give a good overview of multiple decision problems.

10.2 Selection Procedures

We start with a set G of a populations, i.e. G = {P_i, i = 1, … , a}. These populations correspond with random variables with parameter vectors for which at least one component is unknown. This general approach is described in Rasch and Schott (2018). In this book we restrict ourselves to a special case. In P_i we consider stochastically independent random variables with a ‐distribution.

It is our aim to select the population with the largest expectation or the populations with the t < a largest expectations. In Section 10.7 we discuss selection procedures for variances.

We first order the populations in G by magnitude of their expectations. For this, we need an order relation.

10.2.1 The Indifference Zone Formulation for Selecting Expectations

Selection Problem 5.3. (Bechhofer, 1954). For a given risk of wrong decision β with images and δ > 0 a subset M_B of size t from G has to be selected. Selection is based on random samples from A_i with normally distributed components y_i. Select M_B in such a way that the probability P(CS) of a correct selection is

10.1

In (5.1) d(G₁, G₂) is the distance between A_{a − t + 1} and A_a − t. the value δ, given in advance and, besides β, is part of the precision requirement.

The condition images is reasonable, because for images no real statistical problem exists. Without experimenting, one can denote any of the subsets of size t by M_B and with images it fulfils (5.1).

The region [μ_{a − t + 1}, μ_{a − t + 1} − δ] is the indifference zone and Selection Problem 5.3 is often called the indifference zone formulation of selection problems.

From Guiard (1994) a modified problem formulation is:

Selection Problem 10.1A. Select a subset M_B of size t corresponding to Selection Problem 5.7 in such a way that in place of (5.1)

10.2

is used. Here is the set in G, containing all A_i with μ_i ≥ μ_{a − t + 1} − δ.

For practical purposes, this formulation is easier to understand. Later, we show how results for Selection Problem 5.3 apply to Selection Problem 10.1A.

For the selection procedure, take from each of a normal populations with the random variable y_i with E(y_i) = μ_i; var(y_i) = σ² a random sample . These a random samples are assumed to be stochastically independent and the components y_ij are assumed to be distributed like y_i. We base decisions on the estimators or its realisations of μ_i.

10.2.1.1 Indifference Zone Selection, σ² Known

We first assume that the variances of the a populations are equal and known. In practice this is rarely the case but this approach gives a better understanding.

Selection Rule 10.1. From the a independent random samples the sample means …, are calculated and then we select the t populations with the t largest means into the set M_B, see Bechhofer (1954).

Selection Rule 10.1 can only be applied, if σ² is known. If σ² is unknown, we apply multi‐stage selection procedures in Section 5.4.1.

Problem 10.1

Determine the minimal sample size n so that for given a and δ the probability of a correct selection of the largest expectation P_C in (5.3) is at least 1 − β.

Solution

Using the R‐command in the package OPDOE:

> size.selection.bechhofer(a=,beta=,delta=,sigma=)

we can calculate n.

Example

For a = 10, P_C = 0.95, δ = 0.5, σ =1 we get

 > library(OPDOE)
> size.selection.bechhofer(a=10,beta=0.05,delta=0.5,sigma=1)
[1] 47

and therefore need in each sample 47 observations.

For a = 10, P_C = 0.95, δ = 1, and σ =1 we get

 > size.selection.bechhofer(a=10,beta=0.05,delta=1,sigma=1)
[1] 12

Problem 10.2

Determine the minimal sample size n so that for given a and δ = cσ the probability of a correct selection of the t largest expectation P_C in (5.3) is at least 1 − β.

Solution

At first add to R the command to make a function

> Bech.snr = function(a,t, c, beta) # Bechhofer's optimum sample number

   {
   K=(1-beta)/t

BechInt = function(n,a,t,c) # Bechhofer‐Integral

{d =c *sqrt(n)

          integrand=function(z)
         {(pnorm(z+d)^(a-t))*((1-pnorm(z))^(t-1))*dnorm(z)}
         intnew=integrate(integrand,-Inf,Inf)$value
         intnew
        }

f = function(n,a,t,c, beta)}BechInt(n,a,t,c)- K}

k = uniroot(f,c(1,1000),a = a,t = t,c = c,beta = beta)$root

    # Finds the unique minimum integer k0 for which f(k0)>K
   k0=ceiling(k)
   print(paste("optimum sample number: n =",k0), quote=F)
   }

Then use the function

> Bech.snr(a=,t=,c =,beta=)

Example

We use:

a = 20, c = 0.5, t = 4, β = 0.05.
a = 10, c = δ = 0.5, t = 1, β = 0.05, to show that we can solve the example of Problem 5.3 without OPDOE.
a = 10, , t = 1, β = 0.05.

The results are

> Bech.snr(a = 20,t = 4, c = 0.5,beta = 0.05)
[1] optimum sample number: n = 75
> Bech.snr(a = 10,t = 1, c = 0.5,beta = 0.05)
[1] optimum sample number: n = 47
> Bech.snr(a = 10,t = 4,c = 1.27,beta = 0.05)
[1] optimum sample number: n = 10

Of course, in (2) we obtain the same sample size as in the example of Problem 5.3.

Example 10.1

To select the four populations with the largest expectations from a = 10 populations with equal variances we observed ten data per population and received the sample means in Table 5.1.

Table 10.1 Sample means of Example 10.1.

Popula tion	P₁	P₂	P₃	P₄	P₅	P₆	P₇	P₈	P₉	P₁₀
	138.6	132.2	138.4	122.7	130.6	131.0	139.2	131.7	128.0	122.5

Using Selection Rule 10.1, we have to select the populations P₇, P₁, P₃, P₂.

10.2.1.2 Indifference Zone Selection, σ² Unknown

Bechhofer showed for Selection Rule 10.1 that P_C has the maximal lower bound of (5.2) if we have normal distributions with known and equal variances and use n_i = n; i = 1, … , a for fixed a, t, δ.

If σ² is unknown a two‐stage selection rule is proposed.

Selection Rule 10.2. Calculate from observations (realisations of y_ij) y_ij(i = 1, … , a; j = 1, … , n₀) from the a populations A₁, … , A_a with

the estimate

10.6

with df = a(n₀ − 1) degrees of freedom, as in Table 5.2 with N = an₀. For given a, c, t, and β we calculate using the R‐command of Problem 5.6 the sample size n. If n > n₀ we take from each of the a populations n − n₀ additional observations, otherwise n₀ is the final sample size. With n or n₀ we continue as in Selection Rule 10.1.

In place of Selection Problem 10.l, Selection Problem 10.1A can always be used. There are advantages in applications. The researcher could ask what can be said about the probability that we really selected the t best populations, if (μ_{a − t + 1}, μ_a − t) < δ . An answer to such a problem is not possible, it is better to formulate Selection Problem 10.1A, which can better be interpreted, and where we now know at least with probability 1 −β that we selected exactly t populations not being more than δ worse than A_{a − t + 1}.

Guiard (1994) showed that the least favourable cases concerning the values of P_C and for Selection Problem 5.7 and 10.1A are identical. By this, the lower bounds 1 −β in (10.2) are identical for Selection Problems 5.3 and 10.1A. We call the probability of a δ‐precise selection.

10.3 The Subset Selection Procedure for Expectations

Now it is again our aim to select the population with the largest expectations. The size r of the selected subset M_G is random. Selection is based on a stochastically independent random samples ; its components y_ij are assumed to be distributed like y_i, which are N(μ_i, σ²) distributed. We base decisions on the estimators or its realisations of μ_i.

Selection Problem 5.6 (Gupta 1956).

For a given risk β of incorrect decision with

10.7

select from G a subset M_G of random size r so that

10.8

Selection Problem 5.6 is the subset formulation of the selection problem.

The following selection rule stems from Gupta and Panchapakesan (1970, 1979).

Selection Rule 10.3. We use the estimators based on a samples of equal size n, which are ‐distributed.

All the A_i are put into M_G, for which

10.9

We have to choose with a pre‐given D so that

10.10

where Φ and ϕ are the distributions function and the density function of the standardised normal distribution, respectively.

If σ² is unknown we write approximately, where s² is an estimate of σ², based on f degrees of freedom. Then (5.6) is replaced by

10.11

where h_f(y) is the density function of and is CS(f)‐distributed.

10.4 Optimal Combination of the Indifference Zone and the Subset Selection Procedure

Which of the two problem formulations is better suited for practical purposes? Often experiments with a technologies, a varieties, medical treatments and others, have the purpose of selecting the best of them (i.e. t = 1). If we then have a huge list of candidates at the beginning – say about 500, such as in drug screening, then it is reasonable at first to reduce the number of candidates by a subset procedure down to let us say r = 20. In a second step we then use an indifference zone procedure with a = r. We will present an optimal combination of the two approaches taken from simulation results in Rasch and Yanagida (2019).

A Simulation Experiment. (Rasch and Yanagida, 2019)

We choose t = 1 and as final probability of a correct selection P_C = 0.95. Further we assume that the μ_i, i = 1, … a > 1 are expectations of normal distributions with equal variances.

We start with Gupta's approach with a probability of correct selection PC_Gu ≥ 0.95 and a sample size n_Gu as small as possible, both found by systematic search. This results in a subset of size r for each run. If A_a is not in that subset the simulation run was finished with the result ‘incorrect selection’. If A_a is in the subset, and r = 1, the simulation run was finished with the result ‘correct selection’. If, however, A_a is in the subset and r > 1 we continue with Bechhofer's approach with a sample size n_B from the R‐program of Selection Problem 5.3.

How can the free parameters of both problems be combined in an optimal way so that the overall probability of correct selection is P_C = 0.95 and the total experimental size

10.12

is as small as possible?

Rasch and Yanagida (2019) showed that for unknown variances and a ≥ 30 nearly no difference occurs between known and unknown variances due to the large degrees of freedom of the t‐distribution. Therefore, for a not too small the results are valid for unknown variances too.

In the simulation experiment, we used Gupta's approach with sample sizes n_Gu shown in Table 5.2.

Table 10.2 Values of n_Gu used in the simulation experiment.

	`a`
`δ`	`30`	`50`	`100`	`200`
	19	20	20	23
δ = σ	5	5	5	6

The simulated observations in each sample and each run are x_ij, i = 1, … , a; j = 1, … , n_Gu. Each x_ij is a realisation of a normally distributed random variable with expectation μ_i; i = 1, … , a; μ_a = 1; μ_j = 0, j = 1, … , a−1 and variance σ² = 1.

Using Gupta's approach for the a realised sample means results in a subset of size r.

If is in this subset and r > 1 we simulate from the r populations in this subset samples of size obtained by R.

Using Bechhofer's approach results in an a best‐selected population; if it is not A_a, the selection is incorrect. We used in the simulation experiment besides the probability of correct selection P_C = 0.95 a = 30, 50, 100, 200. All simulations had 100 000 runs, which means that the relative frequencies of correct selection are not far from 0.95. When in N_f of the runs the selection was incorrect (either already after Gupta's approach or at the end) then

is a good estimate of P_C.

Table 5.3 shows the average size of the selected subset, which must be used as the number of populations in the indifference zone selection.

Table 10.3 Values of average found in the subset selection.

	`a`
δ	`30`	`50`	`100`	`200`
	12.25	18.81	32.08	59.81
δ = σ	12.48	18.70	31.91	62.07

Table 5.4 shows the optimal values of P_B. Table 5.5 shows the average total number of observations needed for both selection procedures and Table 5.6 the estimated overall probability of correct selection.

Table 10.4 Optimal values of P_B used in the simulation experiment.

	`a`
δ	`30`	`50`	`100`	`200`
	0.981	0.983	0.991	0.988
δ = σ	0.975	0.981	0.991	0.982

Table 10.5 Average total size of the simulation experiment (upper entry) and the size needed for Bechhofer's approach only (lower entry).

	`a`
`δ`	`30`	`50`	`100`	`200`
δ = σ/2	1151.7 1830	2013.70 3350	4253.95 7500	8836.00 16 400
δ = σ	288.38 480	500.25 850	1071.70 1900	2215.90 4200

Table 10.6 Relative frequencies of correct selection calculated from 100 000 runs.

	`a`
`δ`	`30`	`50`	`100`	`200`
	0.9519	0.9512	0.9510	0.9504
d = σ	0.9504	0.9516	0.9507	0.9502

We learn from the simulation experiment that a combination of Gupta's and Bechhofer's approach leads to a smaller total sample size than the use of Bechhofer's approach alone.

In the mean time further results are obtaind by simulation for t >1 and for non‐normal distributions. For this see the special issue of JSTP and a proceedings volume (Springer) of the 10th International Workshop on Simulation and Statistics, Salzburg 2019.

10.5 Selection of the Normal Distribution with the Smallest Variance

Let the random variable x in P_i be ‐distributed with known μ_i. From n observations from each population P_i; i = 1, … , a we calculate

In the case of unknown μ_i we use estimates _i and calculate

The y_i will be used to select the population with the smallest variance; each y_i has the same number f of degrees of freedom (if μ_i is known we have f = n and if μ_iis unknown then f = n−1).

Selection for the smallest variance follows from Selection Rule 10.4.

Selection Rule 10.4.

Put into M_G all P_i for which

, where is the smallest sample variance. z* = z(f, a, β) ≤ 1 depends on the degrees of freedom f, the number a of populations and on β.

For z* we choose the largest number, so that the right‐hand side of 5.7 equals 1 − β. We have to calculate P(CS) for the least favourable case given by (monotonicity of the likelihood ratio is given). We denote the estimates of as usual by and formulate (for the proof see Rasch and Schott (2018, theorem 11.5)

10.6 Multiple Comparisons

Real multiple decision problems (with more than two decisions) are present; if we consider results of some tests simultaneously, their risks must be mutually evaluated. Of course, we cannot give a full overview about methods available. For more details, see Miller (1981) and Hsu (1996).

We consider the situation that a populations P_i, i = 1, … , a are given in which random variables y_i, i = 1, … , a are independent of each other normally distributed with expectations μ_i and common but unknown variance σ². When the variances are unequal, some special methods are applied and we discuss only some of them. The situation is similar to that of the one‐way ANOVA in Chapter 5; the a populations are the levels of a fixed factor A.

We assume that from a populations independent random samples of size n_i are drawn. Some of the methods assume the balanced case with equal sample sizes. We consider several multiple comparison (MC) problems.

MC Problem 5.3. Test the null hypothesis

against the alternative hypothesis

H_A: there exists at least one pair (i,j) with i ≠ j so that μ_i ≠ μ_j.

The first kind risk is defined not for a single pair (i,j) but for all possible pairs of the experiment and this risk is therefore called an experiment‐wise risk α_e.

MC Problem 5.6. Test each of the null hypotheses

against the corresponding alternative hypothesis

Each pair (H_0,ij; H_A,ij) of these hypotheses is independently tested of the others. This corresponds with the situation of Chapter 3 for which the two‐sample t‐test was applied.

The first kind risk α_ij is defined for the pair (H_0,ij; H_A,ij) of hypotheses and may differ for each pair. Often we choose all α_ij = α_c and call it a comparison‐wise first kind risk. If we perform all t‐tests using the two‐sample t‐test or the Welch test than we speak about the multiple t‐procedure or W‐procedure.

MC Problem 5.10. One of the populations (without loss of generality P_a) is prominent (a standard method, a control treatment, and so on). Test each of the a − 1 null hypotheses

10.14

against the alternative hypothesis

10.15

The first kind risk α_i for the pair (H_0,i; H_A,i) of hypotheses is independent of that one of other pairs and therefore also called comparison‐wise. Often we choose α_i = α_c.

If we use the term experiment‐wise, risk of the first kind α_e in MC Problem 5.10 means that it is the probability that in at least one of the pairs (H_0,ij; H_A,ij) of hypotheses in MC Problem 5.6 or that in at least one of the a − 1 pairs (H_0,i; H_A,i) of hypotheses in MC Problem 5.10 the null hypothesis is erroneously rejected. However, make sure that such a risk is not really a risk of a statistical test but a probability in a multiple decision problem because when we consider all possible pairs (null hypothesis–alternative hypothesis) of MC Problem 5.6 or 5.10, we have a multiple decision problem with more than two possible decisions if a > 2.

In general we cannot convert α_e and α_c into each other. The asymptotic (for known σ²) relations for k orthogonal contrasts

10.16

10.17

follow from elementary rules of probability theory, because we can assign to the independent contrasts independent F‐tests (transformed z‐tests) with f₁ = 1, f₂ = ∞ degrees of freedom.

To solve MC Problems 5.6 and 5.10 we first construct confidence intervals for differences of expectations as well as for linear contrasts in these expectations. With these confidence intervals, the problems easily are handled.

As already mentioned at the start of Section 5.1.1, we assume that from a populations independent random samples of size n_i are drawn. We consider the a populations P_i as the a levels of a fixed factor A in a one‐way analysis of variance as discussed in Section 5.3, therefore we write

10.18

and call μ the overall expectation and μ + a_i the expectation of the ith level of population P_i. The total size of the experiment is . In applied statistics, we use the notation ‘multiple comparison of means’ synonymously with ‘multiple comparison of expectations’.

10.6.1 The Solution of MC Problem 5.3

We can solve MC Problem 5.3 by several methods. At first, we use the F‐test from Chapter 5.

10.6.1.1 The F‐test for MC Problem 5.3

If MC Problem 5.3 is handled by the F‐test, we use the notations of Table 5.2. H₀ : μ₁ = μ₂ = ⋯ = μ_a is rejected, if

10.19

Problem 10.3

Determine in a balanced design the subclass number n for MC Problem 5.7 for a precision determined by α_e = 0.05, β = 0.05, and δ/σ = 1.

Solution

Use the design function of the R‐package OPDOE for the one‐way analysis of variance:

 >size.anova(model="a", a= ,alpha= ,beta= ,delta= ,case= ).

Example

Determine n_min and n_max for a = 10, α_e = 0.05, β = 0.05, and δ/σ = 1. Attention, the δ in the command corresponds with δ/σ in our precision requirement.

 > library(OPDOE)
> size.anova(model="a", a=10, alpha=0.05, beta=0.05, delta=1, case="minimin")
 n
11
> size.anova(model="a", a=10, alpha=0.05, beta=0.05, delta=1,  case="maximin")
 n
49

We have to draw between 11 and 49 observations from each population.

10.6.1.2 Scheffé's Method for MC Problem 5.3

The method proposed in Scheffé (1953) allows the calculation of simultaneous confidence intervals for all linear contrasts of the μ + a_i in (10.17).

We now reformulate MC Problem 5.3 as an MC problem to construct confidence intervals. If H₀ is correct then all linear contrasts in the μ_i = μ + α_i equal zero. The validity of H₀ conversely follows from the fact that all linear contrasts vanish.

Therefore, confidence intervals K_r for all linear contrasts L_r can be constructed in such a way that the probability that L_r ∈ K_r for all r is at least 1 − α_e. We then reject H₀ with a first kind risk α_e, if at least one of the K_r does not cover L_r.

Problem 10.4

Use Scheffé's method to test the null hypothesis of MC Problem 5.3. We consider all linear contrasts L_r in the μ_i.

Solution

Construct

10.20

This confidence interval contains all with probability 1 − α_e. Here

10.21

(from Table 5.2). Using (5.15) to construct confidence intervals for all differences of expectations only, then the confidence interval in (10.20) has a too large expected width, it contains the differences of expectations with a probability ≥1 − α_e. We say in such cases that the confidence intervals and the corresponding tests are conservative. Therefore, the minimal sample sizes are too large.

Example

We construct confidence intervals (5.15) for the images differences of a = 10 expectations for α_e = 0.05 and means calculated with 20 observations each. In this case we have images = and . At first we calculate F(9, 200 − 10|0.95) with R as

 >   qf(0.95,9,190)
[1] 1.929425

and from this = 4.16741 and = 0.3162. Then we obtain as confidence intervals for the Scheffé method. Pairs of sample means differing by more than 1.318 standard deviations lead to significant differences in the corresponding expectations.

Example 10.2

We use data simulated by Rasch and Schott (2018, example 11.4). A (pseudo‐)random number generator has generated 10 samples of size five each. The values of the samples 1 to 8 are realisations of an N(50, 64)‐normally distributed random variable; the two other samples differ only in expectations, we have μ₉ = 52 and μ₁₀ = 56, respectively. Table 5.7 gives the simulated samples.

Table 10.7 Simulated observations of Example 5.7.

	`Number of sample`
y_ij	`1`	`2`	`3`	`4`	`5`	`6`	`7`	`8`	`9`	`10`
y_i1	63.4	49.6	50.3	55.5	62.5	30.7	56.7	64.5	44.4	55.7
y_i2	46.7	48.4	52.8	36.1	45.8	48.6	46.2	42.2	38.2	64.7
y_i3	59.1	49.3	52.5	54.0	52.8	45.8	41.9	49.6	64.8	61.8
y_i4	60.7	48.3	58.6	55.9	44.9	44.9	55.8	48.9	43.7	38.9
y_i5	54.9	51.5	48.0	52.9	51.3	52.9	48.9	40.7	61.3	61.8
	56.96	49.42	52.44	50.88	51.46	44.58	49.90	49.18	50.48	56.58

Table 5.8 contains the differences between the means.

Table 10.8 Differences between means of Example 5.7.

`j`	`2`	`3`	`4`	`5`	`6`	`7`	`8`	`9`	`10`
`i`
1	7.54	4.52	6.08	5.50		7.06	7.78	6.48
2		−3.02	−1.46	−2.04		−0.48	0.24	−1.06	−7.16
3			1.56	0.98	7.86	2.54	3.26	1.96	−4.14
4				−0.58	6.30	0.98	1.70	0.40	−5.70
5					6.88	1.56	2.28	0.98	−5.12
6						−5.32	4.60	−5.90	−
7							0.72	−0.58	−6.68
8								−1.30	−7.40
9									−6.10

10.6.1.3 Bonferroni's Method for MC Problem 5.3

Confidence intervals by Scheffé's method are not optimal if confidence intervals for k special but not for all contrasts are wanted. Sometimes we obtain shorter intervals using the Bonferroni inequality (Bonferroni, 1936).

Theorem 10.5

If the components y_i of an a‐dimensional random vector y^T = (y₁, … , y_a) with distribution function F(y₁, … , y_a) have the same marginal distribution function F(y) then the Bonferroni inequality

10.22

is valid with the distribution function F(y_i) of y_i.

The proof is given in Rasch and Schott (2018, Theorem 11.7).

If k special linear contrasts are given, then the estimator is N(L_r, k_rσ²)‐distributed for each r images . Then

10.23

with (from Table 5.2) are components of a k‐dimensional random variable. The marginal distributions are central t‐distributions with degrees of freedom and the density f(t, ν).

The Bonferroni inequality allows us to find a lower bound of the probability that all t_r values (r = 1,…,k) lie between −w and w(w > 0). Due to the symmetry of the t‐distribution and Theorem 5.8 we get

10.24

We choose w so that the right‐hand side of (5.17) equals (1 − α_e) and obtain simultaneous (1 − α_e)‐confidence intervals for the L_r as

10.25

This means we determine w so that

10.26

and the Bonferroni inequality has the form

10.27

10.6.1.4 Tukey's Method for MC Problem 5.3 for n_i = n

Tukey's method (1953) is applied for equal numbers of observations in all a normally distributed samples.

Definition 10.3

If X = (x₁, … , x_a)^T is a random sample with independent N(μ, σ²)‐distributed components and if νs²/σ² is independent of Y CQ(v)‐distributed and w is the range of (x₁, … , x_a) we call the random variable

the studentised range of X and s². The studentised augmented range is the random variable

Note that √2 |q*| equals the critical value traditionally denoted by q, the upper α‐quantile of the studentised range distribution with a treatments and ν degrees of freedom.

In applications, the x_i are typically the means of random samples each of size n, s² is the pooled variance, and the degrees of freedom are ν = a(n − 1).

Tukey's method is based on the distribution of q_{a, ν}. The distribution function of the studentised range q_{a, v} is given by formula (11.38) in Rasch and Schott (2018).

We denote by q(a, ν|1 − α) the (1 − α)‐quantile of the distribution function of q_{a, ν} in dependency on the number a of components of Yand the degrees of freedom of s² in Definition 5.16.

Tukey's method to construct confidence intervals for the differences of the expectations of a independent N(μ_i, σ²)‐distributed random variables y_i(i = 1, … , a) is based on the equivalence of the probabilities

and

This equivalence is a consequence of the fact that the validity of the inequality in the second term is necessary and sufficient for the validity of the inequality in the first term. The maximum of a set of random variables is understood as its largest order statistic and

is the range w of N(0, σ²)‐distributed random variables, if the y_i independently from each other are N(μ_i, σ²)‐distributed. We know from theorem 11.8 in Rasch and Schott (2018) that

10.29

(5.19) gives a class of simultaneous confidence intervals with confidence coefficient 1 − α_e.

10.6.1.5 Generalised Tukey's Method for MC Problem 5.3 for n_i ≠ n

Spjøtvoll and Stoline (1973) generalised Tukey's method without assuming n_i = n.

If all conditions of Section 10.8.1.2 are fulfilled, all linear contrasts are simultaneously covered with probability 1 − α_e by intervals

10.30

In (10.29) images , and (a, f| 1 − α) is the (1 − α)‐quantile of the distribution of the augmented studentised range (a, f) (see Rasch and Schott (2018)). These intervals, contrary to the Tukey method, depend on the degree of unbalancedness.

A further generalisation of the Tukey method can be found in Hochberg (1974) and Hochberg and Tamhane (1987) – see also Hochberg and Tamhane (2008).

Problem 10.7

Construct confidence intervals for differences μ_i − μ_j of a expectations of random variables y_i; i = 1, … , a, which are independent of each other normally distributed with expectations μ_i and common but unknown variance σ². From a populations independent random samples of size n are drawn.

Solution

From (10.30) we receive .

Example

As in the example of Problem 5.6 we use a = 10 expectations, α_e = 0.05 and means calculated with 20 observations each.

This leads to

Using R we get q^*(10,190,0.95):

 > qtukey(.95,10, df =  190)/sqrt(2)
[1] 3.201717

10.6.2 The Solution of MC Problem 5.6 – the Multiple t‐Test

In the case of equal sample sizes a multiple Welch test analogous to the two‐sample Welch test is possible.

Each of the null hypotheses

has to be tested against the corresponding alternative hypothesis

Each pair (H_0,ij; H_A,ij) of these hypotheses is tested independently of the others. This corresponds with the situation of Chapter 3 for which the two‐sample t‐test was applied.

By contrast to the two‐sample t‐test the multiple t‐test uses in place of the variances of the two samples under comparison the overall variance estimator, which is equal to the residual mean square of the one‐way ANOVA in Chapter 5. Notice that it is assumed that all a sample sizes are equal to n > 1. The test statistic for testing H_0,ij is

10.31

with MS_res from (5.8).

H_0,ij is rejected if the realisation t of t in (10.31) is larger than the (1 − α)‐quantile of the central t‐distribution with N − a degrees of freedom.

Problem 10.8

Determine the minimal sample size for testing H_0,ij; μ_i = μ_j, i ≠ j, i, j = 1, … , a for a first kind risk α, a second kind risk β, an effect size δ, and a standard deviation sd.

Solution

Use the R‐command in OPDOE: >size.multiple_t.test(a=,alpha=,beta = Delta=,sd=).

Example

We calculate the minimal sample size for testing H_0,ij; μ_i = μ_j for a = 10, a first kind risk α = 0.01, a second kind risk β = 0.1, and an effect size δ = 0.5 sd using

 > library(OPDOE)

> size.multiple_t.test(a = 10,alpha = 0.01,beta = 0.1,delta = 0.5,sd = 1) which gives

 $size
[1] 121
$total
[1] 1210

Next we use lower precision and an effect size δ = sd to obtain

 > size.multiple_t.test(a=10,alpha=0.05,beta=0.1,delta=1,sd=1)
$size
[1] 23
$total
[1] 230

10.6.3 The Solution of MC Problem 5.10 – Pairwise and Simultaneous Comparisons with a Control

Sometimes a – 1 treatments have to be compared with a standard procedure called control. We assume first that we have equal sample sizes n for the treatments.

10.6.3.1 Pairwise Comparisons – The Multiple t‐Test

If one of the a populations P, without loss of generality let it be P_a, is considered as a standard or control, we use in case of a pairwise comparison in place of (10.31)

10.32

to test each of the a − 1 null hypotheses

against the corresponding alternative hypothesis

When we use for the tests of the a − 1 differences an overall significance level α* and we use for each test of the a − 1 differences a significance level α, then the Bonferroni inequality gives α ≤ α* ≤ (a − 1) α. When we use for the test of each of the a − 1 differences a significance level α = α*/( a − 1) we have an approximate overall significance level of α*. To test with an exact overall significance level α* we must use the multivariate Student distribution t(a−1, f) with f = df(MS_res) = a(n − 1). Dunnett (1955) solved this problem. Let us now derive the optimum choice of sample sizes for the multiple t‐test with a control.

Problem 10.9

Determine the minimal sample size for testing H_0,i; μ_i = μ_a, i = 1, … , a − 1 with an approximate overall significance level α* [hence we use for the test a first kind risk α = α*/(a−1)], a second kind risk β, an effect size δ and a standard deviation sd.

Solution

Use the R‐command in OPDOE

> size.t.test(power =,sig.level =, delta =, sd=, type ="two.sample").

Example

We calculate the minimal sample size for testing H_0,i; μ_i = μ_a, i = 1, … , a − 1, for a = 10, a first kind risk α = 0.15/9 = 0.0167, a second kind risk β = 0.05, and an effect size δ = sd using

 > library(OPDOE)
> size.t.test(power=0.95,sig.level=0.0167,delta=1,sd=1,type="two.sample")

which gives

 [1] 35

The total sample size is then 10*35 = 350.

Next we use lower precision with β = 0.1 and obtain

 > size.t.test(power=0.8,sig.level=0.0167,delta=1,sd=1,type="two.sample")
[1] 23

The total sample size is then 10*23 = 230.

Often we have a fixed set of resources and a budget that allows for only N observations. To maximise the power of the tests we want to minimise the total variance. If we use n_a observations for the control population P_a and for the other treatments an equal sample size of n observations then N = (a − 1)n + n_a. The variance of _i − _a is σ² (1/n + 1/ n_a). The total variance (TSV) is the sum of the (a – 1) parts, hence TSV = (a – 1)[σ² (1/n + 1/ n_a)] and we have to minimise it subject to the constraint N = (a − 1)n + n_a. This is a Lagrange multiplier problem in calculus where we must minimise M = TSV + λ[N − (a − 1)n − n_a]. Setting the partial derivatives w.r.t. n and n_a equal to zero yields the equations:

∂M /∂n = −(a − 1) σ² /n² – λ(a − 1) = 0
∂M /∂n_a = −(a − 1) σ² /n_a² – λ = 0.

Hence n_a = N − (a − 1)n and inserting this in TSV we have a function of n as f(n); the first derivative of f(n) to n, f′(n) = 0 gives as solution the stationary point n = N/[(a − 1) + √(a − 1)]. To decide what type of extreme it is calculate f″(n) = 2/n³ + 2(a − 1)²/[N − (a − 1)n]² and this is positive for all n > 0, hence the stationary point belongs to a minimum of f(n) due to the constraint. Of course we must take integer numbers for n and n_a.

In the R package OPDOE we cannot solve the minimal sample size when we take the integer numbers for n_a = n√(a − 1) and n = N/[(a − 1) + √(a − 1)].

10.6.3.2 Simultaneous Comparisons – The Dunnett Method

Simultaneous 1 − α_e‐confidence intervals for the a − 1 differences

are constructed for the equal sample sizes n for the a samples. (After renumbering μ_a is always the expectation of the control.)

We consider a independent N(μ_i, σ²)‐distributed random variables y_i, independent of a CS(f)‐distributed random variable .

Dunnett (1955) derived the distribution of

which is the multivariate Student distribution t(a − 1, f) with f = df(MS_res) = a(n − 1), which he called the distribution d(a − 1,f) with f = df(MS_res) = a(n–1).

Dunnett (1964) and Bechhofer and Dunnett (1988) present the quantiles d(a − 1, f| 1 − α_e) of the distribution of

We see that d ≤ d(a − 1, f| 1 − α_e) is necessary and sufficient for

For all i, by

10.33

a class of confidence intervals is given, covering all differences μ_i − μ_a with probability 1 − α_e.

For the one‐way classification with the notation of Example 5.7 we receive, for equal sub‐class numbers n, the class of confidence intervals

10.34

If n_i = n, for i = 1, ..., a, we use the Dunnett procedure, based on confidence intervals of the Dunnett method. Then H_0i is rejected, if

10.35

If the n_i are not equal, we use a method proposed by Dunnett (1964) with modified quantiles. For example, when the optimal sample size of the control is taken and n_i = n for i = 1, 2, ..., a − 1, the table of critical values are given in Dunnett (1964) and Bechhofer and Dunnett (1988).

Dunnett and Tamhane (1992) proposed a further modification, called a step up procedure, starting testing the hypothesis with the least significant of the a – 1 test statistics at the left‐hand side of (5.25) and then going further with the next larger one and so on. Analogously, a step down procedure is defined. Dunnett and Tamhane (1992) showed that the step up procedure is uniformly more powerful than a method proposed by Hochberg (1988) and preferable to the step down procedure. As a disadvantage they mentioned the greater difficulty of calculating critical values and minimum sample sizes in the standard and other groups. This problem is of no importance if we use the R‐package DunnettTests together with the R‐package mvtnorm.

Problem 10.10

Determine the minimal sample size for simultaneously testing H_0,i; μ_i = μ_a; i = 1, … , a − 1 for a first kind risk α, a second kind risk β, an effect size δ, and a standard deviation sd.

Solution

We use the command of the R package DunnettTests:

 > nvDT(ratio,power,r,k,mu,mu0,contrast,sigma=NA,dist, alpha=0.05,mcs=1e+05,testcall)

Here k equals our a – 1. Further are

`ratio`	The pre‐specified ratio of sample size in each of the treatment groups to the sample size in the control group
`power`	The power required to be achieved
`r`	The least number of null hypotheses to be rejected, r ≤ 1 ≤ k
`k`	Number of hypotheses to be tested, k ≥ 2 and k ≤ 16
`mu`	Assumed population mean in each of the k treatment groups
`mu0`	Assumed population mean in the control group.
`contrast`	Because in this chapter `mu` and `mu0` are concerned with means of continuous outcomes, specify contrast = ‘means’
`sigma`	An (rough) estimate of the error standard deviation
`dist`	Whether the sample size is calculated for t‐distributed test statistics (dist = ‘tdist’) or standard normally distributed test statistics (dist = ‘zdist’)
`alpha`	The pre‐specified overall significance level, default = 0.05 must not be input
`mcs`	The number of Monte Carlo sample points to numerically approximate the power for a given sample size, refer to equations (4.3) and (4.5) in Dunnett and Tamhane (1992)
`testcall`	The applied Dunnett test procedure: ‘SD’ = step‐down Dunnett test; ‘SU’ = step‐up Dunnett test.

In Problem 3.3 it is shown what to do when σ is unknown to obtain an estimate of the error standard deviation.

Example

We use as expectation of the standard μ₀ = 2 and as expectation of the other populations μ_i = 3. We receive in the step up case for testing all r = k = 9 hypotheses assuming ratio = 1 (This is the case of equal sample sizes n for all a treatments).

 > library(DunnettTests)
> nvDT(1, 0.95, r=9, 9, mu=3, mu0=2,  sigma=1.,contrast="means",dist="zdist", testcall="SU")
$`least sample size required in each treatment groups`
[1] 34
$`least sample size required in the control group`
[1] 34

with a total experimental size N = 10 · 34 = 340. If ratio is increased the total sample size increases as we see from

 > nvDT(2, 0.95, r=9, 9, mu=3, mu0=2,  sigma=1.,contrast="means",dist="zdist", testcall="SU")
$`least sample size required in each treatment groups`
[1] 48
$`least sample size required in the control group`
[1] 24

with a total experimental size N = 9 · 48 + 24 = 456.

It can be shown that the best choice for the control is n_a = n√k, hence ratio = which gives

 > nvDT(0.3333, 0.95, r=9, 9, mu=3, mu0=2,  sigma=1.,contrast="means",dist="zdist", testcall="SU")
$`least sample size required in each treatment groups`
[1] 23
$`least sample size required in the control group`
[1] 70

with a total experimental size N = 9 · 23 + 70 = 277. A smaller ratio, let us say ratio = 0.2, leads again to a larger N.

 > nvDT(0.2, 0.95, r=9, 9, mu=3, mu0=2,  sigma=1.,contrast="means",dist="zdist", testcall="SU")
$`least sample size required in each treatment groups`
[1] 21
$`least sample size required in the control group`
[1] 105

with N = 9 · 21 + 105 = 294.

A rough graphical representation of the relation between a – 1 and N is shown in Figure 10.1.

Figure 10.1 Relationship between the total experimental size N and .

Finally, we compare the minimal sample sizes of the methods used for Example 5.7 in Table 5.9.

images — Table 10.9 Minimal sample sizes for several multiple decision problems.

Method		Remarks
Selection Rule 10.1 (t = 1) Dunnett's stepwise procedure Multiple t‐procedure F‐test	12 27.7 average 23 11 ≤ n ≤ 49	t = 1 9 comparisons 45 comparisons One test

References

Bechhofer, R.E. (1954). A single sample multiple decision procedure for ranking means of Normal populations with known variances. Ann. Math. Statist. 25: 16–39.
Bechhofer, R.E. and Dunnett, C.W. (1988). Percentage Points of Multivariate Student T Distributions, Selected Tables in Mathematical Statistics 11. Providence: American Mathematical Society.
Bonferroni, C.E. (1936). Teoria statistica delle classi e calcolo delle probabilità. Pubbl. d. R. Ist. Super. di Sci. Econom. e Commerciali di Firenze.
Dunnett, C.W. (1955). A multiple comparison procedure for comparing several treatments with a control. Jour. Amer. Stat. Ass. 50: 1096–1121.
Dunnett, C.W. (1964). New tables for multiple comparisons with a control. Biometrics 20: 482–491.
Dunnett, C.W. and Tamhane, A.C. (1992). A step‐up multiple test procedure. Jour..Amer. Stat. Ass. 87: 162–170.
Guiard, V. (1994). Different definitions of Δ‐correct selection for the indifference zone formulation. In Miescke and Rasch 1994: 176–199.
Gupta, S.S. (1956). On a Decision Rule for a Problem in Ranking Means. Mim. Ser., no 150. Univ. North Carolina.
Gupta, S.S. and Huang, D.Y. (1981). Multiple Statistical Decision Theory: Recent Develop‐Ments. New York: Springer.
Gupta, S.S. and Panchapakesan, S. (1970). On a class of subset selection procedures. Mim. Ser. 225. Purdue Univ. .
Gupta, S.S. and Panchapakesan, S. (1979). Multiple Decision Procedures: Theory and Methodology of Selecting and Ranking Populations. New York: Wiley.
Hochberg, Y. (1974). Some generalization of the T‐ method in simultaneous inference. Jour. Multiv. Anal. 4: 224–234.
Hochberg, Y. (1988). A sharper Bonferroni procedure for multiple tests of significance. Biometrika 75: 800–802.
Hochberg, Y. and Tamhane, A.C. (1987). Multiple Comparison Procedures. New York: Wiley.
Hochberg, Y. and Tamhane, A.C. (2008). Some Theory of Multiple Comparison Procedures for Fixed‐Effects Linear Models. New York: Wiley.
Hsu, J.C. (1996). Multiple Comparisons. London: Chapman & Hall.
Miescke, K.J. and Rasch, D. (eds.) (1996a). Special issue on 40 years of statistical selection theory, part I. J. Statist. Plann. Inference 54: 2.
Miescke, K.J. and Rasch, D. (eds.) (1996b). Special issue on 40 years of statistical selection theory, part II. J. Statist. Plann. Inference 54: 3.
Miller, R.G. (1981). Simultaneous Statistical Inference. New York: Springer Verlag.
Rasch, D. and Schott, D. (2018). Mathematical Statistics. Oxford: Wiley.
Rasch, D. and Yanagida, T. (2019). An optimal two‐stage procedure to select the best out of a Normal population. Journal of Statistical Theory and Practice 13: 1.
Scheffé, H. (1953). A method for judging all contrasts in the analysis of variance. Biometrika 40: 87–104.
Spjøtvoll, E. and Stoline, M.R. (1973). An extension of the T‐method of multiple comparisons to include the cases with unequal sample size. Jour. Amer. Stat. Ass. 68: 975–978.
Tukey, J.W. (1953). Multiple Comparisons. Jour. Amer. Stat. Ass. 48: 624–625.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 10 Multiple Decision Problems

Create new playlist

Sign In

Sign Up

10.1 Introduction

10.2 Selection Procedures

10.2.1 The Indifference Zone Formulation for Selecting Expectations

10.2.1.1 Indifference Zone Selection, σ2 Known

10.2.1.2 Indifference Zone Selection, σ2 Unknown

10.3 The Subset Selection Procedure for Expectations

10.4 Optimal Combination of the Indifference Zone and the Subset Selection Procedure

10.5 Selection of the Normal Distribution with the Smallest Variance

10.6 Multiple Comparisons

10.6.1 The Solution of MC Problem 5.3

10.6.1.1 The F‐test for MC Problem 5.3

10.6.1.2 Scheffé's Method for MC Problem 5.3

10.6.1.3 Bonferroni's Method for MC Problem 5.3

10.6.1.4 Tukey's Method for MC Problem 5.3 for ni = n

10.6.1.5 Generalised Tukey's Method for MC Problem 5.3 for ni ≠ n

10.6.2 The Solution of MC Problem 5.6 – the Multiple t‐Test

10.6.3 The Solution of MC Problem 5.10 – Pairwise and Simultaneous Comparisons with a Control

10.6.3.1 Pairwise Comparisons – The Multiple t‐Test

10.6.3.2 Simultaneous Comparisons – The Dunnett Method

References

Table of Contents for
10 Multiple Decision Problems

10.2.1.1 Indifference Zone Selection, σ² Known

10.2.1.2 Indifference Zone Selection, σ² Unknown

10.6.1.4 Tukey's Method for MC Problem 5.3 for n_i = n

10.6.1.5 Generalised Tukey's Method for MC Problem 5.3 for n_i ≠ n