7.3. Nonlinear Regression Models with a Continuous Response

In this section we first formulate the optimal design problem for general nonlinear regression models and then consider a popular dose-response model with a continuous response variable (four-parameter logistic, or Emax, model).

7.3.1. General Nonlinear Regression Model

Consider a general nonlinear regression model that is used in the analysis of concentration- or dose-response curves in a large number of clinical and pre-clinical studies. The model describes the relationship between a continuous response variable, Y, and design variable x (dose or concentration level):


Here yij's are often assumed to be independent observations with Var(yij|xi;θ) = σ2(xi), i.e., the variance of the response variable varies across doses or concentration levels. Also, xi is the design variable that assumes values in a certain region, χ. Lastly, θ is an m-dimensional vector of unknown parameters.

When the response variable Y is normally distributed, the variance matrix of the MLE, , can be approximated by D(ξ,θ)/N, see (7.4), and the information matrix of a single observation is


where f(x,θ) is a vector of partial derivatives of the response function η(x,θ),


Now, as far as optimal experimental design is concerned, the sequence of steps that follow is exactly the same as for quantal dose-response models considered in Section 7.2:

  1. Select a preliminary parameter estimate, .

  2. Select an optimality criterion, Ψ = Ψ[D(ξ,)], e.g., the D-optimality criterion.

  3. Construct a locally optimal design ξ*() for the selected criterion.

It is recommended that sensitivity analysis is performed, i.e., steps 1 through 3 must be repeated for slightly different s,s = 1,...,Ns, to verify robustness of the design ξ*(); see Atkinson and Fedorov (1988).

Since we know that μ(xi,θ) = g(xi,θ) gT(xi,θ), the process of constructing D- or A-optimal designs for the general nonlinear regression model will be virtually identical to the process of finding optimal designs for quantal dose-response models (described in detail in Section 7.2). We only need to redefine the g(x,θ) function that determines the information matrix μ(x,θ) and sensitivity functions in (7.6) and (7.7) for the D- and A-optimality criteria, respectively.

7.3.2. Example 3: A Four-Parameter Logistic Model with a Continuous Response

The four-parameter logistic, or Emax, model, is used in many pharmaceutical applications, including bioassays (of which ELISA, or enzyme-linked immunosorbent assay, and cell-based assays are popular examples). See Finney (1976), Karpinski (1990), Hedayat et al (1997), Källén and Larsson (1999) for details and references.

This example comes from a study of a compound that inhibits the proliferation of bone marrow erythroleukemia cells in a cell-based assay (Downing, Fedorov and Leonov, 2001; Fedorov and Leonov, 2005). The logistic model used in the study is defined as follows:


where Y is a continuous outcome variable (number of cells in the assay) and x is the design variable (concentration level). Further, θ1 is often denoted as ED50 and represents the concentration at which the median response (θ3 + θ4)/2 is attained, θ2 is a slope parameter, θ4 and θ3 are the lower and upper asymptotes (minimal and maximal responses).

As an aside note, the four-parameter logistic model provides an example of a partially nonlinear model since θ3 and θ4 enter the response function in a linear fashion. This linear entry implies that the D-optimal design does not depend on values of θ3 and θ4 (Hill, 1980).

7.3.2.1. Computation of the D-Optimal Design

The vector of unknown parameters θ was estimated by modeling the data collected from a 96-well plate assay, with six repeated measurements at each of the ten concentrations, {500, 250, 125, ..., 500/29} ng/ml, which represents a two-fold serial dilution design with N = 60 and weights wi = 1/10. The estimated parameters are


The design region will be set to


on a log-scale. The x values will be exponentiated to compute the response function η(x,θ).

Program 7.3 constructs the D-optimal design for the introduced four-parameter logistic model by calling the %OptimalDesign1 macro. As in Programs 7.1 and 7.2, we need to define the following parameters and pass them to the macro:

  • The initial design is based on ten equally-spaced and equally-weighted log-transformed concentration levels within the design region (POINTS and WEIGHTS variables).

  • The grid consists of 801 equally-spaced points in χ = [−0.024,6.215].

  • The true values of the model parameters are specified using the PARAMETER variable.

  • The algorithm parameters are identical to those used in Programs 7.1 and 7.2.

  • To specify the g(x,θ) function, we need to compute a vector of partial derivatives. The derivatives are defined in the %deriv macro. The MATRIX variable contains the values of x. The DER1, DER2, DER3 and DER4 variables represent the derivatives of the response function with respect to θ1, θ2, θ3 and θ4, respectively, and the DERIVATIVE variable contains the vector of partial derivatives.

Also, note that, in order to improve the stability of the iterative procedure, the merging constant (CMERGE) needs to be increased to 12.

As before, we will concentrate on the user-defined parameters. The complete SAS code can be found on the book's companion web site.

Example 7-3. D-optimal design for the four-parameter logistic model with a continuous response (Design parameters, algorithm parameters and g(x,θ) function)
* Design parameters;
%let points=do(-0.024,6.215,6.239/9);
%let weights=repeat(0.1,1,10);
%let grid=do(-0.024,6.215,6.239/800);
%let parameter={15.03 1.31 530 1587};
* Algorithm parameters;
%let convc=1e-9;
%let maximit=1000;
%let const1=1;
%let const2=1;
%let cmerge=12;

* G function;
%macro deriv(matrix,derivative);
    nm=nrow(&matrix);
    one=j(nm,1,1);
    dera=(parameter[4]-parameter[3])/(1+(exp(&matrix[,1])/parameter[1])
        ##parameter[2])##2#(parameter[2]/parameter[1])#(exp(&matrix[,1])
        /parameter[1])##parameter[2];
    derb=-(parameter[4]-parameter[3])/(1+(exp(&matrix[,1])/parameter[1])
        ##parameter[2])##2#(exp(&matrix[,1])/parameter[1])##parameter[2]
        #log(exp(&matrix[,1])/parameter[1]);
    derc=1-1/(1+(exp(&matrix[,1])/parameter[1])##parameter[2]);
    derd=1/(1+(exp(&matrix[,1])/parameter[1])##parameter[2]);
    &derivative=dera||derb||derc||derd;
%mend deriv;
%OptimalDesign1;

Example. Output from Program 7.3
Initial design

                                  Weight         X

                                   0.100    −0.024
                                   0.100     0.669
                                   0.100     1.362
                                   0.100     2.056
                                   0.100     2.749
                                   0.100     3.442
                                   0.100     4.135
                                   0.100     4.829
                                   0.100     5.522
                                   0.100     6.215

                                   Optimal design

                                  Weight         X

                                   0.250    −0.024
                                   0.250     2.027
                                   0.250     3.540
                                   0.250     6.215

                  Determinants of the variance-covariance matrices

                                 INITIAL     OPTIMAL

                                0.000031    0.000015

                     Variance-covariance matrix, initial design

                          COL1      COL2      COL3      COL4

                         0.019     0.000    −0.103    −0.283
                         0.000     0.000     0.027    −0.041
                        −0.103     0.027     6.745    −3.599
                        −0.283    −0.041    −3.599    12.639

Variance-covariance matrix, optimal design

                          COL1      COL2      COL3      COL4

                         0.015     0.000    −0.118    −0.181
                         0.000     0.000     0.013    −0.019
                        −0.118     0.013     5.086    −1.065
                        −0.181    −0.019    −1.065     7.087

Output 7.3 shows that the D-optimal design includes two points on the boundary of the design region (x1* = −0.024 and x4* = 6.215) and two in the middle of the design region (x2* = 2.027 and x3* = 3.540). The optimal weights are equal to 1/m = 0.25 (compare Output 7.1); see the left panel in Figure 7.5. As in Examples 1 and 2 given in Section 7.2, the sensitivity function is equal to the number of the model parameters, m = 4, at the optimal concentrations (see the right panel in Figure 7.5).

Figure 7-5. Left panel: Initial (open circles) and optimal (closed circles) designs. Right panel: Sensitivity functions for the initial (dashed curve) and optimal (solid curve) designs.

Output 7.3 also helps compare characteristics of the initial and optimal designs. The determinant of the final variance-covariance matrix is 1.5 × 10−5, compared to the value of 3.1 × 10−5 for the initial design. The variance of individual parameter estimates has improved, too. For example, the variance of θ1 has dropped from 0.019 to 0.015. Parameter θ1, or ED50, is usually the most important for practitioners. This example illustrates a good performance of a D-optimal design with respect to other criteria of optimality, like c-criterion for the estimation of θ1.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.139.107.210