3.7. Fixed Effects Methods for Multinomial Responses

So far, this chapter has dealt only with binary response variables. We now consider a categorical response variable yit that can take on more than two values. Without loss of generality, let's suppose that those values are the integers ranging from 1 to J. Let pijt = Prob(yit = j). We now want a model for the dependence of this probability on predictors xit and zi.

We begin with the simpler case in which we assume an ordering of the J categories. The most popular model for ordered categorical data is the cumulative logit model which, in its conventional form, is available in both PROC LOGISTIC and PROC GENMOD. A fixed effects version of the model can be written as


where is the "cumulative" probability of being in category j or higher. Unfortunately, this model does not have reduced sufficient statistics for the αi parameters. Thus, conditional maximum likelihood is not an option for estimation. One approach to estimation would be the approximate conditional method proposed by Waterman and Lindsay (1996), but that's not available in any commercial software. However, we can apply the hybrid method discussed in the previous section using robust standard errors to adjust for the lack of independence in the repeated observations for each individual.

As an example, we'll analyze data on a sample of 396 people who survived residential fires in the Philadelphia area (Keane et al. 1996). They were interviewed at 3 months, 6 months, and 12 months after the fire. The outcome variable FORGIVE is a response to the question "Have you had feelings that it is difficult to forgive yourself for anything that happened during the fire?" The possible responses were coded as follows:


1

Not at all


2

A little


3

Somewhat


4

Very much

The working data set contains 1,188 records, three for each person, corresponding to the three interviews. The predictor variables, measured at each interview, are


DEPRESS

A depression scale with values ranging from 1 to 4


RELSER

A measure of attendance at religious services:1=never, 2=occasional, 3=regular


SEVENT

Number of stressful events that have occurred since the fire or since the last interview, ranging from 0 to 5

There is also a variable SUBJID which is the id number for each person and is common to all three records for each person.

To implement the hybrid method, we must calculate person-specific means for each of the variables, merge those into the original data set, and then calculate deviations from those means:

PROC MEANS DATA=my.forgive NWAY NOPRINT;
   CLASS subjid;
   VAR relser control depress sevent ;
   OUTPUT OUT=means MEAN= mrelser mcontrol mdepress msevent;
RUN;
DATA forgive;
   MERGE my.forgive means;
   BY subjid;
   dcontrol=control-mcontrol;
   ddepress=depress-mdepress;
   drelser=relser-mrelser;
   dsevent=sevent-msevent;
RUN;

Now we're ready to estimate the cumulative logit model with PROC GENMOD:

PROC GENMOD DATA=forgive;
   CLASS time subjid;
   MODEL forgive= ddepress drelser dsevent
         mdepress mrelser msevent time / D=MULTINOMIAL;
   REPEATED SUBJECT=subjid / TYPE=IND;
   CONTRAST 'Fixed vs. Random Effects' ddepress 1 mdepress −1,
             drelser 1 mrelser −1, dsevent 1 msevent −1;
RUN;

The D=MULTINOMIAL option specifies that the dependent variable FORGIVE has a multinomial distribution with a cumulative link to the predictor variables. The default is a cumulative logit link (as opposed to probit or complementary log-log). Note that because I did not put the DESC option on the PROC statement, this model predicts the probability of the response variable having a lower value (less likelihood of difficulty forgiving oneself).

The REPEATED statement invokes GEE estimation, with standard errors and associated statistics calculated using the robust method of White (1980). Note that TYPE=IND is the only correlation structure allowed whenever D=MULTINOMIAL is specified. This means that GEE estimation presumes that there is no correlation among the repeated measures, and implies that GEE coefficient estimates are identical to those produced by conventional maximum likelihood. Nevertheless, the standard errors and test statistics are adjusted for dependence among the observations. Finally, the CONTRAST statement tests whether the coefficients for the deviation variables are the same as the coefficients for the corresponding mean variables. As we've seen before, this is equivalent to testing for fixed effects versus random effects.

Table 3.14. Output 3.13 Hybrid Estimates for Cumulative Logit Model with PROC GENMOD
Analysis Of GEE Parameter Estimates
Empirical Standard Error Estimates
Parameter EstimateStandard Error95% ConfidenceLimitsZPr>|Z|
Intercept1 −3.78530.3653−4.5012−3.0693−10.36<.0001
Intercept2 −2.96430.3492−3.6487−2.2799−8.49<.0001
Intercept3 −2.10850.3422−2.7791−1.4378−6.16<.0001
ddepress 0.30010.13890.02780.57242.160.0308
drelser 0.14620.1600−0.16730.45980.910.3606
dsevent 0.17280.07980.01650.32922.170.0303
mdepress 0.78310.12580.53651.02986.22<.0001
mrelser −0.36190.1572−0.6700−0.0538−2.300.0213
msevent 0.05790.1294−0.19570.31160.450.6543
time10.78160.17120.44601.11724.56<.0001
time20.48500.16960.15260.81752.860.0042
time30.00000.00000.00000.0000..
Contrast Results for GEE Analysis
ContrastDFChi-SquarePr>ChiSqType
Fixed vs. Random Effects311.870.0078Score

Results are displayed in Output 3.13. The coefficients for the three deviation variables (those whose names begin with D) can be interpreted as if they were fixed effects coefficients. Because these coefficients depend only on variation over time within persons, they control for all stable covariates. Among these coefficients, we see significant effects of depression and number of stressful events. As expected, those who are more depressed and who had more stressful events are more likely to have trouble forgiving themselves. It's clear, on the other hand, that the deviation score for attendance at religious services does not have an effect, even though the mean score is significant at the .02 level. The results from the CONTRAST statement indicate that we should reject the null hypothesis that the deviation coefficients are the same as the corresponding mean coefficients. The implication is that we should focus our attention on the deviation coefficients, since they control for all stable covariates.

As we have previously observed, one limitation of the GEE method is that the coefficients are population averaged rather than subject specific, implying that they are attenuated toward zero because of population heterogeneity. If you're willing to put in some additional programming effort and computer time, you can avoid this problem by fitting a random effects model with PROC NLMIXED. Because the cumulative logit model is not built into NLMIXED, however, the programming is somewhat more involved:

PROC NLMIXED DATA=forgive;
   eta=bddepress*ddepress + bdrelser*drelser + bdsevent*dsevent
      + bmdepress*mdepress + bmrelser*mrelser + bmsevent*msevent +
        t1*(time EQ 1) + t2*(time EQ 2) + alpha;
   IF forgive=1 THEN p=1/(1+EXP(-b1-eta));
   ELSE IF forgive=2 THEN p=1/(1+EXP(-b2-eta))−1/(1+EXP(-b1-
           eta));
   ELSE IF forgive=3 THEN p=1/(1+EXP(-b3-eta))−1/(1+EXP(-b2-
           eta));
   ELSE p=1−1/(1+EXP(-b3-eta));
   ll=LOG(p);
   MODEL forgive~GENERAL(ll);
   RANDOM alpha~NORMAL(0,var) SUBJECT=subjid;
   CONTRAST 'Test of fixed vs. random' bddepress-bmdepress,
             bdrelser-bmrelser,bdsevent-bmsevent,bdsevent-
             bmsevent;
   PARMS  bddepress=0 bdrelser=0 bdsevent=0 bmdepress=0
          bmrelser=0 bmsevent=0 t1=0 t2=0 b1=1 b2=2 b3=3 var=1;
RUN;

The first statement after the PROC statement defines ETA to be a linear function of the explanatory variables, including a random disturbance term ALPHA. As with previous NLMIXED programs, I've chosen names for the coefficients that are the same as the names for the covariates, except prefixed by B, making it easier to interpret the output. The next four IF and ELSE IF statements specify the probability P of observing each outcome of the response variable, as it depends on ETA. The log-likelihood LL is defined to equal the log of the probability P. The MODEL statement says simply that the response variable FORGIVE has a log-likelihood given by LL. The RANDOM statement declares ALPHA to be normally distributed with a mean of 0 and a variance parameter VAR. The CONTRAST statement tests whether the coefficients for the deviation variables are the same as the coefficients for the corresponding mean variables. Finally, the PARMS statement assigns starting values to all the parameters.

The coefficient estimates in Output 3.14 generally follow the same pattern as those in Output 3.13 produced by PROC GENMOD, but their magnitudes are all noticeably larger. This is just what we would expect from a subject-specific method. The increased magnitude does not always imply an increased level of statistical significance, however, because the standard errors also increase. For example, the coefficient for DSEVENT has a p-value of .03 in Output 3.13 and .09 in Output 3.14. Another thing to keep in mind is that PROC GENMOD took 0.2 seconds to estimate the model on my PC, whereas PROC NLMIXED took 8 seconds.

Table 3.15. Output 3.14 Estimates for Cumulative Logit Model with PROC NLMIXED
Parameter Estimates
ParameterEstimateStandard ErrorDFtValuePr>|t|AlphaLowerUpperGradient
bddepress−0.51280.1948395−2.630.00880.05−0.8957−0.12980.000115
bdrelser−0.26210.2324395−1.130.26020.05−0.71910.1949−0.00007
bdsevent−0.23190.1380395−1.680.09350.05−0.50320.039270.000148
bmdepress−1.23570.2041395−6.05<.00010.05−1.6369−0.8344−0.00013
bmrelser0.60430.24083952.510.01250.050.13101.0777−0.00004
bmsevent−0.096380.2235395−0.430.66650.05−0.53570.3430−0.00006
t1−1.11660.2639395−4.23<.00010.05−1.6355−0.59780.000016
t2−0.67500.2521395−2.680.00770.05−1.1706−0.17940.000035
b13.23320.58513955.53<.00010.052.08294.3835−0.00004
b24.53900.60963957.45<.00010.053.34065.7374−0.00004
b35.69480.63743958.93<.00010.054.44166.94800.00005
var4.15690.82133955.06<.00010.052.54225.7716−4.26E-6
Contrasts
LabelNum DFDen DFF ValuePr>F
Test of fixed vs. random33954.600.0035

Now let's consider the more complicated situation in which the J categories are unordered. The most popular model for unordered dependent variables is the multinomial logit model (also known as the generalized logit model), which we now extend to include fixed effects:


In essence, this is a set of binary logistic regression equations that simultaneously compare each category to the last category. Notice that the fixed effects αij vary both over individuals and over response values.

As with a single binary logistic model (a special case of the multinomial model), there are reduced sufficient statistics for the αij terms, namely the frequency counts over time of the different response values for each individual. By conditioning on those counts, this model can be estimated by conditional maximum likelihood. Unfortunately, there are no SAS procedures that are designed to do this. For certain simple cases in which the time-varying covariates are categorical, the model can be reformulated as a log-linear model that can be estimated with PROC GENMOD or PROC CATMOD (Tjur 1982; Conaway 1989; Darroch and McCloud 1986; Kenward and Jones 1991), but I will not pursue those methods here. Alternatively, one could break the multinomial model into a set of binary models, one model for each comparison of a particular category with a reference category (Begg and Gray 1984; Allison 1999). Each binary model could then be estimated using the conditional logistic regression methods we have already discussed in this chapter. While this approach produces consistent estimates (in the statistical sense) of the coefficients, results will differ depending on the choice of the reference category. Furthermore, there is no overall test for the effect of each variable on the response variable.

As with the cumulative logit model, it seems that the best available approach in SAS is to use the hybrid method. In fact, we can use the same data set that we created earlier for estimating the cumulative model. Unfortunately, PROC GENMOD will not estimate an unordered multinomial model, and PROC LOGISTIC does not allow for dependence among the repeated observations. Instead, we shall use PROC SURVEYLOGISTIC, which does conventional maximum likelihood estimation of the coefficients but produces standard errors and test statistics that allow for dependence among the repeated observations. Here's the code:

PROC SURVEYLOGISTIC DATA=forgive;
   CLASS time;
   MODEL forgive(REF='1')= ddepress drelser dsevent
         mdepress mrelser msevent time / LINK=GLOGIT;
   CLUSTER subjid;
   CONTRAST 'Fixed vs. Random Effects' ddepress 1 mdepress −1,
            drelser 1 mrelser −1, dsevent 1 msevent −1;
RUN;

The syntax of PROC SURVEYLOGISTIC is nearly identical to that of PROC LOGISTIC except, in this example, for the CLUSTER statement. The CLUSTER statement specifies an id variable that defines groups within which observations are allowed to be dependent. In the MODEL statement, the REF= '1' option specifies that the reference category for the dependent variable will be 1, which is the value for 83% of the cases. The default is to choose the highest value (in this case 5) as the reference category, but only about 4% of the cases have that value. Although the generalized logit model is fundamentally invariant to the choice of the reference category, choosing a reference category with few cases can make it appear as though none of the coefficients is statistically significant.

The LINK=GLOGIT option tells SAS that this is a generalized (unordered) logit model rather than the cumulative logit model, which is the default. The CONTRAST statement tests the null hypothesis that all the deviation coefficients are identical to all the corresponding mean coefficients. Although the syntax is the same as the CONTRAST statement for the cumulative model in GENMOD, the consequences are somewhat different. In our unordered model, each predictor variable has three coefficients. Each coefficient measures the effect of the variable on being in one particular category rather than the reference category. When the CONTRAST statement specifies, say, DDEPRESS 1 MDEPRESS −1, each of the three coefficients for DDEPRESS is compared with the corresponding coefficients for MDEPRESS. Thus, the chi-square statistic produced by our CONTRAST statement has nine degrees of freedom (three variables × three coefficients).

Results are shown in Output 3.15. The "Type 3 Analysis of Effects" panel gives chi-squares for the null hypotheses that all three coefficients for each variable are zero, controlling for all the other variables. These chi-squares are invariant to the choice of the reference category. We see that none of the fixed effects deviation variables is statistically significant.

The "Analysis of Maximum Likelihood Estimates" panel reports the individual coefficient estimates and associated statistics. As already noted, these coefficients are conventional maximum likelihood estimates under the assumption that all observations are independent. Hence, they are the same numbers that would be produced by PROC LOGISTIC with the LINK=GLOGIT option. The standard errors, on the other hand, are adjusted for dependence among the repeated observations for each person. In fact, they are not all that different from the standard errors produced by PROC LOGISTIC (or PROC SURVEYLOGISTIC without the CLUSTER statement). Finally, the results for the CONTRAST statement provide marginal evidence that the deviation coefficients are different from the mean coefficients.

Table 3.16. Output 3.15 Estimates for Generalized Logit Model with PROC SURVEYLOGISTIC
Type 3 Analysis of Effects
EffectDFWald Chi-SquarePr>ChiSq
ddepress36.06130.1087
drelser31.41030.7031
dsevent35.56060.1351
mdepress337.0442<.0001
mrelser37.38680.0605
msevent30.94350.8149
time624.40000.0004
Analysis of Maximum Likelihood Estimates
Parameter forgiveDFEstimateStandard ErrorWald Chi-SquarePr>ChiSq
Intercept 21−2.09650.399227.5840<.0001
Intercept 31−3.33330.546237.2401<.0001
Intercept 41−3.44580.569936.5541<.0001
ddepress 210.28690.21541.77460.1828
ddepress 310.05970.26250.05180.8200
ddepress 410.51620.24184.55660.0328
drelser 210.09790.29330.11140.7386
drelser 310.12640.40820.09580.7569
drelser 410.30570.30161.02750.3107
dsevent 210.11970.14500.68200.4089
dsevent 310.08690.19070.20760.6487
dsevent 410.29610.15053.87300.0491
mdepress 210.62850.136221.3044<.0001
mdepress 310.81420.178320.8480<.0001
mdepress 411.03470.209124.4928<.0001
mrelser 21−0.45000.17156.88240.0087
mrelser 31−0.20400.27030.56940.4505
mrelser 41−0.38160.25722.20150.1379
msevent 210.12800.14040.83130.3619
msevent 31−0.005200.20730.00060.9800
msevent 410.03260.22660.02070.8856
time1210.17450.15711.23430.2666
time1310.51630.16829.42450.0021
time1410.50690.19117.03900.0080
time2210.19840.14041.99570.1577
mdepress 411.03470.209124.4928<.0001
mrelser 21−0.45000.17156.88240.0087
Contrast Test Results
ContrastDFWald Chi-SquarePr>ChiSq
Fixed vs. Random Effects916.07810.0653

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.145.186.63