4.6. A Hybrid Approach

As we saw with linear models and logistic models, it's possible to combine the fixed effects and random effects approaches to get some of the virtues of each. As before, the first step is to calculate the mean of each time-varying predictor variable for each individual, and then calculate the deviations from those means:

PROC SORT DATA=patents2;
   BY id;
PROC MEANS DATA=patents2 NWAY NOPRINT;
   CLASS id;
   VAR rd_0-rd_5;
   OUTPUT OUT=means MEAN=mrd_0 mrd_1 mrd_2 mrd_3 mrd_4 mrd_5;
DATA patcomb;
   MERGE patents2 means;
   BY id;
   drd_0=rd_0-mrd_0;
   drd_1=rd_1-mrd_1;
   drd_2=rd_2-mrd_2;
   drd_3=rd_3-mrd_3;
   drd_4=rd_4-mrd_4;
   drd_5=rd_5-mrd_5;
RUN;

The next step is to run a regression model with both the deviations and the means as predictor variables. To do this correctly, it's important to use an estimation method that allows for dependence among the multiple observations for each individual. The simplest approach is GEE with PROC GENMOD:

PROC GENMOD DATA=patcomb;
   CLASS id t;
   MODEL patent= drd_0-drd_5 mrd_0-mrd_5 t / DIST=NB;
   REPEATED SUBJECT=id / TYPE=MDEP(4) CORRW;
   CONTRAST 'FE VS. RE' drd_0 1 mrd_0 −1,drd_1 0 mrd_1 −1,
             drd_2 1 mrd_2 −1, drd_3 1 mrd_3 −1,drd_4 1 mrd_4 −1,
             drd_5 1 mrd_5 −1;
RUN;

Here I've specified a negative binomial distribution with an MDEP correlation structure. The CONTRAST statement produces a chi-square test of the null hypothesis that all the deviation coefficients are equal to all the corresponding mean coefficients.

Results are in Output 4.16. The coefficients for the deviation variables can be interpreted as if they were fixed effects estimates in the sense that they control for all stable covariates. In fact, they are quite close to the fixed effects coefficients of the R & D variables in Output 4.12. However, the GEE standard error estimates are somewhat larger than those from the unconditional fixed effects method. The coefficients for the means are generally quite different from the deviation coefficients, although none is statistically significant. Under the GEE and random effects models of the last section, the deviation and mean coefficients should be the same. The chi-square test of that assumption, reported at the end of the output, indicates that there is marginal evidence for rejection.

Table 4.16. Output 4.16 GEE Estimates of the Hybrid Model
Analysis Of GEE Parameter Estimates
Empirical Standard Error Estimates
Parameter EstimateStandard Error95% ConfidenceLimitsZPr>|Z|
Intercept 1.09440.08760.92271.266112.49<.0001
drd_0 0.35970.12230.12010.59942.940.0033
drd_1 −0.11580.1177−0.34650.1149−0.980.3252
drd_2 0.05290.0819−0.10760.21340.650.5183
drd_3 −0.02870.0887−0.20240.1451−0.320.7465
drd_4 0.02730.0852−0.13970.19430.320.7485
drd_5 −0.07750.0781−0.23050.0755−0.990.3209
mrd_0 −0.04900.8545−1.72391.6258−0.060.9542
mrd_1 1.05901.8618−2.59024.70810.570.5695
mrd_2 −0.91961.9653−4.77142.9322−0.470.6398
mrd_3 −0.35261.5650−3.42002.7148−0.230.8218
mrd_4 1.37791.1420−0.86033.61621.210.2276
mrd_5 −0.24280.4805−1.18460.6990−0.510.6134
t10.19240.05040.09350.29123.810.0001
t20.14280.04920.04640.23912.900.0037
t30.15370.03920.07690.23053.92<.0001
t40.10190.04110.02140.18232.480.0131
t50.00000.00000.00000.0000..
Contrast Results for GEE Analysis
ContrastDFChi-SquarePr>ChiSqType
FE VS. RE612.990.0432Score

The alternative to GEE is to implement the hybrid method in the context of a random effects model. Here's the PROC NLMIXED code for doing that, with results in Output 4.17:

PROC NLMIXED DATA=patcomb;
lambda=EXP(int+d0*drd_0+d1*drd_1+d2*drd_2+d3*drd_3+d4*drd_4+
   d5*drd_5+m0*mrd_0+m1*mrd_1+m2*mrd_2+m3*mrd_3+m4*mrd_4+m5*mrd_5
   +t1*(t EQ 1)+t2*(t EQ 2)+t3*(t EQ 3)+t4*(t EQ 4)+alpha);
   MODEL patent~NEGBIN(theta,(theta/(lambda+theta)));
   RANDOM alpha~NORMAL(0,s2) SUBJECT=id;
   PARMS int=1 d0=0 d1=0 d2=0 d3=0 d4=0 d5=0 m0=0 m1=0
         m2=0 m3=0 m4=0 m5=0 t1=0 t2=0 t3=0 t4=0 s2=1 theta=1;
RUN;

The coefficients for the deviation scores, along with their standard errors, are remarkably close to the fixed effects estimates in Output 4.12. This suggests that the random effects hybrid approach may be superior to GEE in replicating fixed effects results (as we found for logistic models in chapter 3), but beware. While PROC GENMOD took about a fifth of a second to run the GEE model, PROC NLMIXED took seven minutes to estimate the random effects model. For both of these methods I could also have included time-invariant variables like SCIENCE. I did not do so in order to maximize comparability with the fixed effects results.

Table 4.17. Output 4.17 PROC NLMIXED Estimates of the Hybrid MODEL
Fit Statistics
−2 Log Likelihood9671.0
AIC (smaller is better)9709.0
AICC (smaller is better)9709.5
BIC (smaller is better)9780.9
Parameter Estimates
ParameterEstimateStandard ErrorDFtValuePr>|t|AlphaLowerUpperGradient
int0.69400.070573239.83<.00010.050.55520.83280.009595
d00.37450.068743235.45<.00010.050.23930.5097−0.00483
d1−0.082680.07341323−1.130.26090.05−0.22710.06175−0.00721
d20.059000.069763230.850.39840.05−0.078250.19620.002148
d30.0095200.064923230.150.88350.05−0.11820.13720.00983
d40.041120.061813230.670.50630.05−0.080470.16270.005421
d5−0.002240.05033323−0.040.96460.05−0.10120.09677−0.00302
m0−0.15430.7641323−0.200.84010.05−1.65761.3490−0.02112
m12.02261.57483231.280.20000.05−1.07565.12080.011452
m2−2.11811.7480323−1.210.22650.05−5.55701.32080.07705
m3−0.20771.5771323−0.130.89530.05−3.31042.8951−0.11503
m41.76951.27423231.390.16590.05−0.73734.27630.06146
m5−0.41680.5980323−0.700.48640.05−1.59330.7598−0.00998
t10.22850.028643237.98<.00010.050.17220.28490.006028
t20.17910.028243236.34<.00010.050.12360.23470.00154
t30.17390.027453236.34<.00010.050.11990.2279−0.00316
t40.069970.026593232.630.00890.050.017650.12230.009996
s20.75300.0675832311.14<.00010.050.62010.88600.004639
theta32.13443.33153239.65<.00010.0525.580238.68860.000046

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.117.142.128