Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 4
Tests on Error Component Models

The double dimensionality of panel data allows for much richer specifications than simple cross sections or time series. This is both a blessing and a curse, given how much more complicated the specification may become. In fact, all possible features from either cross sections or time series, like distance‐decaying correlation in – respectively – space or time, can coexist with individual (time), time‐(individual‐) invariant heterogeneity. Moreover, diagnostic tests will usually have a hard time distinguishing between different forms of persistence along the same dimension unless explicitly designed to take the “other” effect into account.

The specification problem of panel models is typically associated with the presence or absence of individual effects, i.e., with the need to account for unobserved heterogeneity. Given that in the vast majority of cases it will be inappropriate to rule out individual heterogeneity altogether, the related issue emerges of whether it is safe to assume that the latter is uncorrelated with the explanatory variables (and therefore to proceed in a random effects framework) or rather to proceed estimating out (transforming out) the individual effects in a fixed effects fashion. Hence, tests for individual effects under either of the two approaches and Hausman‐type tests for determining which one is appropriate are among the most popular diagnostic procedures in this field.

Next to the fundamental specification issues with individual effects , the remainder errors can in turn be correlated: either in time, in which case it will be crucial to distinguish time‐decaying persistence of idiosyncratic shocks from the time‐invariant persistence deriving from the presence of an individual effect; or in space, and then the issue becomes whether correlation simply descends from participating in the same cross section or, provided the data are referenced in some space (e.g., in geography), whether nearby observations are more correlated than distant ones.

For these reasons, a rich toolbox of diagnostic and specification testing procedures has been developed, which will be the subject of this chapter, presented roughly in the order given above up to the issue of cross‐sectional correlation. On the converse, spatial correlation proper will be the subject of a separate chapter.

4.1 Tests on Individual and/or Time Effects

In order to test whether either individual or time effects are present, two approaches are possible:

the first is to start from estimating said effects out (within model) and then perform a zero restriction test,
the second is to start from the OLS model and to infer about the presence of the effects drawing on the OLS residuals.

4.1.1 F Tests

The sum of squared residuals and the degrees of freedom for the within model are: and . Let the null hypothesis be the absence of individual effects so that the restricted model is pooled OLS where the sum of squared residuals and the degrees of freedom are, respectively, and . Under , the test statistic:

follows a Fisher‐Snedecor F with and degrees of freedom.

The test of the null hypothesis of no individual and time effects is obtained by using the two‐ways within model and the pooling model:

Finally, the test of the null hypothesis of, say, no time effects, but in the presence of individual effects is:

4.1.2 Breusch‐Pagan Tests

The Breusch and Pagan (1980) test is a Lagrange multipliers test based on the OLS residuals. It is based on the score vector , i.e., the vector of partial derivatives of the log‐likelihood function from the restricted model. The variance of the score vector is the information matrix:

We estimate a restricted model characterized by a parameter vector ; under , we have:

or, denoting by and the score and its estimated variance in the restricted model:

which is distributed as a where the degrees of freedom are equal to the number of restrictions. We'll first derive the test for the one‐way individual error component model, for which the log‐likelihood function is:

The gradient is then:

To derive the variance, we start by calculating the matrix of second derivatives :

To compute the expectation of this matrix, we note that and :

To compute the test statistic, we impose the null hypothesis: (absence of individual effects). In this case, the estimator for the parameters is OLS and that of is . The score and its estimated variance are then:

whose inverse is:

Finally, the test statistic is:

Or, replacing by :

which is asymptotically distributed as a with 1 degree of freedom.

The test of the time effect is likewise computed:

The Breusch‐Pagan test extends easily to the two‐ways error component model, as the statistic can be written as the sum of the two previous statistics:

and follows a with two degrees of freedom under the null hypothesis of no individual and time effects.

For unbalanced panels, the relevant statistics are:¹

These statistics present two problems. The first one is that the alternative hypothesis is that the effects' variance is non‐zero, i.e., strictly positive or negative; when a variance must be non‐negative. For the one‐way error component model, Honda (1985) and King and Wu (1997) proposed a one‐sided test based on the square root of the above statistic, which is then normally distributed. The Honda statistic is then and its 5% critical value is 1.64 (and likewise for the test of no time effects). For the two‐ways error components model, Honda (1985) proposed to use as Baltagi et al. (1992) and King and Wu (1997) use:

The second problem is due to the fact that or may be lower than 1. In this case, Baltagi et al. (1992), following Gourieroux et al. (1982), proposed to replace the statistic by 0. The modified statistic is then defined by:

which follows a mixed distribution:

Example 4‐1 F and LM tests – `RiceFarms` data set

A F test for the presence of individual effects is implemented in the function pFtest, which compares the nested models OLS and within. Under the null of no individual effects, the statistic is distributed as an F with degrees of freedom equal to the number of individuals minus 1 on the numerator and to the degrees of freedom of the within model on the denominator. We first estimate the relevant models:

 data("RiceFarms", package = "splm")
Rice <- pdata.frame(RiceFarms, index = "id")
rice.w <- plm(log(goutput) ˜ log(seed) + log(totlabor) + log(size), Rice)
rice.p <- update(rice.w, model = "pooling")
rice.wd <- plm(log(goutput) ˜ log(seed) + log(totlabor) + log(size), Rice,
               effect = "twoways")

and we then supply the testing function with two fitted models:

 pFtest(rice.w, rice.p)

  F test for individual effects

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
F = 1.7, df1 = 170, df2 = 850, p-value = 3e-06
alternative hypothesis: significant effects

The formula‐data syntax may also be used:

 pFtest(log(goutput) ˜ log(seed) + log(totlabor) + log(size), Rice)

Unsurprisingly, the absence of individual effects is strongly rejected.

To test the absence of individual and time effects, one would use:

 pFtest(rice.wd, rice.p)

  F test for twoways effects

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
F = 4.3, df1 = 180, df2 = 850, p-value <2e-16
alternative hypothesis: significant effects

 pFtest(log(goutput) ˜ log(seed) + log(totlabor) + log(size), Rice,
       effect = "twoways")

To test the absence of time effects allowing for the presence of individual effects, we compare the individual and the two‐ways effect within models:

 pFtest(rice.wd, rice.w)

  F test for twoways effects

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
F = 70, df1 = 5, df2 = 850, p-value <2e-16
alternative hypothesis: significant effects

Once more, the null hypothesis is very strongly rejected.

The Breusch and Pagan (1980) test can be computed using the function plmtest. The argument is either an OLS model or a formula‐data pair. By default, the Honda (1985) version is computed. The direction of the effects, as usual, is determined by the effect argument.

 plmtest(rice.p)

  Lagrange Multiplier Test - (Honda) for balanced
  panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
normal = 4.8, p-value = 7e-07
alternative hypothesis: significant effects
plmtest(log(goutput)˜log(seed)+log(totlabor)+log(size), Rice)

  Lagrange Multiplier Test - (Honda) for balanced
  panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
normal = 4.8, p-value = 7e-07
alternative hypothesis: significant effects
plmtest(rice.p, effect = "time")

  Lagrange Multiplier Test - time effects (Honda) for
  balanced panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
normal = 59, p-value <2e-16
alternative hypothesis: significant effects
plmtest(rice.p, effect = "twoways")

  Lagrange Multiplier Test - two-ways effects (Honda)
  for balanced panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
normal = 45, p-value <2e-16
alternative hypothesis: significant effects

The two useful extensions proposed by Baltagi et al. (1992) can be applied to test the existence of individual and time effects, setting the argument type to 'kw' or 'ghm' to use respectively the techniques proposed by King and Wu (1997) and Gourieroux et al. (1982).

 plmtest(rice.p, effect = "twoways", type = "kw")

  Lagrange Multiplier Test - two-ways effects (King
  and Wu) for balanced panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
normal = 59, p-value <2e-16
alternative hypothesis: significant effects
plmtest(rice.p, effect = "twoways", type = "ghm")

  Lagrange Multiplier Test - two-ways effects
  (Gourieroux, Holly and Monfort) for balanced panels

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
chibarsq = 3500, df0 = 0.00, df1 = 1.00, df2 = 2.00,
w0 = 0.25, w1 = 0.50, w2 = 0.25, p-value <2e-16
alternative hypothesis: significant effects

4.2 Tests for Correlated Effects

We have seen that if the model errors are not correlated with the explanatory variables, then both estimators, fixed as well as random effects, are consistent. To compare them, we keep assuming that the idiosyncratic component of the error term () is uncorrelated to the regressors. Two situations are then possible:

: the individual effects are not correlated with the explanatory variables; in this case, both estimators are consistent, but the random effects estimator is more efficient than the fixed effects.
: the individual effects are correlated with the explanatory variables; in this case, the fixed effects estimator, which estimates out the individual effects, is consistent. On the contrary, the random effects estimator is inconsistent because one component of the composite error, the individual effect, is correlated with the explanatory variables.

4.2.1 The Mundlak Approach

In order to clarify the relationship between the two estimators, Mundlak (1978) considered the following model:

with

The individual effects are therefore correlated with the explanatory variables, being they equal to the sum of a linear combination of the individual means of said variables and of an error term . The model to be estimated is then written, in matrix form, as:

The error term has the usual properties of the error components model, i.e., zero mean and a variance equal to:

The GLS model is estimated by applying OLS on the data transformed pre‐multiplying each variable by , with .

We then have , and . The GLS estimator is then written as:

Using the formula of the inverse of a partitioned matrix (see equation 2.19), we get:

and:

The fundamental result of Mundlak (1978) is therefore that, if one correctly accounts for the correlation between the error terms and the explanatory variables, the GLS estimator is the within estimator.

4.2.2 Hausman Test

This results also suggests a way to test for the presence of correlation; in fact, testing for no correlation corresponds to testing for: . Under , we have:

which is distributed as a with degrees of freedom. Well, we have and .

This test statistic is one version of the test proposed by Hausman (1978). The general principle consists in comparing two models and where:

under : and are both consistent, but is more efficient than ,
under : only is consistent.

The idea of the test is that if is true, then the estimated coefficients from the two models shall not diverge; under the alternative, they will. The test is therefore based on and Hausman showed that, under , the variance of this difference is simply: .

The most common version of this test is based on comparing the within and the GLS estimators. The difference between the two is: . Under the hypothesis of no correlation between errors and explanatory variables, we have . The variance of is:

To determine these variances and covariances, we write the two estimators as functions of the errors: and . We then have: , and . The variance of is then simply:

and the test statistic becomes:

which, under , is distributed as a with degrees of freedom.

4.2.3 Chamberlain's Approach

Chamberlain (1982) proposed a more general model than that of Mundlak (1978). In his model, the individual effects are not assumed to be a linear function of the means of the explanatory variables anymore, but of their values over the whole time period.

Denote the vector of length containing the values of the explanatory variables for the ‐th individual, and the matrix containing the values of explanatory variables for the observation periods for the ‐th individual. is a vector of length obtained by stacking the columns of . The model is then written as:

(4.1)

with:

(4.2)

Substituting (4.2) in (4.1), we get:

(4.3)

The parameter matrix , of dimension , contains two types of parameters:

the vector of parameters , which measure the marginal effect of the explanatory variables on the response,
the vector of parameters , measuring the marginal effect of the explanatory variables in each period on the individual effect.

The vector is only marginally interesting per se, but its estimation allows to consistently estimate . If and , the matrix takes the form:

We then have a system of equations containing the same explanatory variables . In this case, the generalized least squares estimator is the SUR estimator. The explanatory variables being the same across equations, this can be obtained simply by estimating each individual equation separately by OLS. If the assumptions of the fixed effects model hold, then the estimation of each column of the matrix will yield just about equal coefficients, with the exception of those situated on the diagonal.

4.2.3.1 Unconstrained Estimator

The coefficients of the row of are then:

More generally, we can write the estimator of in two different ways. The first consists in defining:

We have then:

In order to analyze the properties of this estimator, it is easier to consider the vector of coefficients obtained by stacking the rows of . Defining:

Denoting and substituting, in the last expression, by its expression in the “true” model:

The limiting distribution of is the same as that of:

with because .

As , the central limit theorem implies that:

If the error variance of is not correlated with , this matrix simplifies to:

Finally, if the errors are homoscedastic, we get an even simpler expression:

with .

An estimator of can be obtained considering the sample equivalent. Denoting by the estimation residuals, we get:

4.2.3.2 Constrained Estimator

In a second time, Chamberlain (1982) utilizes the asymptotic least squares estimator in order to obtain an estimator of the structural coefficients of the model, denoted by . These are made of the coefficients associated to the explanatory variables of equation 4.1) and of the coefficients from equation 4.2) concerning the individual effects. There are therefore structural coefficients, while the number of coefficients in the matrix is . The relation between the two coefficient vectors can be expressed as , being a matrix of dimensions .

The restricted model is obtained employing the method of asymptotic least squares, consisting in minimizing a quadratic form in the deviations between and :

The first‐order conditions for a minimum can be written as:

which yields the following estimator:

This estimator is consistent regardless of the weighting matrix employed. Just as with the generalized method of moments, the estimator is efficient if is the covariance matrix of the residuals' vector. If the hypotheses are verified (), the latter can be written as: .

4.2.3.3 Fixed Effects Models

If the hypotheses hold, the minimum of the objective function from the asymptotic least squares method is distributed as a with degrees of freedom equal to the difference in length between the two vectors and , i.e., . This enables the computation of a test of the restrictions on the coefficients implied by the fixed effects model.

Angrist and Newey (1991) have shown that these restrictions can more simply be tested using the results of artificial regressions of the fixed effect model's residuals of a specific period on the covariates for every period. Denoting the coefficient of determination of the artifactual regression of residuals of period , we have:

which follows a with if the underlying hypotheses of the within model are relevant.

Example 4‐2 Hausman test – `RiceFarms` data set

The Hausman test is performed with the phtest function, which can either take as arguments two estimated models (here: the within and the GLS) or use the formula – data interface:

 data("RiceFarms", package = "splm")
Rice <- pdata.frame(RiceFarms, index = "id")
rice.w <- plm(log(goutput) ˜ log(seed) + log(totlabor) + log(size), Rice)
rice.r <- update(rice.w, model = "random")
phtest(rice.w, rice.r)

  Hausman Test

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
chisq = 3.8, df = 3, p-value = 0.3
alternative hypothesis: one model is inconsistent

Under the hypothesis of no correlation between the regressors and the individual effects, the statistic is distributed as a with three degrees of freedom. This hypothesis, with a p‐value of 29%, is not rejected at the 5% confidence level. One could get the same result following the Mundlak (1978) approach, drawing on the difference between the within and between estimators:

 rice.b <- update(rice.w, model = "between")
cp <- intersect(names(coef(rice.b)), names(coef(rice.w)))
dcoef <- coef(rice.w)[cp] - coef(rice.b)[cp]
V <- vcov(rice.w)[cp, cp] + vcov(rice.b)[cp, cp]
as.numeric(t(dcoef) %*% solve(V) %*% dcoef)
[1] 3.773

This result is confirmed by the correlation coefficient between the individual effects (estimated by the fixed effects of the within model) and the individual means of the explanatory variable, which is obtained by applying the between function to the series.²

 cor(fixef(rice.w), between(log(Rice$goutput)))
[1] 0.448

The correlation is positive but moderate.

The Chamberlain test is available in the function piest. It is computed using the usual formula‐data interface.

 data("RiceFarms", package = "splm")
pdim(RiceFarms, index = "id")
Balanced Panel: n = 171, T = 6, N = 1026
piest(log(goutput) ˜ log(seed) + log(totlabor) + log(size),
      RiceFarms, index = "id")

  Chamberlain's pi test

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
chisq = 110, df = 87, p-value = 0.03

The variant of the Chamberlain test proposed by Angrist and Newey (1991) is available with the aneweytest function which uses the same interface.

 aneweytest(log(goutput) ˜ log(seed) + log(totlabor) + log(size),
           RiceFarms, index = "id")

  Angrist and Newey's test of within model

data:  log(goutput) ˜ log(seed) + log(totlabor) + log(size)
chisq = 140, df = 87, p-value = 2e-04

The restrictions implied by the within model are rejected by both tests at the 5% level, although they are not rejected at the 1% level for Chamberlain's version of the test.

4.3 Tests for Serial Correlation

A model with individual effects has composite errors that are serially correlated by definition. The presence of the time‐invariant error component gives rise to serial correlation that does not die out over time; thus standard tests applied on pooled data usually end up rejecting the null of spherical residuals. There may also be serial correlation of the time‐decaying kind in the idiosyncratic error terms, e.g., as an AR(1) process. By “testing for serial correlation” we mean testing for this latter kind of dependence.

For these reasons, the subjects of testing for individual error components and for serially correlated idiosyncratic errors are closely related. In particular, simple (marginal) tests for one direction of departure from the hypothesis of spherical errors usually have power against the other one: in case it is present, they are substantially biased toward rejection. Joint tests are correctly sized and have power against both directions but usually do not give any information about which one actually caused rejection. Conditional tests for serial correlation that take into account the error components are correctly sized under presence of both departures from sphericity and have power only against the alternative of interest. While most powerful if correctly specified, the latter, based on the likelihood framework, are crucially dependent on normality and homoscedasticity of the errors.

In plm a number of joint, marginal, and conditional ML‐based tests are provided, plus some semi‐parametric alternatives that are robust versus heteroscedasticity and free from distributional assumptions.

More tests can be obtained by comparing nested models, in a likelihood ratio test framework, or by restriction on a more general model, in the Wald test framework.

4.3.1 Unobserved Effects Test

The unobserved effects test à la Wooldridge (see Wooldridge, 2002, 10.4.4), is a semi‐parametric test for the null hypothesis that , i.e., that there are no unobserved effects in the residuals. Given that under the null, the covariance matrix of the residuals for each individual is diagonal, the test statistic is based on the average of elements in the upper (or lower) triangle of its estimate, diagonal excluded: (where are the pooled OLS residuals), which must be “statistically close” to zero under the null, scaled by its standard deviation:

This test is (‐) asymptotically distributed as a standard normal regardless of the distribution of the errors. It does also not rely on homoscedasticity.

It has power both against the standard random effects specification, where the unobserved effects are constant within every group, as well as against any kind of serial correlation. As such, it “nests” both individual effects and serial correlation tests, trading some power against more specific alternatives in exchange for robustness.

While not rejecting the null favors the use of pooled OLS, rejection may follow from serial correlation of different kinds, and in particular, quoting Wooldridge (2002, 10.4.4), “should not be interpreted as implying that the random effects error structure must be true”.

Example 4‐3 unobserved effects test – `RiceFarms` data set

Below, the test is applied to the rice farms data:

 data("RiceFarms", package="plm")
Rice <- pdata.frame(RiceFarms, index = "id")
fm <- log(goutput) ˜ log(seed) + log(totlabor) + log(size)
pwtest(fm, Rice)

  Wooldridge's test for unobserved individual effects

data:  formula
z = 2.9, p-value = 0.003
alternative hypothesis: unobserved effect

The null hypothesis of no unobserved effects is rejected.

4.3.2 Score Test of Serial Correlation and/or Individual Effects

The Wooldridge testing procedure will detect very general forms of persistence in the errors but give few directions toward a finer specification. If one is willing to make more specific parametric hypotheses, a maximum likelihood approach will allow to detect the features of persistence in a finer way.

The random effects model can be extended to having idiosyncratic errors that follow an autoregressive process of order 1 (AR(1)):

where the now familiar hypotheses of the random effects model apply, i.e., individual effects are independent of both the regressors and the idiosyncratic errors . Moreover, normality is assumed for both error components.

The specification analysis of the above model requires telling apart the time‐invariant individual effects from the time‐decaying persistence due to the AR(1) component. As observed, the presence of individual effects may affect tests for serial correlation and vice versa. Several alternative strategies can be used, based on the following tools:

a joint test, which has power against both alternatives,
marginal tests, testing the null hypothesis of no serial correlation while maintaining the hypothesis of no individual effects under both the null and the alternative hypothesis,
locally robust tests, i.e., marginal tests with a correction that makes them robust to local deviations from the maintained hypothesis,
conditional test, i.e., testing the null hypothesis of no serial correlation, the hypothesis of the presence of individual effects being maintained under both the null and the alternative hypothesis.

The advantage of the robust tests is that the unconstrained model (RE‐AR(1) for random effect model with first‐order auto‐regressive errors) need not be estimated.

Baltagi and Li (1991) and Baltagi and Li (1995) proposed a joint test of no serial correlation and no individual effects. The test statistic is:

with and , being the OLS residuals.

Baltagi and Li (1995) also proposed a marginal test of serial correlation, the maintained hypothesis being the absence of individual effects:

Symmetrically, the marginal test of individual effects, with the maintained hypothesis of no serial correlation is simply the Breusch‐Pagan test:

They also proposed a conditional test of serial correlation, the maintained hypothesis being the presence of individual effects. This latter test ( in the original paper) is based on the residuals of the random effect model estimated by the maximum likelihood method. Under the null of serially uncorrelated errors, the test turns out to be identical for both the alternative of AR(1) and MA(1) processes.

Bera et al. (2001) derive locally robust tests both for individual random effects and for first‐order serial correlation in residuals as “corrected” versions of the standard LM test and . They write respectively:

and

While still dependent on normality and homoscedasticity, these are robust to local departures from the hypotheses of, respectively, no serial correlation or no individual effects. Although suboptimal, these tests may help detecting the right direction of the departure from the null, thus complementing the use of joint tests. Moreover, being based on pooled OLS residuals, the BSY tests are computationally far less demanding than the conditional test of Baltagi and Li (1995).

On the other hand, the statistical properties of these locally corrected tests are inferior to those of the non‐corrected counterparts when the latter are correctly specified. If there is no serial correlation, then the optimal test for random effects is the likelihood‐based LM test of Breusch and Pagan (with refinements by Honda, see plmtest), while if there are no individual effects, the optimal test for serial correlation is the Breusch‐Godfrey test. If the presence of a random effect is taken for granted, then the optimal test for serial correlation is the likelihood‐based conditional LM test of Baltagi and Li (1995) (see pbltest).

Example 4‐4 LM tests for random effects and/or serial correlation – `RiceFarms` data set

From the LM test in a previous example, the Rice Farming data show evidence of individual effects; and from the Hausman test, the latter seem to comply with the hypothesis that they are uncorrelated with the covariates. We now investigate the presence of serial correlation in the context of the RE‐AR(1) model outlined above.

The joint LM test for random effects and serial correlation under normality and homoscedasticity of the idiosyncratic errors has been derived by Baltagi and Li (1991) and Baltagi and Li (1995) and is implemented as an option in pbsytest, by setting the test to 'J'. In the case of the rice farming model, the test strongly rejects. Rejection of the joint test, though, gives no information on the direction of the departure from the null hypothesis, i.e., is rejection due to the presence of serial correlation, of random effects, or of both?

Bera et al. (2001)'s locally robust tests can provide statistical evidence about the direction of misspecification in the (doubly) restricted model. In fact, the presence of the “other” effect (individual effects when testing for serial correlation and vice versa) will not influence the test statistic as long as the magnitude is moderate. How much of the “other” effect is tolerated before the test statistic becomes biased, however, is an empirical question and will be case‐specific, although the simulations in the original paper can provide a rough assessment.

The locally robust BSY tests for, respectively, serial correlation or individual effects are implemented in the function pbsytest, by setting the test argument to 'AR' (default) or 'RE'. The test for random effects is implemented in the one‐sided version, which takes into account that the variance of the random effect must be non‐negative.

 bsy.LM <- matrix(ncol=3, nrow = 2)
tests <- c("J", "RE", "AR")
dimnames(bsy.LM) <- list(c("LM test", "p-value"), tests)
for(i in tests) {
    mytest <- pbsytest(fm, data = Rice, test = i)
    bsy.LM[1:2, i] <- c(mytest$statistic, mytest$p. value)
    }
round(bsy.LM, 6)
            J     RE    AR
LM test 62.65 0.3351 39.23
p-value  0.00 0.3688  0.00

The robust tests allow us to discriminate between time‐invariant error persistence (random effects) and time‐decaying persistence (autoregressive errors), concluding in favor of the second.

Finally, the optimal conditional test of Baltagi and Li for serial correlation, allowing for random effects of any magnitude, is computed using the pbltest function, using the residuals of the random effects maximum likelihood estimator:

 pbltest(fm, Rice, alternative = "onesided")

Baltagi and Li one-sided LM test

data:  fm
z = 6.1, p-value = 6e-10
alternative hypothesis: AR(1)/MA(1) errors in RE panel model

Serial correlation is detected, i.e., we conclude that in the encompassing model.

4.3.3 Likelihood Ratio Tests for AR(1) and Individual Effects

Likelihood ratio (LR) tests for restrictions are based on the likelihoods from the general and the restricted model. The test statistic is simply twice the difference of the values of the log‐likelihood function:

where is the full vector of parameter estimates from the unrestricted model and from the restricted one, and is the number of restrictions.

A likelihood ratio test for serial correlation in the idiosyncratic residuals can be done, in general, as a nested models test comparing the model with spherical idiosyncratic residuals with the more general alternative featuring AR(1) residuals. If both estimated models allow for random effects, then the test will become conditional on the latter feature.

Thus,

and symmetrically for .

Example 4‐5 Likelihood ratio tests – `Grunfeld` data set

Maximum likelihood estimation of linear models with or without either random individual effects or serially correlated errors can be estimated, e.g., with functionality from the nlme package. In its notation, the RE specification is a model with only one random effects regressor: the intercept. Below we report coefficients of Grunfeld's model estimated by GLS and then by ML.

 library("nlme")
data(Grunfeld, package = "plm")
reGLS <- plm(inv ˜ value + capital, data = Grunfeld, model = "random")
reML <- lme(inv ˜ value + capital, data = Grunfeld, random = ˜1 | firm)
rbind(coef(reGLS), fixef(reML))
     (Intercept)  value capital
[1,]      -57.83 0.1098  0.3081
[2,]      -57.86 0.1098  0.3082

Linear models with groupwise structures of time dependence may be fitted by gls, specifying the correlation structure in the correlation option:

 lmAR1ML <- gls(inv ˜ value + capital, data = Grunfeld,
    correlation = corAR1(0, form = ˜ year | firm))

and analogously the random effects panel with, e.g., AR(1) errors (see Baltagi, 2001, Ch. 5) may be fit by lme specifying an additional random intercept:

 reAR1ML <- lme(inv ˜ value + capital, data = Grunfeld,
    random = ˜ 1 | firm, correlation = corAR1(0, form = ˜ year | firm))
summary(reAR1ML)
Linear mixed-effects model fit by REML
 Data: Grunfeld
   AIC  BIC logLik
  2095 2115  -1041

Random effects:
 Formula: ˜1 | firm
        (Intercept) Residual
StdDev:       78.04     72.8

Correlation Structure: AR(1)
 Formula: ˜year | firm
 Parameter estimate(s):
   Phi
0.8238
Fixed effects: inv ˜ value + capital
             Value Std.Error  DF t-value p-value
(Intercept) -40.28    30.694 188  -1.312  0.1911
value         0.09     0.008 188  11.770  0.0000
capital       0.31     0.032 188   9.737  0.0000
 Correlation:
        (Intr) value
value   -0.239
capital -0.280 -0.125

Standardized Within-Group Residuals:
     Min       Q1      Med       Q3      Max
-2.40759 -0.31847  0.04847  0.19863  3.30040

Number of Observations: 200
Number of Groups: 10

Let us compare either with the restricted alternative. The GLS model without correlation in the residuals is the same as OLS, and one could well use lm for the restricted model. Here we estimate it by gls.

 lmML <- gls(inv ˜ value + capital, data = Grunfeld)
anova(lmML, lmAR1ML)
        Model df  AIC  BIC logLik   Test L.Ratio p-value
lmML        1  4 2400 2413  -1196
lmAR1ML     2  5 2095 2111  -1042 1 vs 2   307.3  <.0001

The AR(1) test on the random effects model is to be done in much the same way, using the random effects model objects estimated above:

 anova(reML, reAR1ML)
        Model df  AIC  BIC logLik   Test L.Ratio p-value
reML        1  5 2206 2222  -1098
reAR1ML     2  6 2095 2114  -1041 1 vs 2     113  <.0001

A likelihood ratio test for random effects compares the specifications with and without random effects and spherical idiosyncratic errors:

 anova(lmML, reML)
     Model df  AIC  BIC logLik   Test L.Ratio p-value
lmML     1  4 2400 2413  -1196
reML     2  5 2206 2222  -1098 1 vs 2   196.4  <.0001

The random effects, AR(1) errors model in turn nests the AR(1) pooling model; therefore, a likelihood ratio test for random effects sub AR(1) errors may be carried out, again, by comparing the two autoregressive specifications:

 anova(lmAR1ML, reAR1ML)
        Model df  AIC  BIC logLik   Test L.Ratio p-value
lmAR1ML     1  5 2095 2111  -1042
reAR1ML     2  6 2095 2114  -1041 1 vs 2   2.134   0.144

whence we see that the Grunfeld model specification doesn't seem to need any random effects once we control for serial correlation in the data.

4.3.4 Applying Traditional Serial Correlation Tests to Panel Data

A general testing procedure for serial correlation in fixed effects (FE), random effects (RE), and pooled‐OLS panel models alike can be based on considerations in (Wooldridge, 2002, 10.7.2). For the random effects model, Wooldridge (2002) observes that under the null of homoscedasticity and no serial correlation in the idiosyncratic errors, the residuals from the quasi‐demeaned regression must be spherical as well. Else, as the individual effects are wiped out in the demeaning, any remaining serial correlation must be due to the idiosyncratic component. Hence, a simple way of testing for serial correlation is to apply a standard serial correlation test to the quasi‐demeaned model. The same applies in a pooled model, w.r.t.the original data.

The FE case is different. It is well known that if the original model's errors are uncorrelated, then FE residuals are negatively serially correlated, with for each (see Wooldridge, 2002, 10.5.4). This correlation clearly dies out as increases, so this kind of AR test is applicable to within model objects only for “sufficiently large”. Baltagi and Li (1995) derive a basically analogous ‐asymptotic test for first‐order serial correlation in a FE panel model as a Breusch‐Godfrey LM test on within residuals (see Baltagi and Li, 1995, par. 2.3 and formula 12). They also observe that the test on within residuals can be used for testing on the RE model, as “the within transformation wipes out the individual effects, whether fixed or random.” Generalizing the Durbin‐Watson test to FE models by applying it to fixed effects residuals is documented in Bhargava et al. (1982). On the converse, in short panels the test gets severely biased toward rejection (or, as the induced correlation is negative, toward acceptance in the case of the one‐sided Durbin‐Watson test with alternative set to 'greater'). See below for a serial correlation test applicable to “short” FE panel models.

Example 4‐6 Breusch‐Godfrey and Durbin Watson tests – `RiceFarms` data set

The functions pbgtest and pdwtest re‐estimate the relevant quasi‐demeaned model by OLS and apply, respectively, standard Breusch‐Godfrey and Durbin‐Watson tests from package lmtest to the residuals:

 rice.re <- plm(fm, Rice, model='random')
pbgtest(rice.re, order = 2)

Breusch-Godfrey/Wooldridge test for serial
correlation in panel models

data:  fm
chisq = 36, df = 2, p-value = 2e-08
alternative hypothesis: serial correlation in idiosyncratic errors
pdwtest(rice.re, order = 2)

Durbin-Watson test for serial correlation in panel
models

data:  fm
DW = 1.7, p-value = 5e-07
alternative hypothesis: serial correlation in idiosyncratic errors

The tests share the features of their OLS counterparts, in particular the pbgtest allows testing for higher‐order serial correlation, which can be of particular interest for quarterly data. As the functions are simple wrappers toward bgtest and dwtest, all arguments from the latter two apply and may be passed on through the ‘…’ operator.

As observed above, applying the pbgtest and pdwtest functions to an FE model is appropriate only if the time dimension is long enough. In the frequent case of ”short” panels, one of the two testing procedures due to Wooldridge (2002) and described in the next section should be used instead.

4.3.5 Wald Tests for Serial Correlation using within and First‐differenced Estimators

4.3.5.1 Wooldridge's within‐based Test

Due to the demeaning procedure, under the null of no serial correlation in the errors, the residuals of an FE model must be negatively serially correlated, with for each . Wooldridge suggests basing a test for this null hypothesis on a pooled regression of FE residuals on their first lag:

Rejecting the restriction makes us conclude against the original null of no serial correlation.

The function carrying out this procedure estimates the FE model, retrieves residuals, then estimates an auxiliary (pooled) AR(1) model, and tests the above‐mentioned restriction on . Internally, a heteroscedasticity‐ and autocorrelation‐consistent covariance matrix (vcovHC, see next chapter) is used, as originally prescribed. The test is applicable to any FE panel model and in particular to “short” panels with small and large .

Example 4‐7 serial correlation tests for fixed effects models – `EmplUK` data set

In the following example, Wooldridge's within‐based serial correlation test is applied to the EmplUK data:

 data("EmplUK", package = "plm")
pwartest(log(emp) ˜ log(wage) + log(capital), data = EmplUK)

Wooldridge's test for serial correlation in FE
panels

data:  plm.model
F = 310, df1 = 1, df2 = 890, p-value <2e-16
alternative hypothesis: serial correlation

We strongly reject the null of no serial correlation. If the evidence of persistence is too strong, one should wonder whether the residuals are stationary at all, and whether a specification in differences might be preferable. In the next section we will see a similar test that can be seen as a specification device in this sense.

4.3.5.2 Wooldridge's First‐difference‐based Test

In the context of the first difference model, Wooldridge (2002, 10.6.3) proposes a serial correlation test that can also be seen as a specification test to choose the most efficient estimator between fixed effects (within) and first difference (FD).

The starting point is the observation that if the idiosyncratic errors of the original model are uncorrelated, the errors of the (first) differenced model will be correlated, with , while any time‐invariant effect is wiped out in the differencing. So a serial correlation test for models with individual effects of any kind can be based on estimating the auxiliary model

and testing the restriction , corresponding to the null of no serial correlation in the original model. Drukker (2003) provides Monte Carlo evidence of the good empirical properties of the test.

On the other extreme (see Wooldridge, 2002, 10.6.1), if the differenced errors are uncorrelated, then is a random walk. In this latter case, the most efficient estimator is the first difference (FD) one; in the former case, it is the fixed effects one (within).

Example 4‐8 Wooldridge's first difference test – `EmplUK` data set

We apply the test in the context of the EmplUK data, a large and short panel with strong individual heterogeneity. We want to test for serial correlation in both within and first‐differenced errors. The function pwfdtest allows testing either hypothesis: the default behavior h0='fd' is to test for serial correlation in first‐differenced errors:

   pwfdtest(log(emp) ˜ log(wage) + log(capital), data = EmplUK)

Wooldridge's first-difference test for serial
correlation in panels

data:  plm.model
F = 0.93, df1 = 1, df2 = 750, p-value = 0.3
alternative hypothesis: serial correlation in differenced errors

while specifying h0= 'fe' the null hypothesis becomes no serial correlation in original errors, which is similar to the pwartest.

 pwfdtest(log(emp) ˜ log(wage) + log(capital), data = EmplUK,
    h0 = "fe")

Wooldridge's first-difference test for serial
correlation in panels

data:  plm.model
F = 130, df1 = 1, df2 = 750, p-value <2e-16
alternative hypothesis: serial correlation in original errors

Not rejecting one of the two is evidence in favor of using the estimator corresponding to h0. In this case, the original residuals show evidence of serial correlation, which disappears after first differencing. The results point at a unit root in the errors.

Example 4‐9 Wooldridge's first difference test – `RiceFarms` data set

Results will not always be so clear‐cut. For the RiceFarms example,

 W.fd <- matrix(ncol = 2, nrow =2)
H0 <- c("fd", "fe")
dimnames(W.fd) <- list(c("test", "p-value"), H0)
for(i in H0) {
    mytest <- pwfdtest(fm, Rice, h0 = i)
    W.fd[1, i] <- mytest$statistic
    W.fd[2, i] <- mytest$p. value
    }
round(W.fd, 6)
           fd        fe
test    176.4 19.492371
p-value   0.0  0.000012

The truth clearly lies in the middle (both rejected, although one more strongly than the other); in this case, whichever estimator is chosen will have serially correlated errors: therefore it will be advisable to use an autocorrelation‐robust covariance matrix.

4.4 Tests for Cross‐sectional Dependence

Next to the more familiar issue of serial correlation, a growing body of literature has been dealing with cross‐sectional dependence in panels, which can arise, e.g., if individuals respond to common shocks (as in common factor models) or if spatial diffusion processes are present, relating individuals in a way depending on a measure of distance (as in spatial models).

If cross‐sectional dependence is present, the consequence is, at a minimum, inefficiency of the usual estimators and invalid inference when using the standard covariance matrix. This is the case, for example, in unobserved effects models when cross‐sectional dependence is due to an unobservable factor structure but with factors uncorrelated with the regressors. In this case the within or RE are still consistent, although inefficient (see De Hoyos and Sarafidis, 2006). If the unobserved factors are correlated with the regressors, which can seldom be ruled out, consequences are more serious: the estimators will be inconsistent.

4.4.1 Pairwise Correlation Coefficients

Correlation in the cross‐section can take very diverse shapes. The most common testing procedures are based on considering the population of all possible pairwise correlations between pairs of distinct individual units, estimating each one independently, by exploiting the time dimension of the data, and then calculating some synthetic measure or test statistic. The basic tool for assessing pairwise correlation between individual units and for a double‐indexed vector is the product‐moment correlation coefficient, defined as

A descriptive assessment of the degree of cross‐sectional correlation in the given sample can then be based on the average of individual correlation coefficients: . If individual correlations are some positive and some negative, this solution has the problem that coefficients with different signs compensate, yielding a statistic that underestimates the true level of dependence in the data. Therefore, another common procedure is to average the absolute values of individual coefficients: .

4.4.2 CD‐type Tests for Cross‐sectional Dependence

A number of statistics for testing the null hypothesis of no cross‐sectional dependence in model errors can be based on . The function pcdtest implements both the calculation of and , and a family of cross‐sectional dependence tests that can be applied in different settings, ranging from those where grows large with fixed to “short” panels with a big dimension and a few time periods. All are based on (transformations of) the product‐moment correlation coefficient of a model's residuals, defined as above. The Breusch‐Pagan LM test, based on the squares of , is valid for with fixed:

where in the case of an unbalanced panel only pairwise complete observations are considered, and with being the number of observations for individual ; else, if the panel is balanced, for each . The test is distributed as . It is inappropriate whenever the dimension is “large.” A scaled version, applicable also if and then (as in some pooled time series contexts), is defined as:

and distributed as a standard normal.

Pesaran's (2004) CD test

based on without squaring (also distributed as a standard Normal) is appropriate both for ‐ and ‐asymptotics. It has good properties in samples of any practically relevant size and is robust to a variety of settings. The only big drawback is that the test loses power against the alternative of cross‐sectional dependence if the average correlation is zero, even if individual coefficients are non‐zero. Such a situation is not uncommon and can arise for example in the presence of an unobserved factor structure with factor loadings averaging zero, that is, where some units react positively to common shocks, others negatively. Another case where the test will lose power is if the data are cross‐sectionally demeaned, or when the model contains time‐specific dummies (see Sarafidis and Wansbeek, 2012, p. 27). In these instances, the absolute correlation coefficient is likely to turn out much bigger than .

Example 4‐10 tests for cross‐sectional dependence – `RDSpillovers` data set

Eberhardt et al. (2013) consider the returns of own research and development (R&D) in the production function of European firms. They account for common factors and spillover effects; they find evidence that when controlling for such features, the effect of own R&D is not significant any more and conclude that the value of R&D derives from a complex mix of own sources and spillovers from other firms. They estimate various specifications of a standard production function (where output is a function of labor and capital) augmented with R&D expenditure.

 data("RDSpillovers", package = "pder")
fm.rds <- lny ˜ lnl + lnk + lnrd

Pairwise‐correlations‐based tests are originally meant to use the residuals of separate estimation of one time‐series regression for each cross‐sectional unit. In the original example of Eberhardt et al. (2013), this is done in the first of the heterogeneous static specifications of Table 7, the mean groups model. This is also the default behavior of pcdtest.³ The default version of the test is 'cd', which is appropriate in a large panel setting like this.

 pcdtest(fm.rds, RDSpillovers)

Pesaran CD test for cross-sectional dependence in
panels

data:  lny ˜ lnl + lnk + lnrd
z = 29, p-value <2e-16
alternative hypothesis: cross-sectional dependence

The residuals from separate time series regressions show strong evidence of error cross‐sectional dependence.

If a different model specification (within, RE, ...) is assumed consistent, one can resort to its residuals for testing⁴ by specifying the relevant model type. The main argument of this function may be either plm or a formula and a data.frame; in the second case, unless model is set to NULL, all usual parameters relative to the estimation of a plm model may be passed on. The test is compatible with any consistent plm model for the data at hand, with any specification of effect; e.g., specifying effect = 'time' or effect = 'twoways' allows to test for residual cross‐sectional dependence after the introduction of time fixed effects to account for common shocks. Let us consider the static two‐way fixed effects specification in Eberhardt et al. (2013, Table 5):

 rds.2fe <- plm(fm.rds, RDSpillovers, model = "within", effect = "twoways")
pcdtest(rds.2fe)

Pesaran CD test for cross-sectional dependence in
panels

data:  lny ˜ lnl + lnk + lnrd
z = -1.5, p-value = 0.1
alternative hypothesis: cross-sectional dependence

As observed, the test loses its power if time fixed effects are included in the model specification. Now the test does not reject the hypothesis of no cross‐sectional correlation. One can get an idea of what is happening by comparing and :

 cbind("rho"  = pcdtest(rds.2fe, test = "rho")$statistic,
      "|rho|"= pcdtest(rds.2fe, test = "absrho")$statistic)
          rho  |rho|
rho -0.004879 0.5021

whence it can be seen how substantial cross‐sectional dependence is present, but the addition of time effects has centered the mean of correlation coefficients on zero so that positive and negative s compensate.

4.4.3 Testing Cross‐sectional Dependence in a pseries

Next to testing for cross‐sectional correlation in model residuals, tests in the CD family can be employed in preliminary statistical assessments as well, in order to determine whether the dependent and explanatory variables show any correlation to begin with. To this end, the pcdtest function has a pseries method, meaning that it can be fed a pseries object as well. One can either calculate the descriptive statistics and or resort to a formal test.

Example 4‐11 cross‐sectional dependence test for a pseries – `HousePricesUS` data set

Holly et al. (2010) analyze changes in real house prices in 49 US states between 1975 and 2003 to assess to which extent they are driven by fundamentals like disposable per capita income, net borrowing costs, and population growth (see also the full replication in Millo, 2015). The empirical analysis proceeds from the initial assessment of spatial dependence of the variables; here we reproduce the cross‐sectional correlation assessment for the dependent variable, the house price index (1980=100), taken in first differences of logs.

 data("HousePricesUS", package = "pder")
php <- pdata.frame(HousePricesUS)

 cbind("rho"   = pcdtest(diff(log(php$price)), test = "rho")$statistic,
      "|rho|" = pcdtest(diff(log(php$price)), test = "absrho")$statistic)
       rho  |rho|
rho 0.3942 0.4247

The overall averages of and are quite large in magnitude and very close to each other, indicating substantial positive correlation.

To investigate whether this behavior is geographically uniform or not, one can drill down to the regional level. within and between regions correlation tables can be constructed by means of the pcdtest function, setting test = 'rho' for the average correlation coefficient. A function cortab is provided that automates this procedure, constructing suitable matrices from the provided grouping index and calculating (default) or (if test = 'absrho') for each region (main diagonal) and each pair of regions (off‐diagonal).

 regions.names <- c("New Engl", "Mideast", "Southeast", "Great Lks",
                   "Plains", "Southwest", "Rocky Mnt", "Far West")
corr.table.hp <- cortab(diff(log(php$price)), grouping = php$region,
                        groupnames = regions.names)
colnames(corr.table.hp) <- substr(rownames(corr.table.hp), 1, 5)
round(corr.table.hp, 2)
          New E Midea South Great Plain South Rocky Far W
New Engl   0.80    NA    NA    NA    NA    NA    NA    NA
Mideast    0.68  0.66    NA    NA    NA    NA    NA    NA
Southeast  0.40  0.35  0.81    NA    NA    NA    NA    NA
Great Lks  0.27  0.20  0.62  0.61    NA    NA    NA    NA
Plains     0.40  0.32  0.57  0.53  0.52    NA    NA    NA
Southwest  0.07 -0.05  0.28  0.39  0.35  0.52    NA    NA
Rocky Mnt -0.03 -0.11  0.52  0.53  0.40  0.57  0.70    NA
Far West   0.13  0.17  0.52  0.42  0.29  0.31  0.46  0.57

The preliminary spatial dependence analysis highlights the correlation between neighboring regions and also some cases of correlation with distant ones, as is the case for California and some more developed states on the East Coast. According to the authors, this is evidence of factor‐related dependence: common shocks to technology stimulate growth in the most advanced states irrespective of geographic proximity.

The significance of cross‐sectional correlation in a pseries can also be assessed through a formal test, exactly as done above for model residuals. Given that, beside stationarity (which in our example was ensured by first differencing the data), the properties of the CD test rest on the hypothesis of no serial correlation, we follow Pesaran (2004)'s suggestion to remove any by specifying a univariate AR(2) model of the variable of interest and proceed testing the residuals of the latter for cross‐sectional dependence. This is made easy by the lagging functionalities of plm. In the following, we test cross‐sectional correlation in log house prices drawing on the residuals of an AR(2) model in order to control for any persistence in the data:

 pcdtest(diff(log(price)) ˜ diff(lag(log(price))) + diff(lag(log(price), 2)),
        data = php)

Pesaran CD test for cross-sectional dependence in
panels

data:  diff(log(price)) ˜ diff(lag(log(price))) + diff(lag(log(price),     2))
z = 59, p-value <2e-16
alternative hypothesis: cross-sectional dependence

The test strongly rejects the null hypothesis, confirming substantial cross‐sectional comovement in house prices.

Notes

2 Note that we use between and not Between, the latter returning a vector of the same length as the series with the individual means repeated

times.

3 If the time dimension is insufficient and model=NULL, the function defaults to estimation of a within model and issues a warning.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.