Chapter 4
Tests on Error Component Models

The double dimensionality of panel data allows for much richer specifications than simple cross sections or time series. This is both a blessing and a curse, given how much more complicated the specification may become. In fact, all possible features from either cross sections or time series, like distance‐decaying correlation in – respectively – space or time, can coexist with individual (time), time‐(individual‐) invariant heterogeneity. Moreover, diagnostic tests will usually have a hard time distinguishing between different forms of persistence along the same dimension unless explicitly designed to take the “other” effect into account.

The specification problem of panel models is typically associated with the presence or absence of individual effects, i.e., with the need to account for unobserved heterogeneity. Given that in the vast majority of cases it will be inappropriate to rule out individual heterogeneity altogether, the related issue emerges of whether it is safe to assume that the latter is uncorrelated with the explanatory variables (and therefore to proceed in a random effects framework) or rather to proceed estimating out (transforming out) the individual effects in a fixed effects fashion. Hence, tests for individual effects under either of the two approaches and Hausman‐type tests for determining which one is appropriate are among the most popular diagnostic procedures in this field.

Next to the fundamental specification issues with individual effects , the remainder errors can in turn be correlated: either in time, in which case it will be crucial to distinguish time‐decaying persistence of idiosyncratic shocks from the time‐invariant persistence deriving from the presence of an individual effect; or in space, and then the issue becomes whether correlation simply descends from participating in the same cross section or, provided the data are referenced in some space (e.g., in geography), whether nearby observations are more correlated than distant ones.

For these reasons, a rich toolbox of diagnostic and specification testing procedures has been developed, which will be the subject of this chapter, presented roughly in the order given above up to the issue of cross‐sectional correlation. On the converse, spatial correlation proper will be the subject of a separate chapter.

4.1 Tests on Individual and/or Time Effects

In order to test whether either individual or time effects are present, two approaches are possible:

  • the first is to start from estimating said effects out (within model) and then perform a zero restriction test,
  • the second is to start from the OLS model and to infer about the presence of the effects drawing on the OLS residuals.

4.1.1 F Tests

The sum of squared residuals and the degrees of freedom for the within model are: images and images. Let the null hypothesis be the absence of individual effects so that the restricted model is pooled OLS where the sum of squared residuals and the degrees of freedom are, respectively, images and images. Under images, the test statistic:

equation

follows a Fisher‐Snedecor F with images and images degrees of freedom.

The test of the null hypothesis of no individual and time effects is obtained by using the two‐ways within model and the pooling model:

equation

Finally, the test of the null hypothesis of, say, no time effects, but in the presence of individual effects is:

equation

4.1.2 Breusch‐Pagan Tests

The Breusch and Pagan (1980) test is a Lagrange multipliers test based on the OLS residuals. It is based on the score vector images, i.e., the vector of partial derivatives of the log‐likelihood function from the restricted model. The variance of the score vector is the information matrix:

equation

We estimate a restricted model characterized by a parameter vector images; under images, we have:

equation

or, denoting by images and images the score and its estimated variance in the restricted model:

equation

which is distributed as a images where the degrees of freedom are equal to the number of restrictions. We'll first derive the test for the one‐way individual error component model, for which the log‐likelihood function is:

equation

The gradient is then:

equation

To derive the variance, we start by calculating the matrix of second derivatives images:

equation

To compute the expectation of this matrix, we note that images and images:

equation

To compute the test statistic, we impose the null hypothesis: images (absence of individual effects). In this case, the estimator for the parameters is OLS and that of images is images. The score and its estimated variance are then:

equation

whose inverse is:

equation

Finally, the test statistic is:

equation

Or, replacing images by images:

equation

which is asymptotically distributed as a images with 1 degree of freedom.

The test of the time effect is likewise computed:

equation

The Breusch‐Pagan test extends easily to the two‐ways error component model, as the statistic can be written as the sum of the two previous statistics:

equation

and follows a images with two degrees of freedom under the null hypothesis of no individual and time effects.

For unbalanced panels, the relevant statistics are:1

equation

These statistics present two problems. The first one is that the alternative hypothesis is that the effects' variance is non‐zero, i.e., strictly positive or negative; when a variance must be non‐negative. For the one‐way error component model, Honda (1985) and King and Wu (1997) proposed a one‐sided test based on the square root of the above statistic, which is then normally distributed. The Honda statistic is then images and its 5% critical value is 1.64 (and likewise for the test of no time effects). For the two‐ways error components model, Honda (1985) proposed to use images as Baltagi et al. (1992) and King and Wu (1997) use:

equation

The second problem is due to the fact that images or images may be lower than 1. In this case, Baltagi et al. (1992), following Gourieroux et al. (1982), proposed to replace the statistic by 0. The modified statistic is then defined by:

equation

which follows a mixed images distribution: images

4.2 Tests for Correlated Effects

We have seen that if the model errors are not correlated with the explanatory variables, then both estimators, fixed as well as random effects, are consistent. To compare them, we keep assuming that the idiosyncratic component of the error term (images) is uncorrelated to the regressors. Two situations are then possible:

  • images: the individual effects are not correlated with the explanatory variables; in this case, both estimators are consistent, but the random effects estimator is more efficient than the fixed effects.
  • images: the individual effects are correlated with the explanatory variables; in this case, the fixed effects estimator, which estimates out the individual effects, is consistent. On the contrary, the random effects estimator is inconsistent because one component of the composite error, the individual effect, is correlated with the explanatory variables.

4.2.1 The Mundlak Approach

In order to clarify the relationship between the two estimators, Mundlak (1978) considered the following model:

equation

with

equation

The individual effects are therefore correlated with the explanatory variables, being they equal to the sum of a linear combination of the individual means of said variables and of an error term images. The model to be estimated is then written, in matrix form, as:

equation

The error term images has the usual properties of the error components model, i.e., zero mean and a variance equal to:

equation

The GLS model is estimated by applying OLS on the data transformed pre‐multiplying each variable by images, with images.

We then have images, images and images. The GLS estimator is then written as:

equation

Using the formula of the inverse of a partitioned matrix (see equation 2.19), we get:

equation
equation

and:

equation

The fundamental result of Mundlak (1978) is therefore that, if one correctly accounts for the correlation between the error terms and the explanatory variables, the GLS estimator is the within estimator.

4.2.2 Hausman Test

This results also suggests a way to test for the presence of correlation; in fact, testing for no correlation corresponds to testing for: images. Under images, we have:

equation

which is distributed as a images with images degrees of freedom. Well, we have images and images.

This test statistic is one version of the test proposed by Hausman (1978). The general principle consists in comparing two models images and images where:

  • under images: images and images are both consistent, but images is more efficient than images,
  • under images: only images is consistent.

The idea of the test is that if images is true, then the estimated coefficients from the two models shall not diverge; under the alternative, they will. The test is therefore based on images and Hausman showed that, under images, the variance of this difference is simply: images.

The most common version of this test is based on comparing the within and the GLS estimators. The difference between the two is: images. Under the hypothesis of no correlation between errors and explanatory variables, we have images. The variance of images is:

equation

To determine these variances and covariances, we write the two estimators as functions of the errors: images and images. We then have: images, images and images. The variance of images is then simply:

equation

and the test statistic becomes:

equation

which, under images, is distributed as a images with images degrees of freedom.

4.2.3 Chamberlain's Approach

Chamberlain (1982) proposed a more general model than that of Mundlak (1978). In his model, the individual effects are not assumed to be a linear function of the means of the explanatory variables anymore, but of their values over the whole time period.

Denote images the vector of length images containing the values of the explanatory variables for the images‐th individual, and images the images matrix containing the values of images explanatory variables for the images observation periods for the images‐th individual. images is a vector of length images obtained by stacking the columns of images. The model is then written as:

with:

Substituting (4.2) in (4.1), we get:

(4.3)equation

The parameter matrix images, of dimension images, contains two types of parameters:

  • the vector of parameters images, which measure the marginal effect of the explanatory variables on the response,
  • the vector of parameters images, measuring the marginal effect of the explanatory variables in each period on the individual effect.

The images vector is only marginally interesting per se, but its estimation allows to consistently estimate images. If images and images, the images matrix takes the form:

equation

We then have a system of images equations containing the same explanatory variables images. In this case, the generalized least squares estimator is the SUR estimator. The explanatory variables being the same across equations, this can be obtained simply by estimating each individual equation separately by OLS. If the assumptions of the fixed effects model hold, then the estimation of each column of the images matrix will yield just about equal coefficients, with the exception of those situated on the diagonal.

4.2.3.1 Unconstrained Estimator

The coefficients of the images row of images are then:

equation

More generally, we can write the estimator of images in two different ways. The first consists in defining:

equation

We have then:

equation

In order to analyze the properties of this estimator, it is easier to consider the vector of coefficients images obtained by stacking the rows of images. Defining:

equation
equation

Denoting images and substituting, in the last expression, images by its expression in the “true” model:

equation

The limiting distribution of images is the same as that of:

equation

with images because images.

As images, the central limit theorem implies that:

equation
equation

If the error variance of images is not correlated with images, this matrix simplifies to:

equation

Finally, if the errors are homoscedastic, we get an even simpler expression:

equation

with images.

An estimator of images can be obtained considering the sample equivalent. Denoting by images the estimation residuals, we get:

equation

4.2.3.2 Constrained Estimator

In a second time, Chamberlain (1982) utilizes the asymptotic least squares estimator in order to obtain an estimator of the structural coefficients of the model, denoted by images. These are made of the images coefficients associated to the explanatory variables of equation 4.1) and of the images coefficients from equation 4.2) concerning the individual effects. There are therefore images structural coefficients, while the number of coefficients in the matrix images is images. The relation between the two coefficient vectors can be expressed as images, images being a matrix of dimensions images.

The restricted model is obtained employing the method of asymptotic least squares, consisting in minimizing a quadratic form in the deviations between images and images:

equation

The first‐order conditions for a minimum can be written as:

equation

which yields the following estimator:

equation

This estimator is consistent regardless of the weighting matrix employed. Just as with the generalized method of moments, the estimator is efficient if images is the covariance matrix of the residuals' vector. If the hypotheses are verified (images), the latter can be written as: images.

4.2.3.3 Fixed Effects Models

If the hypotheses images hold, the minimum of the objective function from the asymptotic least squares method is distributed as a images with degrees of freedom equal to the difference in length between the two vectors images and images, i.e., images. This enables the computation of a test of the restrictions on the coefficients implied by the fixed effects model.

Angrist and Newey (1991) have shown that these restrictions can more simply be tested using the results of images artificial regressions of the fixed effect model's residuals of a specific period on the covariates for every period. Denoting images the coefficient of determination of the artifactual regression of residuals of period images, we have:

equation

which follows a images with images if the underlying hypotheses of the within model are relevant.

4.3 Tests for Serial Correlation

A model with individual effects has composite errors that are serially correlated by definition. The presence of the time‐invariant error component gives rise to serial correlation that does not die out over time; thus standard tests applied on pooled data usually end up rejecting the null of spherical residuals. There may also be serial correlation of the time‐decaying kind in the idiosyncratic error terms, e.g., as an AR(1) process. By “testing for serial correlation” we mean testing for this latter kind of dependence.

For these reasons, the subjects of testing for individual error components and for serially correlated idiosyncratic errors are closely related. In particular, simple (marginal) tests for one direction of departure from the hypothesis of spherical errors usually have power against the other one: in case it is present, they are substantially biased toward rejection. Joint tests are correctly sized and have power against both directions but usually do not give any information about which one actually caused rejection. Conditional tests for serial correlation that take into account the error components are correctly sized under presence of both departures from sphericity and have power only against the alternative of interest. While most powerful if correctly specified, the latter, based on the likelihood framework, are crucially dependent on normality and homoscedasticity of the errors.

In plm a number of joint, marginal, and conditional ML‐based tests are provided, plus some semi‐parametric alternatives that are robust versus heteroscedasticity and free from distributional assumptions.

More tests can be obtained by comparing nested models, in a likelihood ratio test framework, or by restriction on a more general model, in the Wald test framework.

4.3.1 Unobserved Effects Test

The unobserved effects test à la Wooldridge (see Wooldridge, 2002, 10.4.4), is a semi‐parametric test for the null hypothesis that images, i.e., that there are no unobserved effects in the residuals. Given that under the null, the covariance matrix of the residuals for each individual is diagonal, the test statistic is based on the average of elements in the upper (or lower) triangle of its estimate, diagonal excluded: images (where images are the pooled OLS residuals), which must be “statistically close” to zero under the null, scaled by its standard deviation:

equation

This test is (images‐) asymptotically distributed as a standard normal regardless of the distribution of the errors. It does also not rely on homoscedasticity.

It has power both against the standard random effects specification, where the unobserved effects are constant within every group, as well as against any kind of serial correlation. As such, it “nests” both individual effects and serial correlation tests, trading some power against more specific alternatives in exchange for robustness.

While not rejecting the null favors the use of pooled OLS, rejection may follow from serial correlation of different kinds, and in particular, quoting Wooldridge (2002, 10.4.4), “should not be interpreted as implying that the random effects error structure must be true”.

4.3.2 Score Test of Serial Correlation and/or Individual Effects

The Wooldridge testing procedure will detect very general forms of persistence in the errors but give few directions toward a finer specification. If one is willing to make more specific parametric hypotheses, a maximum likelihood approach will allow to detect the features of persistence in a finer way.

The random effects model can be extended to having idiosyncratic errors that follow an autoregressive process of order 1 (AR(1)):

equation

where the now familiar hypotheses of the random effects model apply, i.e., individual effects images are independent of both the regressors and the idiosyncratic errors images. Moreover, normality is assumed for both error components.

The specification analysis of the above model requires telling apart the time‐invariant individual effects from the time‐decaying persistence due to the AR(1) component. As observed, the presence of individual effects may affect tests for serial correlation and vice versa. Several alternative strategies can be used, based on the following tools:

  • a joint test, which has power against both alternatives,
  • marginal tests, testing the null hypothesis of no serial correlation while maintaining the hypothesis of no individual effects under both the null and the alternative hypothesis,
  • locally robust tests, i.e., marginal tests with a correction that makes them robust to local deviations from the maintained hypothesis,
  • conditional test, i.e., testing the null hypothesis of no serial correlation, the hypothesis of the presence of individual effects being maintained under both the null and the alternative hypothesis.

The advantage of the robust tests is that the unconstrained model (REAR(1) for random effect model with first‐order auto‐regressive errors) need not be estimated.

Baltagi and Li (1991) and Baltagi and Li (1995) proposed a joint test of no serial correlation and no individual effects. The test statistic is:

equation

with images and images, images being the OLS residuals.

Baltagi and Li (1995) also proposed a marginal test of serial correlation, the maintained hypothesis being the absence of individual effects:

equation

Symmetrically, the marginal test of individual effects, with the maintained hypothesis of no serial correlation is simply the Breusch‐Pagan test:

equation

They also proposed a conditional test of serial correlation, the maintained hypothesis being the presence of individual effects. This latter test (images in the original paper) is based on the residuals of the random effect model estimated by the maximum likelihood method. Under the null of serially uncorrelated errors, the test turns out to be identical for both the alternative of AR(1) and MA(1) processes.

Bera et al. (2001) derive locally robust tests both for individual random effects and for first‐order serial correlation in residuals as “corrected” versions of the standard LM test images and images. They write respectively:

equation

and

equation

While still dependent on normality and homoscedasticity, these are robust to local departures from the hypotheses of, respectively, no serial correlation or no individual effects. Although suboptimal, these tests may help detecting the right direction of the departure from the null, thus complementing the use of joint tests. Moreover, being based on pooled OLS residuals, the BSY tests are computationally far less demanding than the conditional test of Baltagi and Li (1995).

On the other hand, the statistical properties of these locally corrected tests are inferior to those of the non‐corrected counterparts when the latter are correctly specified. If there is no serial correlation, then the optimal test for random effects is the likelihood‐based LM test of Breusch and Pagan (with refinements by Honda, see plmtest), while if there are no individual effects, the optimal test for serial correlation is the Breusch‐Godfrey test. If the presence of a random effect is taken for granted, then the optimal test for serial correlation is the likelihood‐based conditional LM test of Baltagi and Li (1995) (see pbltest).

4.3.3 Likelihood Ratio Tests for AR(1) and Individual Effects

Likelihood ratio (LR) tests for restrictions are based on the likelihoods from the general and the restricted model. The test statistic is simply twice the difference of the values of the log‐likelihood function:

equation

where images is the full vector of parameter estimates from the unrestricted model and images from the restricted one, and images is the number of restrictions.

A likelihood ratio test for serial correlation in the idiosyncratic residuals can be done, in general, as a nested models test comparing the model with spherical idiosyncratic residuals with the more general alternative featuring AR(1) residuals. If both estimated models allow for random effects, then the test will become conditional on the latter feature.

Thus,

equation

and symmetrically for images.

4.3.4 Applying Traditional Serial Correlation Tests to Panel Data

A general testing procedure for serial correlation in fixed effects (FE), random effects (RE), and pooled‐OLS panel models alike can be based on considerations in (Wooldridge, 2002, 10.7.2). For the random effects model, Wooldridge (2002) observes that under the null of homoscedasticity and no serial correlation in the idiosyncratic errors, the residuals from the quasi‐demeaned regression must be spherical as well. Else, as the individual effects are wiped out in the demeaning, any remaining serial correlation must be due to the idiosyncratic component. Hence, a simple way of testing for serial correlation is to apply a standard serial correlation test to the quasi‐demeaned model. The same applies in a pooled model, w.r.t.the original data.

The FE case is different. It is well known that if the original model's errors are uncorrelated, then FE residuals are negatively serially correlated, with images for each images (see Wooldridge, 2002, 10.5.4). This correlation clearly dies out as images increases, so this kind of AR test is applicable to within model objects only for images “sufficiently large”. Baltagi and Li (1995) derive a basically analogous images‐asymptotic test for first‐order serial correlation in a FE panel model as a Breusch‐Godfrey LM test on within residuals (see Baltagi and Li, 1995, par. 2.3 and formula 12). They also observe that the test on within residuals can be used for testing on the RE model, as “the within transformation wipes out the individual effects, whether fixed or random.” Generalizing the Durbin‐Watson test to FE models by applying it to fixed effects residuals is documented in Bhargava et al. (1982). On the converse, in short panels the test gets severely biased toward rejection (or, as the induced correlation is negative, toward acceptance in the case of the one‐sided Durbin‐Watson test with alternative set to 'greater'). See below for a serial correlation test applicable to “short” FE panel models.

As observed above, applying the pbgtest and pdwtest functions to an FE model is appropriate only if the time dimension is long enough. In the frequent case of ”short” panels, one of the two testing procedures due to Wooldridge (2002) and described in the next section should be used instead.

4.3.5 Wald Tests for Serial Correlation using within and First‐differenced Estimators

4.3.5.1 Wooldridge's within‐based Test

Due to the demeaning procedure, under the null of no serial correlation in the errors, the residuals of an FE model must be negatively serially correlated, with images for each images. Wooldridge suggests basing a test for this null hypothesis on a pooled regression of FE residuals on their first lag:

equation

Rejecting the restriction images makes us conclude against the original null of no serial correlation.

The function carrying out this procedure estimates the FE model, retrieves residuals, then estimates an auxiliary (pooled) AR(1) model, and tests the above‐mentioned restriction on images. Internally, a heteroscedasticity‐ and autocorrelation‐consistent covariance matrix (vcovHC, see next chapter) is used, as originally prescribed. The test is applicable to any FE panel model and in particular to “short” panels with small images and large images.

4.3.5.2 Wooldridge's First‐difference‐based Test

In the context of the first difference model, Wooldridge (2002, 10.6.3) proposes a serial correlation test that can also be seen as a specification test to choose the most efficient estimator between fixed effects (within) and first difference (FD).

The starting point is the observation that if the idiosyncratic errors of the original model images are uncorrelated, the errors of the (first) differenced model images will be correlated, with images, while any time‐invariant effect is wiped out in the differencing. So a serial correlation test for models with individual effects of any kind can be based on estimating the auxiliary model

equation

and testing the restriction images, corresponding to the null of no serial correlation in the original model. Drukker (2003) provides Monte Carlo evidence of the good empirical properties of the test.

On the other extreme (see Wooldridge, 2002, 10.6.1), if the differenced errors are uncorrelated, then images is a random walk. In this latter case, the most efficient estimator is the first difference (FD) one; in the former case, it is the fixed effects one (within).

4.4 Tests for Cross‐sectional Dependence

Next to the more familiar issue of serial correlation, a growing body of literature has been dealing with cross‐sectional dependence in panels, which can arise, e.g., if individuals respond to common shocks (as in common factor models) or if spatial diffusion processes are present, relating individuals in a way depending on a measure of distance (as in spatial models).

If cross‐sectional dependence is present, the consequence is, at a minimum, inefficiency of the usual estimators and invalid inference when using the standard covariance matrix. This is the case, for example, in unobserved effects models when cross‐sectional dependence is due to an unobservable factor structure but with factors uncorrelated with the regressors. In this case the within or RE are still consistent, although inefficient (see De Hoyos and Sarafidis, 2006). If the unobserved factors are correlated with the regressors, which can seldom be ruled out, consequences are more serious: the estimators will be inconsistent.

4.4.1 Pairwise Correlation Coefficients

Correlation in the cross‐section can take very diverse shapes. The most common testing procedures are based on considering the population of all possible pairwise correlations between pairs of distinct individual units, estimating each one independently, by exploiting the time dimension of the data, and then calculating some synthetic measure or test statistic. The basic tool for assessing pairwise correlation between individual units images and images for a double‐indexed vector images is the product‐moment correlation coefficient, defined as

equation

A descriptive assessment of the degree of cross‐sectional correlation in the given sample can then be based on the average of individual correlation coefficients: images. If individual correlations are some positive and some negative, this solution has the problem that coefficients with different signs compensate, yielding a statistic that underestimates the true level of dependence in the data. Therefore, another common procedure is to average the absolute values of individual coefficients: images.

4.4.2 CD‐type Tests for Cross‐sectional Dependence

A number of statistics for testing the null hypothesis of no cross‐sectional dependence in model errors can be based on images. The function pcdtest implements both the calculation of images and images, and a family of cross‐sectional dependence tests that can be applied in different settings, ranging from those where images grows large with images fixed to “short” panels with a big images dimension and a few time periods. All are based on (transformations of) the product‐moment correlation coefficient of a model's residuals, defined as above. The Breusch‐Pagan LM test, based on the squares of images, is valid for images with images fixed:

equation

where in the case of an unbalanced panel only pairwise complete observations are considered, and images with images being the number of observations for individual images; else, if the panel is balanced, images for each images. The test is distributed as images. It is inappropriate whenever the images dimension is “large.” A scaled version, applicable also if images and then images (as in some pooled time series contexts), is defined as:

equation

and distributed as a standard normal.

Pesaran's (2004) CD test

equation

based on images without squaring (also distributed as a standard Normal) is appropriate both for images‐ and images‐asymptotics. It has good properties in samples of any practically relevant size and is robust to a variety of settings. The only big drawback is that the test loses power against the alternative of cross‐sectional dependence if the average correlation is zero, even if individual coefficients are non‐zero. Such a situation is not uncommon and can arise for example in the presence of an unobserved factor structure with factor loadings averaging zero, that is, where some units react positively to common shocks, others negatively. Another case where the test will lose power is if the data are cross‐sectionally demeaned, or when the model contains time‐specific dummies (see Sarafidis and Wansbeek, 2012, p. 27). In these instances, the absolute correlation coefficient images is likely to turn out much bigger than images.

4.4.3 Testing Cross‐sectional Dependence in a pseries

Next to testing for cross‐sectional correlation in model residuals, tests in the CD family can be employed in preliminary statistical assessments as well, in order to determine whether the dependent and explanatory variables show any correlation to begin with. To this end, the pcdtest function has a pseries method, meaning that it can be fed a pseries object as well. One can either calculate the descriptive statistics images and images or resort to a formal test.

Notes

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.149.251.154