Chapter 3
Advanced Error Components Models

3.1 Unbalanced Panels

For unbalanced panels, the number of observations for each individual is now individual specific and denoted by images. We'll denote by images the total number of observations. Compared to the balanced panel case, three complications appear:

  • firstly, the covariance matrix of the errors cannot be written any more as a linear combination of idempotent and mutually orthogonal matrices (for the one‐way error component model, the within and the between matrices), the weights being the variances of the errors (images and images). Denoting by images and images two matrices of individual and time dummies, matrices of the type images, returning either the sum of the values for an individual or for a time series, will explicitly appear, and these matrices are not idempotent;
  • secondly, for the individual effects model, the within transformation still consists of removing the individual mean from the variable. On the contrary, for the two‐ways effects, the within transformation is not obtained by performing a difference with the individual and time means, as in the balanced panel case, but requires more tedious matrix algebra;
  • finally, to estimate the components of the variance, we will still compute quadratic forms of the residuals of some consistent preliminary estimations, but there is no obvious choice of denominators, as there was in the balanced case.

3.1.1 Individual Effects Model

The model to be estimated can be written:

equation

The fixed effects model may be estimated by regressing images on images and images. Like in the balanced panel case, the Frisch‐Waugh theorem enables to avoid the estimation of the fixed effects. The estimation of images may be obtained by regressing in a first stage images and images on images, computing the residuals and then regressing in the second stage the residuals of images on those of images. As in the balanced panel case, these residuals are just the individual within transformation, i.e., images or images in matrix form, and the fixed effects model is simply obtained by regressing images on images.

For the GLS model, the covariance matrix of the errors is:

equation

The GLS estimator writes:

equation

images is a block‐diagonal matrix that contains images square matrices of ones of dimension images. For balanced panels, images and images. images returns the sum of the values of images for each individual. images is also a block‐diagonal matrix, with blocks images of the form:

equation

with images.

The inverse of a block‐diagonal matrix being equal to a block‐diagonal matrix for which the blocks are the inverses of those of the initial matrix, it is sufficient to calculate the inverse of images. As it is a linear combination of two idempotent and orthogonal matrices, the general formula for any power of images is:

equation

In particular, the inverse is:

equation

which can also be written as images, with:

equation

The GLS estimator may then be obtained by applying OLS on variables that have been transformed by pre‐multiplying them by images or, equivalently, by images (which will simplify notation):

equation

As in the balanced case, the transformed data can be expressed as quasi‐differences, images, with:

equation

the only difference being that now, the proportion of the individual mean that is removed is not a constant, as it depends on the number of observations for each individual.

3.1.2 Two‐ways Error Component Model

For the two‐ways error component model, we have:

equation

or, in matrix form:

equation

where images and images are matrices of respectively individual and time dummies. Pre‐multiplying a vector by images and images returns, respectively, the individual and time sum of the variable.

images and images are two diagonal matrices that contain the number of observations for each individual and time‐series. Pre‐multiplying a vector by images or by images returns, respectively, the individual and the time series means. Finally, images is a images matrix of ones and zeros, which indicates whether an observation for a specific individual and time period is present or not.

To help visualizing these matrices, we consider a panel with 3 individuals and 4 periods; the panel is unbalanced, as the first individual is not observed in the third and fourth periods, and the third one is not observed in the first period.

equation
equation
equation

3.1.2.1 Fixed Effects Model

The fixed effects model can be estimated regressing images on images and the two matrices associated with the effects vectors images and images.

The application of the Frisch‐Waugh theorem implies that the estimation can be performed by regressing in a first stage images, images, and images on images and then, in a second stage, by regressing the residuals of images on those of images and images, which means regressing images on images and images.

Applying the same theorem again, one can regress in images and images on images in the first stage, and the residuals of images on those of images in the second stage.

Residuals of a regression on images are obtained by pre‐multiplying the variables by the matrix:

equation

where, for any matrix images, images is the generalized inverse of images. Finally, the two‐ways error component fixed effects model may be obtained by applying to images and every column of images the following transformation:

equation

The double‐within transformation consists then, for unbalanced panels, in multiplying any data vector by the following matrix:

equation

Therefore, the two‐ways fixed effects model is still easy to compute even if the panel is unbalanced: all that is required is OLS estimation and the computation of deviations from the individual means. One proceeds as follows:

  • first, the individual within transformation is applied to images, images and images,
  • next, images and images are regressed on images,
  • finally, from these regressions, the residuals of images and images are obtained; then the residuals of the latter are regressed on those of the former.

The within transformation is performed on images variables, and then images preliminary linear estimations are performed on images covariates before the final estimation for which there are images covariates.

Note that no specific matrix computation is required and that, in particular, the matrix of individual dummies, which is often very large (images), need not to be stored during the estimation.

3.1.2.2 Random Effects Model

The variance matrix of the errors is:

equation

with:

equation

Denote images the covariance matrix of the errors of the individual one‐way error components model. We then have:

equation

images is block‐diagonal, with blocks: images. images and images being idempotent and orthogonal, the matrix images (defined so that images) is also a block‐diagonal matrix with blocks: images. We then have:

equation
equation

for which the inverse is:

equation

We then apply the following result: images to the matrix in brackets:

equation

Finally, we have:

equation

and the GLS estimator is:

equation

Let images and images the matrix of the covariates and of the time dummies measured in quasi‐difference from the individual means. We then have:

equation

and a similar expression for images. With the two matrices images and images in hand, the computation of the estimator requires:

  • computing the cross products of the two matrices images, images and images,
  • computing the inverse of a matrix of dimension images.

These are reasonable computational tasks: note especially that the matrix of individual effects needn't be stored and that the dimension of the matrix that has to be inverted is images and not images or images and that, at least for micro‐panels, images is relatively small. Note also that computation of the GLS estimator requires explicit matrix operations and it can no longer be obtained as a series of linear regressions on transformed data.

3.1.3 Estimation of the Components of the Error Variance

Remember that, in the balanced panel case, we used the result that natural estimators of images and images were:

equation

Feasible estimates were obtained by replacing images by the residuals images from a consistent estimation. For the balanced case, images and images were natural denominators. This is no longer the case when the panel is unbalanced, as images is not the same for all individuals (and images is not the same for all time periods).

The strategy used here consists in computing the expected values of the quadratic forms in order to obtain unbiased estimators of the variance components:

  • first, for a given estimator, define the matrix images that transforms the errors into the residuals of the images estimator: images,
  • compute the two quadratic forms of the within and between transformation of the residuals: images and images (images and images for the two‐ways error component model).
  • compute the expected values of these quadratic forms, which are functions of images, images (and images for the two‐ways model),
  • equate the quadratic forms to their expected values and solve the system of two (or three equations for the two‐ways error component model) for images, images (and images in the latter case).

Different estimators are obtained using different preliminary models to obtain the residuals. Among the numerous possible choices, as previously seen on chapter 2:

  • Wallace and Hussain (1969) use the residuals of the pooling estimator for the two quadratic forms,
  • Amemiya (1971) use the residuals of the OLS estimator for the two quadratic forms,
  • Swamy and Arora (1972) use the residuals of the OLS estimator for the first quadratic form and those of the OLS estimator for the second one.

The model and its estimation are:

(3.1)equation

The intercept can be removed by pre‐multiplying every element of the model by: images, which subtracts from every variable its overall mean and therefore removes the intercept, as images.

(3.2)equation

Subtracting the expression of the model and of its estimation, we get:

(3.3)equation

The three estimators we use (OLS, within, and between) can be seen as GLS estimators of this model, with images being equal, respectively, to images, images, and images:

(3.4)equation

Using the two previous expressions, we get images with:

(3.5)equation

images is the matrix that transforms the error vector into the residuals vector. Note that it is not a symmetric matrix, at least unless images (which corresponds to the pooling model). The quadratic form of the residuals with a matrix images is:

equation

images being a scalar, it is also equal to its trace:

equation

Using the cyclic property of the trace operator, we get:

equation

from which, taking expectations, we obtain:

equation

with images.

Finally, we get:

equation

Replacing images by its expression and denoting images, we get:

equation

or, denoting images:

equation

The most common estimators are obtained by considering the quadratic forms with the within, between‐individual, and between‐time matrices. We then get the following system of equations:

(3.6)equation

with

(3.7)equation

Using the following results: images, images, images, images, images, images, images, images, images, images and images, images and images, images, images, images, images, images, images, we get:

equation

or:

equation

The estimator is obtained by equating the quadratic form and its expected value:

equation

The images matrices corresponding to the three most common estimators are presented in Figure 3.1.

equation

Amemiya (1971):

equation

Wallace and Hussain (1969):

equation

Swamy and Arora (1972):

equation

Figure 3.1Estimators of the variance components for unbalanced panels.

3.2 Seemingly Unrelated Regression

3.2.1 Introduction

Very often in economics, the phenomenon under investigation is not well described by a single equation but by a system of equations. It is particularly the case in the field of micro‐econometrics of consumption or production. For example, the behavior of a producer is described by a minimum cost equation along with equations of factor demand. In this case, there are two advantages in considering the whole system of equations:

  • firstly, the errors of the different equations for an observation may be correlated. In this case, even if the estimation of a single equation is consistent, it is inefficient because it does not take into account the correlation between the errors,
  • secondly, economic theory may impose restrictions on different coefficients of the system, for example, the equality of two coefficients in two different equations of the system. In this case, these restrictions can be taken into account using the method of constrained least squares.

3.2.2 Constrained Least Squares

Linear restrictions on the vector of coefficients to be estimated can be represented using a restriction matrix images and a numeric vector images:

equation

For example, if the sum of the first two coefficients must equal 1 and the first and third ones should be equal, the joint restrictions can be written as:

equation

To estimate the constrained OLS estimator, we write the Lagrangian:

equation

with images and images the vector of Lagrange multipliers associated to the different constraints.1 The Lagrangian can also be written as:

equation

The first‐order conditions become:

equation

which can also be written in matrix form:

equation

The constrained OLS estimator can be obtained using the formula for the inverse of a partitioned matrix (see equation 2.18):

equation

with images and images.

We have here images. The constrained estimator is then: images, with images and images

The unconstrained estimator being images, we finally get:

equation

The difference between the constrained and the unconstrained estimators is then a linear combination of the excess of the linear constraints of the model evaluated for the unconstrained model.

3.2.3 Inter‐equations Correlation

We consider a system of images equations denoted images, with images. In matrix form, the system can be written as follows:

equation

The covariance matrix of the errors of the system is:

equation

We suppose that the errors of two equations images and images for the same observations are correlated and that the covariance, denoted by images, is constant. With this hypothesis, the covariance matrix is:

equation

Denoting by images the matrix of inter‐equations covariances, we have:

equation
equation

Because of the inter‐equations correlations, the efficient estimator is the GLS estimator: images. This estimator, first proposed by Zellner (1962), is known by the acronym SUR for seemingly unrelated regression. It can be obtained by applying OLS on transformed data, each variable being pre‐multiplied by images. This matrix is simply images. Denoting by images the elements of images, the transformed response and covariates are:

equation

images is a matrix that contains unknown parameters, which can be estimated using residuals of a consistent but inefficient preliminary estimator, like OLS. The efficient estimator is then obtained the following way:

  • first, each equation is estimated separately by OLS and we note images the images matrix for which every column is the residual vector of one of the equations in the system,
  • then, estimate the covariance matrix of the errors: images,
  • compute the matrix images and use it to transform the response and the covariates of the model,
  • finally, estimate the model by applying OLS on transformed data.

images can conveniently be computed using the Cholesky decomposition, i.e., computing the lower‐triangular matrix images such that images.

3.2.4 SUR With Panel Data

Applying the SUR estimator on panel data is straightforward when only the between or the within variability of the data is taken into account. In this case, one just has to apply the above formula using the variables in individual means (betweenSUR) or in deviations from individual means (withinSUR). Taking into account both sources of variability requires more attention and leads to the SUR error component model proposed by Avery (1977) and Baltagi (1980). The errors of the model then present two sources of correlation:

  • the correlation of the SUR model, i.e., inter‐equations correlation,
  • the correlation taken into account in the error component model, i.e., the intra‐individual correlations.

Every observation is now characterized by three indexes: images is the observation of images for equation images, individual images and period images. The observations are first ordered by equation, then by individual. Denoting images the error vector for equation images and individual images, one gets:

equation

The errors concerning different individuals being uncorrelated, the correlation matrix for two equations and all individuals is:

equation

Finally, for the whole system of equations, denoting images and images the two matrices of dimension images containing the parameters images and images, the covariance matrix of the errors is:

equation

The SUR error component model may be obtained by applying OLS on transformed data, every variable being pre‐multiplied by images.

and may be estimated using the Cholesky decomposition of images and images (see Kinal and Lahiri, 1990).

The two error covariance matrices being unknown, the error‐component SUR estimator is obtained with the following steps:

  • first, each equation is estimated separately using a consistent method of estimation (for example OLS): we denote by images and images the matrices of residuals in deviation from the individual means and in individual means, respectively,
  • next, we estimate the error covariance matrices: images and images,
  • we then compute the matrices images and images and hence, through 3.8, we obtain the transformed variables images and images,
  • finally, we apply OLS on images and images.

Different choices of preliminary estimates lead to different SUR‐error component estimators. For example, Baltagi (1980) used the method of Amemiya (1971) while Avery (1977) chose the one of Swamy and Arora (1972).

3.3 The Maximum Likelihood Estimator

An alternative to the OLS estimator presented in the previous chapter is the maximum likelihood estimator. Contrary to the GLS estimator, the parameters are not estimated sequentially (first images and then images) but simultaneously.

3.3.1 Derivation of the Likelihood Function

In order to write the likelihood of the model, the distribution of the errors must be perfectly characterized; compared with the GLS model, we then must add an hypothesis concerning the distribution of the two components of the error term, the individual images and the idiosyncratic images effects: we'll suppose that they are both normally distributed. The likelihood is the joint density for the whole sample, which is the product of the individual densities in the case of a random sample. This is not the case here, as the images observations of individual images are correlated because of the common individual effect. The model to be estimated is then:

equation

with images and images. For a given value of the individual effect, images, the density for images is:

equation

For a given value of images, the distribution of images is the one of a vector of independent random deviates, and the joint distribution is therefore the product of individual densities:

equation

The unconditional distribution is obtained by integrating out the individual effects images, which means that the mean value of the density is computed for all possible values of images:

equation

with, denoting images, images and images:

equation

Denoting by images the first term, we have images and the joint density is then (denoting images):

equation

For the second term, we have:

equation

so that the joint density for an individual is finally:

equation

The contribution of the images‐th individual to the log likelihood function is simply the logarithm of the joint density:

equation

The log likelihood function is then obtained by summing over all the individuals of the panel:

equation

or, more simply in the special case of a balanced panel:

equation

Note also that:

equation

3.3.2 Computation of the Estimator

The first derivatives of the log likelihood are, denoting images:

Solving 3.9, we obtain:

The estimator of images is simply obtained by using 3.10 as the residual variance of the model estimated on the transformed data:

Finally, using 3.11 and 3.13, the transformation parameter is:

The estimation can be performed iteratively. Starting from an estimator of images (for example the within estimator), we calculate images using the formula given by 3.14. We then transform the response and the covariates using this estimator of images and we compute a second estimation of images using 3.12. These computations are repeated until the convergence of images and images. images is then estimated using 3.13.

3.4 The Nested Error Components Model

3.4.1 Presentation of the Model

The nested random effect model is relevant when the individuals can be put together in different groups. For example, with a panel of firms, groups may be constituted by regions or production sectors.

In this chapter, we'll restrict ourselves to panels with two characteristics:

  • panels without time effects,
  • balanced panels inside each group, which means that, for every group, the number of observations for each individual is the same.

The number of individuals and the length of time series for two groups may be different. This is why this model, presented in Baltagi et al. (2001) is called the unbalanced nested error component model, even if its unbalancedness must be understood in the very restrictive sense we've just described.

Three effects will now be considered: the usual individual images and idiosyncratic images effects, but also a new one that represents group effects images. Denoting by images the matrix of group dummies:

equation

images is block‐diagonal with images (the number of groups) blocks of the following shape:

equation

Replacing images by images and images by images, this can be rewritten as a linear combination of three symmetric, idempotent, and orthogonal matrices which sum to images:

equation

where:

  • images is the within‐individual transformation,
  • images is the between‐individual transformation measured as a difference with the group mean,
  • images is the between‐group transformation.

This expression enables to easily find the expression for images, denoting images and images:

equation

which finally writes:

equation

with images and images.

The model can therefore be estimated by OLS on transformed variables for which part of the individual and the group mean (respectively images and images have been subtracted).

3.4.2 Estimation of the Variance of the Error Components

We proceed along the lines of section 3.1.3. Using residuals from a preliminary estimation images denoted images, we compute a quadratic form of images with a matrix images images.

equation

Replacing images by its expression and denoting images, we obtain:

equation

or, denoting images:

equation

The most popular estimators are obtained by computing the three quadratic forms with the within‐individual, between‐individual and between‐group matrices. We then get the following system of equations:

(3.15)equation

with:

(3.16)equation

Using the following results: images, images, images, images, images, images, images, images, images, images, images, images, images, images, images, images, images, images, images.

We finally obtain:

equation

or:

equation

Baltagi et al. (2001) have proposed a variant of the Amemiya (1971) estimator (where the within estimator is used for the three quadratic forms), the Wallace and Hussain (1969) estimator (the OLS estimator is used for the three quadratic forms) and of the Swamy and Arora (1972) estimator (the within, between‐individual and between‐group are used respectively for the within, between‐individual, and between‐group quadratic forms). The detailed formulas are presented in Figure 3.2.

equation

Amemiya (1971):

equation

Wallace and Hussain (1969):

equation

Swamy and Arora (1972):

equation

Figure 3.2Error components estimators for the nested error component model.

Notes

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.17.150.163