Chapter 6
Endogeneity

6.1 Introduction

There is an endogeneity problem when the error is correlated with at least one explanatory variable. This phenomenon is very common in econometrics because, compared to experimental sciences, it is not possible (or it is at least difficult) to control the data‐generating process. Among the possible causes of endogeneity, the three most important are:

  • simultaneity. In this case, there is an explanatory variable that is set simultaneously with the response: this is, for example, the case when one seeks to estimate a demand equation for a good, which contains the price of the good itself. The demand and the price are simultaneously set by the condition of equality of supply and demand and, therefore, a variation of the error term of the demand equation will shift the demand curve and therefore induce a variation of the quantity and the equilibrium price. The price variable is therefore endogenous.
  • covariate measured with error. If the “true” model is images and what is observed is images, the estimated model writes: images, or images with images. Hence, images is correlated with images, which is therefore endogenous.
  • omitted variable. If the “true” model is images and images is unobserved, the estimated model is images, with images. The error of the estimated model then contains the influence of the omitted variable, and this error is correlated with images if images and images are correlated. Once again, the covariate images is then endogenous.

The OLS estimator is:

equation

Replacing images by its expression: images, we obtain images as a function of the errors of the model:

equation

We then have, denoting images the sample size:

equation

The estimator is consistent (images) if images, this expression being the vector of covariances for the population between the covariates and the error. The ordinary least squares model is therefore consistent if the covariates and the error are uncorrelated. When this condition is not met, the method of instrumental variables, which will be presented in detail in this chapter, can be used.

Concerning simultaneity, there is an additional problem as the model is not defined by one equation but by a system of equations. In this case, two strategies can be followed:

  • estimating only the equation of interest (limited information estimator),
  • estimating simultaneously all the equations (full information estimator).

The latter approach leads to a more efficient estimator, as the correlation of the errors of all the equations is taken into account. But if an equation is wrongly specified, it can contaminate the estimation of the parameters of the other equations of the model.

6.2 The Instrumental Variables Estimator

6.2.1 Generalities about the Instrumental Variables Estimator

Let us consider the following model: images with images. if at least one of the covariates is correlated with the errors, the OLS estimator is not consistent. In order to obtain consistency, we use the instrumental variables estimator. The instrumental variables are denoted by images.1 Denoting by images the number of the covariates and by images the number of instruments (not including the column of ones), the instrumental variables must verify: images. Stated differently, they must not be correlated with the errors.2 In the simplest case where the number of instruments equals the number of covariates, the instrumental variable estimator is simply obtained by solving the system of equations: images, which is just identified. Developing this expression, we obtain: images, which can also be written:

(6.1)equation

If there are more instruments than covariates (images), images is an over‐determined system of linear equations, which, except for very special cases, doesn't have a solution. In this case, two equivalent approaches can be used to obtain the optimal estimator. The first one consists in pre‐multiplying the model by images:

(6.2)equation

It is a model that contains images rows and images parameters to estimate images. If one considers it as a standard regression model, the variance of the errors being images, the best linear estimator is the GLS estimator, and we then obtain the following instrumental variables estimator:

with images.

The second approach is the generalized method of moments. We consider here a vector of images moments: images for which the variance is images. Using the generalized method of moments, we seek to minimize the quadratic form of the vector of moments, using the inverse of the variance matrix of these moments:

equation

The first‐order conditions for a minimum are: images, and solving this system of linear equations, we obtain the same estimator as before.

The instrumental variables estimator is also called the two‐stage least squares estimator (2SLS), as it can be obtained by applying twice the method of ordinary least squares. When we consider the regression of images on images, we obtain the estimator images and the fitted values images. The matrix images is therefore the projection matrix on the subspace defined by the columns of images. This matrix is symmetric and idempotent, which means that images. The instrumental variables estimator 6.3 can also be written, denoting by images the fitted values of the covariates regressed on the instrumental variables:

(6.4)equation

and can therefore be obtained by applying OLS twice:

  • the first time by regressing every covariate on the instruments,
  • the second time by regressing the response on the fitted values of the first‐stage estimation.

The variance of the instrumental variables estimator is:

equation

The estimator is therefore the more efficient the larger the variance of images, which means that images and images are highly correlated.

6.2.2 The within Instrumental Variables Estimator

The specificity of panel data methods is that the error term is modeled as having two components, an individual effect and an idiosyncratic term. Therefore, the correlation between covariates and instrumental variables, on the one hand, and the errors of the model, on the other hand, must be analyzed separately for each component of the error. In this section, we consider the estimation of the model transformed in deviations from individual means. This transformation wipes out the individual effect; therefore, there is no reason to take care of the correlation between the covariates and the individual effects. The images is obtained by pre‐multiplying the model first by images: images and then by images,

and applying GLS to this transformed data, the variance matrix of the errors of this model being images:

equation

or, denoting by: images the projection matrix defined by the within transformation of the instruments:

(6.6)equation

A similar reasoning can be followed for the between model. We consider the between transformation of the model images, with the same transformation applied to the instruments (images). The instrumental variables estimator is obtained by pre‐multiplying the model by images:

and applying to this transformed model the GLS estimator:

(6.8)equation

with images.

The images is consistent, even if the individual effects are correlated with the covariates. On the contrary, the images is consistent only if there is no correlation. If this hypothesis is verified, none of them is efficient, as each of them take into account only one component of variability.

6.3 Error Components Instrumental Variables Estimator

In the previous section, the potential correlation between some covariates and the individual effects has been treated drastically by using the within transformation, which wipes out the individual effects. In this section, we present the error component instrumental variables estimator. The two components of the error being present in this model, it is in this case essential to tackle the issue of a potential correlation of some covariates with the two components of the error.

6.3.1 The General Model

Suppose in a first step that the idiosyncratic component of the error is not correlated with the covariates. In this case, if all the covariates are uncorrelated with the individual effects, the unbiased efficient estimator is the GLS estimator. This estimator enables, on the one hand, to take into account part of the inter‐individual variation in the sample and, on the other hand, to estimate parameters associated with covariates that don't exhibit temporal variations.

If, on the contrary, all the covariates are correlated with the individual effects, Mundlak (1978) (see subsection 4.2) has shown that the efficient estimator, which is the GLS estimator, is the same as the within estimator if the correlation between the individual effects and the covariates (more precisely the individual means of the covariates) is taken into account.

When only some covariates are correlated with the individual effects, none of the two previous estimators is appropriate any more:

  • the GLS estimator is not consistent anymore because of the correlation of some covariates with the individual effects,
  • the within estimator is still consistent but not efficient any more, as it doesn't take into account the fact that some covariates are uncorrelated with the individual effects but wipes out all the inter‐individual variation in the sample, especially the covariates that don't exhibit any temporal variation.

The best solution in this case consists then in using an estimator that, on the one hand, uses instrumental variables and, on the other hand, exploits the two sources of variability of the panel in an optimal way. The essential question is then to find good instruments, which is often a difficult task. The richness of panel data allows to overcome this problem. Actually, every covariate can generate two instrumental variables, using the between and the within transformations. If a rank condition that will be detailed later on is checked, the model can then be estimated without any external instrument. This approach has been used by Hausman and Taylor (1981), Amemiya and MaCurdy (1986), and Breusch et al. (1989).

If, from now, we suspect that some covariates are also correlated with the idiosyncratic part of the error, then none of the estimators we have listed above is consistent. We then use an instrumental variables estimator (within or GLS) using external instruments. This strategy has been developed by Baltagi (1981) with his “error component two‐stage least squares” images estimator and by Balestra and Varadharajan‐Krishnakumar (1987) with their “generalized two‐ stage least squares” images estimator, which differ by the way the instruments are introduced in the model.

This two branches of the literature have been developed separately, and this dichotomy exists also in most software packages, which usually provide two different functions to estimate these models. We'll follow the approach of Cornwell et al. (1992), who provide a unified view of panel models with instrumental variables. These authors consider three kinds of variables:

  • the endogenous variables, which are correlated with the two components of the error,
  • the simply exogenous variables, which are correlated with the individual effects but not with the idiosyncratic part of the error,
  • the doubly exogenous variables, which are uncorrelated with both components of the error.

Variables from the first category don't provide any usable instrument. For the second one, the within transformation is a valid instrument, as it is by construction orthogonal to the individual effects and by hypothesis uncorrelated with the idiosyncratic part. Finally, each covariate of the third category provides two instruments by using the within and the between transformation.

Consider now the specific case of time‐invariant covariates. For these variables, images and images. Therefore, such a variable provides either one instrument, if it is uncorrelated with the individual effects (the covariate itself), or no instrument.

We start with the model to be estimated written in matrix form:

equation

With the usual hypotheses concerning the error component model, the variance matrix of the error is: images. We first pre‐multiply the model by: images and then obtain a transformed model for which the errors are iid.

equation

We then apply to this model the instrumental variables method, using a set of instruments, which, denoting by images the doubly exogenous variables, by images the simply exogenous variables, and by images the whole set of instruments, can be written:

equation

where images is a set of variables that will be defined later. For now, just consider that these variables must provide valid instruments when the between transformation is applied.

The instrumental variables estimator is, denoting by images the projection matrix defined by the instruments:

equation

The two matrices images and images being orthogonal, the projection matrix may also be written as the sum of two projection matrices defined by the instruments transformed by the within and the between matrices:

equation

The estimator is then:

equation

or also, denoting images:

One can check that, as in the simple error component model, this estimator is a weighted average of the within and the between estimators: images, with:

equation

Several models proposed in the literature are special cases of this general model.

6.3.2 Special Cases of the General Model

6.3.2.1 The within Model

Firstly, if there are no external instruments and if all the covariates are simply exogenous, we have images and images, and the within estimator results.

Then, if all the covariates are either simply exogenous or endogenous and if the external instruments are simply exogenous, we also have images, and images is constituted only by simply exogenous covariates and external instruments. The condition for identification is then that the number of external instruments must be at least equal to the number of endogenous covariates. We then have the within instrumental variables estimator:

equation

6.3.2.2 Error Components Two Stage Least Squares

Baltagi (1981)'s estimator is the special case where images, which means that all the instruments (and potentially some of the covariates) are assumed to be doubly exogenous and are therefore used twice. We start from equations 6.5 and 6.7, which leads respectively to the within and between estimators. Stacking these two equations, we obtain:

equation

which is justified by the fact that the vector of parameters to be estimated images is the same in the two equations. In order to apply GLS, we compute the variance of the errors of the stacked model:

equation

We then apply the formula of the GLS estimator:

equation
equation

and we finally obtain:

(6.10)equation

which is the special case of the general model defined by equation 6.9 for which images.

6.3.2.3 The Hausman and Taylor Model

In the Hausman and Taylor (1981) model, there are no endogenous variables, only simply or doubly exogenous variables. We then have images, images and images. Moreover, the authors stress the presence of variables with (images) or without (images) time variation. The set of instruments they use is:

equation

Only covariates that exhibit time variation may be used with their within transformation images) and doubly exogenous time‐invariant variables are used without transformation as instruments (images). Without external instruments, denoting by images the number of covariates of the 4 categories, the number of instruments is images as the number of covariates is: images. The model is then identified if images, i.e., if the number of doubly exogenous time‐varying variables (which provide two instruments) is greater than the number of time‐invariant simply exogenous variables, which provide no instrument.

6.3.2.4 The Amemiya‐Macurdy Estimator

Hausman and Taylor (1981)'s estimator is consistent if the individual means of the doubly exogenous variables are uncorrelated with the individual effects. Amemiya and MaCurdy (1986) use the stronger hypothesis that the doubly exogenous variables are uncorrelated with the individual effects for each period. We then have: images for every doubly exogenous covariate. The corresponding instrument matrix is constructed the following way. Let images be the matrix of doubly exogenous instruments of dimension images for individual images. images is a vector of length images obtained by stacking the columns of images. The instrument matrix for individual images is then images, and for the whole sample, we obtain a matrix of dimension images:

6.3.2.5 The Breusch, Mizon and Schmidt's Estimator

Breusch et al. (1989) expand the instruments used by Amemiya and MaCurdy (1986) by assuming that the within transformations of simply exogenous covariates are valid instruments at every period. Stated differently: images. We then obtain the further matrix of instruments images by applying to images the same transformation than the one used in equation 6.11. The other contribution of Breusch et al. (1989) is to show how the different estimators can be presented in a consistent and nested way. They use the fact that the projection subspace defined by images is the same as the one defined by images:

  • Hausman and Taylor (1981): images,
  • Amemiya and MaCurdy (1986): images,
  • Breusch et al. (1989): images,

As each estimator adds instruments to the previous one, if these instruments are valid, it is necessarily more efficient. Moreover, the validity of extra instruments may be tested by comparing the two models with a Hausman test.

6.3.2.6 Balestra and Varadharajan‐Krishnakumar Estimator

This last estimator, proposed by Balestra and Varadharajan‐Krishnakumar (1987), is not, contrary to the others, a special case of the general model previously presented. For this model, called the images estimator (for “generalized two‐stage least squares”), the same transformation is applied to the instruments that is applied also to the covariates and to the response. Therefore, the matrix of instruments is:

equation

Baltagi and Li (1992) have shown that the instruments used by Baltagi (1981), images, perform the same projection as images and images. The instruments used by Balestra and Varadharajan‐Krishnakumar (1987) are therefore a subset of those used by Baltagi (1981), the supplementary instruments used by Baltagi (1981) being either images or images. Therefore, the estimator of Baltagi (1981) is necessarily not less efficient than the one of Balestra and Varadharajan‐Krishnakumar (1987). Baltagi and Li (1992) show, using White (1986), that the supplementary instruments used by Baltagi (1981) are redundant, which means that they don't add any gain in terms of asymptotic efficiency. Consequently, both estimators have the same asymptotic variance.

However, the estimator of Balestra and Varadharajan‐Krishnakumar (1987) has an important drawback. A part of the between component of every instrumental variable is included in the instruments, and consequently, the estimator of Balestra and Varadharajan‐Krishnakumar (1987) is unable to take into account simply exogenous instruments.

With plm, the way instruments are introduced is indicated by the inst.method argument: 'baltagi' indicates that instruments are introduced with the within and the between transformations, 'amc' uses the set of instruments used by Amemiya and MaCurdy (1986), 'bmsc' the one used by Breusch et al. (1989), and 'bvk' indicates that the instrumental variables are transformed the same way as the covariates and the response, as proposed by Balestra and Varadharajan‐Krishnakumar (1987).

6.4 Estimation of a System of Equations

Instead of estimating only one equation, we can consider a whole system of simultaneous equations, in order to take into account the correlation between the errors of different equations. The estimator obtained is a mix of the 2SLS estimator described in the previous chapter and the SUR estimator (see 3.2.4).

6.4.1 The Three Stage Least Squares Estimator

When there is no correlation between the covariates and the error, the relevant model for the system of equations is the SUR model, which is a GLS estimator and is described in section 3.2. Denoting by images the matrix of covariance of the errors of the images equations, the variance of the errors of the system is images, and the SUR estimator is:

equation

This expression involves square matrices of dimensions equal to the sample size. It is therefore not operational for large samples, and it is numerically inefficient anyway. It is therefore preferred, as often happens for GLS estimators, to apply OLS on transformed data. Denoting by images the elements of the matrix images, each variable images of the model is transformed by pre‐multiplying it by: images. We then have:

equation

The three‐stage least squares estimator is obtained by using the moment conditions: images, for which the variance is: images. Consistently with the method of moments approach, the estimator is obtained by minimizing a quadratic form of the vector of moments, using the inverse of the variance matrix of these moments:

equation

First order conditions for a minimum are:

equation

Solving this linear system of equations, we obtain the 3SLS estimator:

The 3SLS estimator may be obtained by employing the instrumental variables estimator, pre‐multiplying the covariates and the response by images and the instruments by images. The instruments are then images and define the following projection matrix:

equation

But:

equation

We then have

equation

Using this projection matrix in the formula of the instrumental variables estimator 6.3 we finally get:

(6.13)equation

or

equation

which is the formula 6.12 of the 3SLS estimator. Of course, as in the GLS estimator, images is in practice unknown and shall be estimated based on the results from a consistent preliminary estimation.

The practical computation of the 3SLS estimator consists then of the following steps:

  • each equation is first estimated independently using the instrumental variables estimator, which leads to a matrix of residuals images which is a consistent estimate of the errors of the equations,
  • the covariance matrix of the errors of the system is then estimated: images
  • the Cholesky decomposition of this matrix is computed: images,
  • the variables are transformed using this matrix: images, images and images.
  • and finally the instrumental variables estimator is applied to the transformed system.

The computation of the within or between 3SLS estimators is straightforward, as it consists in applying the 3SLS to within or between transformed data.

6.4.2 The Error Components Three Stage Least Squares Estimator

Balestra and Varadharajan‐Krishnakumar (1987) and Baltagi (1981) have proposed 3SLS estimators that use the inter‐ and intra‐individual variations of the data in an optimal way.

From now, three indexes must be considered, the individual images and time indexes images as usual, but also the equation index images.

equation

Denoting by images, the error vector for individual images and equation images, the error vector for the system of equations is:

equation

The covariance matrix of the errors is then:

equation

The presence of individual effects makes this model specific compared to the standard 3SLS estimator. Compared to the standard error component model, scalars images and images are replaced by two covariance matrices images and images.

equation

The 3SLS estimator can then be computed the following way:

  • firstly, the different equations are estimated using 2SLS so that a consistent estimator of the matrix of the errors of the different equations images may be computed;
  • then, images and images are estimated by images and images,
  • covariates and responses are transformed by pre‐multiplying them by: images,
  • instrumental variables are transformed by pre‐multiplying them by: images,
  • the 2SLS estimator is then applied to the transformed data.

As for the 2SLS estimator, the difference between the estimators of Baltagi (1981) and Balestra and Varadharajan‐Krishnakumar (1987) is that the former uses the within and the between transformations of the instruments, while the latter uses a quasi‐difference transformation.

6.5 More Empirical Examples

Acconcia et al. (2014) seek to estimate the multiplier effect of public spending. This is a difficult task, as public spending can hardly be considered exogenous. They use a panel of 95 Italian administrative regions (provinces) for the years 1990‐1999 and take advantage of the implementation of anti‐mafia laws, which resulted in the eviction of some elected officials who were replaced by external commissioners. This replacement, which led to a drastic reduction in local public spending, represents an exogenous source of variation in public spending that can be usefully employed as instrument. Using a fixed effects 2SLS estimator, they estimate the long‐term public spending multiplier to be 1.95, a much larger value than the one obtained using the within estimator. The Mafia dataset is available in the pder package.

Egger and Pfaffermayr (2004) studied the determinants of bilateral trade of two countries, Germany and the United States, with their partners, bilateral trade being measured by imports and exports on the one hand, and by foreign direct investment on the other. The authors suspect that the individual effect, which indicates a propensity to trade with a given country for geographical and cultural reasons, is correlated with the distance. In this case, this variable, which is the only time‐invariant one, is certainly correlated with the individual effect. The authors use the estimator of Hausman and Taylor (1981) for each equation and also for the system of two equations. The data are provided as TradeFDI in the pder package.

Hutchison and Noy (2005) study the effects of twin crises, characterized by the simultaneous occurrence of a bank and a currency crisis, on the wealth of countries. The panel consists of 24 developing countries for the 1975‐1997 period. The response is the growth rate of the GDP and the two main covariates are the lag of the growth rate and a dummy variable indicating the occurrence of a twin crisis. Employing the lag of the growth rate as a covariate induces an endogeneity problem, which the authors tackle using an error component 2SLS estimator. The results indicate that the cost of a currency crisis is about 5‐8% in terms of growth every year for about 2‐4 years, while for the bank crisis this is about 8‐10%. The article doesn't provide any evidence of a specific effect of twin crises. The data are provided as TwinCrises in the pder package.

Cornwell and Trumbull (1994) and Baltagi (2006) estimate a crime economics model for the counties of North Carolina. The response is the criminality rate and, among the covariates, they introduce the probability of being arrested and the number of policemen per inhabitant. These two covariates induce an endogeneity problem: one actually wants to estimate the causal effect of police on crime, but a reverse causality effect is also likely, because more crime will induce the presence of more policemen. Two instrumental variables are used: the offense mix, which is defined as the ratio of crimes involving face‐to‐face contact to those that do not, and the per capita tax revenue. The first instrument is positively correlated with the probability of being arrested (because the offender may be identified by the victim). The second variable is positively correlated with the number of policemen, more tax income indicating a strong preference for public services and particularly for security. The 2SLS error component model indicates a much stronger effect of the probability of being arrested than for the other estimators, especially the within estimator. The data are provided as Crime in the plm package.

Baltagi and Khanti‐Akom (1990) and Cornwell and Rupert (1988) estimate a wage function using a panel of American individuals, with particular interest in the return to education. A well‐known problem of such studies is that unobserved characteristics of individuals, called abilities, are part of the individual effects and may be correlated with education. Using the within model, the education covariate disappears: the use of the estimator of Hausman and Taylor (1981) is therefore very relevant in this context. Two time‐invariant covariates (being black and being a female) are assumed exogenous, while the level of education is endogenous. Some other time‐varying covariates are assumed exogenous and therefore provide two instruments so that the model is identified. The coefficient of education from the Hausman and Taylor (1981) estimator is larger than the one obtained using GLS (0.14 vs 0.10). The data are provided as Wages in the plm package.

Notes

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.141.47.221