For unbalanced panels, the number of observations for each individual is now individual specific and denoted by $T_n$. We'll denote by $O = \sum_{n=1}^{N} T_n$ the total number of observations. Compared to the balanced panel case, three complications appear.
The model to be estimated can be written: $$y_{nt} = \alpha + \beta^\top x_{nt} + \eta_n + \nu_{nt}$$
The fixed effects model may be estimated by regressing $y$ on $X$ and the matrix of individual dummies $D$. As in the balanced panel case, the Frisch-Waugh theorem makes it possible to avoid the explicit estimation of the fixed effects. The estimation of $\beta$ may be obtained by regressing $y$ and $X$ on $D$ in a first stage, computing the residuals, and then regressing in the second stage the residuals of $y$ on those of $X$. As in the balanced panel case, these residuals are just the individual within transformation, i.e., $y_{nt} - \bar{y}_n$ or, in matrix form, $Wy$ with $W = I - D(D^\top D)^{-1}D^\top$, and the fixed effects model is simply obtained by regressing $Wy$ on $WX$.
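To make the procedure concrete, here is a minimal Python sketch of the within estimator on an unbalanced panel; the function name and the data layout (a vector of individual identifiers) are illustrative, not the book's:

```python
import numpy as np

def within_ols(y, X, ids):
    """Fixed effects (within) estimator on an unbalanced panel.

    y   : (O,) response, with O the total number of observations
    X   : (O, K) covariates
    ids : (O,) individual identifier of each observation
    """
    y_w = np.empty_like(y, dtype=float)
    X_w = np.empty_like(X, dtype=float)
    # Within transformation: subtract each individual's own mean,
    # computed over its T_n observations.
    for i in np.unique(ids):
        rows = ids == i
        y_w[rows] = y[rows] - y[rows].mean()
        X_w[rows] = X[rows] - X[rows].mean(axis=0)
    # OLS on the transformed data is the fixed effects estimator.
    beta, *_ = np.linalg.lstsq(X_w, y_w, rcond=None)
    return beta
```

The only change relative to the balanced case is that each individual mean is computed over that individual's own $T_n$ observations.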
For the GLS model, the covariance matrix of the errors is: $$\Omega = \mathrm{E}\left(\epsilon\epsilon^\top\right) = \sigma_\nu^2 I_O + \sigma_\eta^2 D D^\top$$
The GLS estimator writes: $$\hat\beta = \left(X^\top \Omega^{-1} X\right)^{-1} X^\top \Omega^{-1} y$$
$DD^\top$ is a block-diagonal matrix that contains square matrices of ones $J_{T_n}$ of dimension $T_n$. For balanced panels, $T_n = T$ and $DD^\top = I_N \otimes J_T$. Pre-multiplying a vector by $DD^\top$ returns the sum of the values of the variable for each individual. $\Omega$ is also a block-diagonal matrix, with blocks of the form: $$\Omega_n = \sigma_\nu^2 I_{T_n} + \sigma_\eta^2 J_{T_n} = \sigma_\nu^2 W_{T_n} + \sigma_{1n}^2 B_{T_n}$$ with $\sigma_{1n}^2 = \sigma_\nu^2 + T_n \sigma_\eta^2$, $B_{T_n} = J_{T_n}/T_n$ the between matrix, and $W_{T_n} = I_{T_n} - B_{T_n}$ the within matrix.
The inverse of a block-diagonal matrix being equal to a block-diagonal matrix for which the blocks are the inverses of those of the initial matrix, it is sufficient to calculate the inverse of $\Omega_n$. As it is a linear combination of two idempotent and orthogonal matrices, the general formula for any power of $\Omega_n$ is: $$\Omega_n^r = \sigma_\nu^{2r} W_{T_n} + \sigma_{1n}^{2r} B_{T_n}$$ In particular, the inverse is: $$\Omega_n^{-1} = \frac{1}{\sigma_\nu^2} W_{T_n} + \frac{1}{\sigma_{1n}^2} B_{T_n}$$
which can also be written as $\frac{1}{\sigma_\nu^2}\left(W_{T_n} + \phi_n^2 B_{T_n}\right)$, with: $$\phi_n^2 = \frac{\sigma_\nu^2}{\sigma_{1n}^2} = \frac{\sigma_\nu^2}{\sigma_\nu^2 + T_n \sigma_\eta^2}$$
The GLS estimator may then be obtained by applying OLS on variables that have been transformed by pre-multiplying them by $\Omega^{-1/2}$ or, equivalently, by $\sigma_\nu \Omega^{-1/2}$ (which will simplify notation): $$\sigma_\nu \Omega_n^{-1/2} = W_{T_n} + \phi_n B_{T_n} = I_{T_n} - \left(1 - \phi_n\right) B_{T_n}$$
As in the balanced case, the transformed data can be expressed as quasi-differences, $y_{nt} - \theta_n \bar{y}_n$, with: $$\theta_n = 1 - \phi_n = 1 - \frac{\sigma_\nu}{\sqrt{\sigma_\nu^2 + T_n \sigma_\eta^2}}$$
the only difference being that now, the proportion of the individual mean that is removed is not a constant, as it depends on the number of observations for each individual.
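A sketch of the corresponding quasi-demeaning, assuming the variance components have already been estimated (names and data layout are again illustrative):

```python
import numpy as np

def gls_transform(v, ids, sigma_nu2, sigma_eta2):
    """Quasi-difference a variable for unbalanced one-way GLS.

    For each individual n with T_n observations, subtracts
    theta_n = 1 - sigma_nu / sqrt(sigma_nu^2 + T_n * sigma_eta^2)
    times the individual mean.
    """
    out = np.empty_like(v, dtype=float)
    for i in np.unique(ids):
        rows = ids == i
        T_n = rows.sum()
        theta_n = 1 - np.sqrt(sigma_nu2 / (sigma_nu2 + T_n * sigma_eta2))
        out[rows] = v[rows] - theta_n * v[rows].mean()
    return out
```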
For the two-ways error component model, we have: $$\epsilon_{nt} = \eta_n + \mu_t + \nu_{nt}$$ or, in matrix form: $$\epsilon = D_1 \eta + D_2 \mu + \nu$$
where $D_1$ and $D_2$ are matrices of respectively individual and time dummies. Pre-multiplying a vector by $D_1^\top$ and $D_2^\top$ returns, respectively, the individual and time sums of the variable.
$D_1^\top D_1$ and $D_2^\top D_2$ are two diagonal matrices that contain the number of observations for each individual and each time period. Pre-multiplying a vector by $\left(D_1^\top D_1\right)^{-1} D_1^\top$ or by $\left(D_2^\top D_2\right)^{-1} D_2^\top$ returns, respectively, the individual and the time-series means. Finally, $D_1^\top D_2$ is a matrix of ones and zeros, which indicates whether an observation for a specific individual and time period is present or not.
To help visualize these matrices, we consider a panel with 3 individuals and 4 periods; the panel is unbalanced, as the first individual is not observed in the third and fourth periods, and the third one is not observed in the first period. A small numerical illustration is given below.
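The following numpy sketch builds the dummy matrices for this 3-individual, 4-period example and displays the cross-products just described:

```python
import numpy as np

# Unbalanced panel: individual 1 observed in periods 1-2,
# individual 2 in periods 1-4, individual 3 in periods 2-4.
ind = np.array([1, 1, 2, 2, 2, 2, 3, 3, 3])
per = np.array([1, 2, 1, 2, 3, 4, 2, 3, 4])

D1 = (ind[:, None] == np.unique(ind)).astype(int)  # individual dummies (O x N)
D2 = (per[:, None] == np.unique(per)).astype(int)  # time dummies (O x T)

print(D1.T @ D1)  # diagonal: observations per individual (2, 4, 3)
print(D2.T @ D2)  # diagonal: observations per period (2, 3, 2, 2)
print(D1.T @ D2)  # N x T incidence matrix of ones and zeros
```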
The fixed effects model can be estimated by regressing $y$ on $X$ and the two matrices of dummies $D_1$ and $D_2$ associated with the effects vectors $\eta$ and $\mu$.
The application of the Frisch-Waugh theorem implies that the estimation can be performed by regressing $y$, $X$, and $D_2$ on $D_1$ in a first stage and then, in a second stage, by regressing the residuals of $y$ on those of $X$ and $D_2$, which means regressing $W_1 y$ on $W_1 X$ and $W_1 D_2$, with $W_1 = I - D_1\left(D_1^\top D_1\right)^{-1} D_1^\top$.
Applying the same theorem again, one can regress $W_1 y$ and $W_1 X$ on $W_1 D_2$ in the first stage, and the residuals of $W_1 y$ on those of $W_1 X$ in the second stage.
Residuals of a regression on $W_1 D_2$ are obtained by pre-multiplying the variables by the matrix: $$I - W_1 D_2 \left(D_2^\top W_1 D_2\right)^{-} D_2^\top W_1$$
where, for any matrix $A$, $A^{-}$ is the generalized inverse of $A$. Finally, the two-ways error component fixed effects model may be obtained by applying to $y$ and every column of $X$ the double-within transformation which, for unbalanced panels, consists in multiplying any data vector by the following matrix: $$W = W_1 - W_1 D_2 \left(D_2^\top W_1 D_2\right)^{-} D_2^\top W_1$$
Therefore, the two-ways fixed effects model is still easy to compute even if the panel is unbalanced: all that is required is OLS estimation and the computation of deviations from the individual means. One proceeds as follows: the within transformation is first performed on the response, the covariates, and the time dummies; preliminary linear regressions on the transformed time dummies are then performed before the final estimation, for which there are $K$ covariates. A sketch of this computation follows the next remark.
Note that no specific matrix computation is required and that, in particular, the matrix of individual dummies, which is often very large ($O \times N$), need not be stored during the estimation.
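The sketch announced above: a minimal Python implementation of this two-step procedure, assuming the same data layout as before (the $O \times N$ matrix of individual dummies is never formed):

```python
import numpy as np

def twoway_within(y, X, ind, per):
    """Two-way fixed effects on an unbalanced panel: within-transform
    on individuals, partial out the transformed time dummies, then OLS.
    The O x N matrix of individual dummies is never formed."""
    D2 = (per[:, None] == np.unique(per)).astype(float)  # time dummies (O x T)

    def demean(v):  # individual within transformation
        out = v.astype(float)
        for i in np.unique(ind):
            rows = ind == i
            out[rows] -= out[rows].mean(axis=0)
        return out

    yw, Xw, D2w = demean(y), demean(X), demean(D2)
    # Frisch-Waugh: partial the transformed time dummies out of y and X;
    # lstsq copes with the rank deficiency of D2w (generalized inverse).
    yd = yw - D2w @ np.linalg.lstsq(D2w, yw, rcond=None)[0]
    Xd = Xw - D2w @ np.linalg.lstsq(D2w, Xw, rcond=None)[0]
    return np.linalg.lstsq(Xd, yd, rcond=None)[0]
```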
The variance matrix of the errors is: $$\Omega = \mathrm{E}\left(\epsilon\epsilon^\top\right) = \sigma_\nu^2 I_O + \sigma_\eta^2 D_1 D_1^\top + \sigma_\mu^2 D_2 D_2^\top$$
Denote $\Omega_\eta = \sigma_\nu^2 I_O + \sigma_\eta^2 D_1 D_1^\top$ the covariance matrix of the errors of the individual one-way error components model. We then have: $$\Omega = \Omega_\eta + \sigma_\mu^2 D_2 D_2^\top$$
$\Omega_\eta$ is block-diagonal, with blocks: $\sigma_\nu^2 W_{T_n} + \sigma_{1n}^2 B_{T_n}$. $W_{T_n}$ and $B_{T_n}$ being idempotent and orthogonal, the matrix $\Omega_\eta^{-1/2}$ (defined so that $\Omega_\eta^{-1/2}\Omega_\eta^{-1/2} = \Omega_\eta^{-1}$) is also a block-diagonal matrix with blocks: $\frac{1}{\sigma_\nu} W_{T_n} + \frac{1}{\sigma_{1n}} B_{T_n}$. We then have: $$\Omega = \Omega_\eta^{1/2}\left[I_O + \sigma_\mu^2\, \Omega_\eta^{-1/2} D_2 D_2^\top \Omega_\eta^{-1/2}\right]\Omega_\eta^{1/2}$$ for which the inverse is: $$\Omega^{-1} = \Omega_\eta^{-1/2}\left[I_O + \sigma_\mu^2\, \Omega_\eta^{-1/2} D_2 D_2^\top \Omega_\eta^{-1/2}\right]^{-1}\Omega_\eta^{-1/2}$$
We then apply the Woodbury identity, $\left(A + BCB^\top\right)^{-1} = A^{-1} - A^{-1}B\left(C^{-1} + B^\top A^{-1} B\right)^{-1}B^\top A^{-1}$, to the matrix in brackets. Finally, we have: $$\Omega^{-1} = \Omega_\eta^{-1} - \Omega_\eta^{-1} D_2 \left[\frac{1}{\sigma_\mu^2} I_T + D_2^\top \Omega_\eta^{-1} D_2\right]^{-1} D_2^\top \Omega_\eta^{-1}$$
and the GLS estimator is: $$\hat\beta = \left(X^\top \Omega^{-1} X\right)^{-1} X^\top \Omega^{-1} y$$
Let $\tilde{X}$ and $\tilde{D}_2$ be the matrices of the covariates and of the time dummies measured in quasi-differences from the individual means, i.e., pre-multiplied by $\sigma_\nu \Omega_\eta^{-1/2}$. We then have: $$X^\top \Omega^{-1} X = \frac{1}{\sigma_\nu^2}\left(\tilde{X}^\top \tilde{X} - \tilde{X}^\top \tilde{D}_2\left[\frac{\sigma_\nu^2}{\sigma_\mu^2} I_T + \tilde{D}_2^\top \tilde{D}_2\right]^{-1} \tilde{D}_2^\top \tilde{X}\right)$$
and a similar expression for $X^\top \Omega^{-1} y$. With the two matrices $\tilde{X}$ and $\tilde{D}_2$ in hand, the computation of the estimator only requires products of these matrices and the inversion of a $T \times T$ matrix. These are reasonable computational tasks: note especially that the matrix of individual dummies need not be stored, and that the dimension of the matrix that has to be inverted is $T \times T$, not $O \times O$ or $N \times N$; at least for micro-panels, $T$ is relatively small. Note also that the computation of the GLS estimator requires explicit matrix operations: it can no longer be obtained as a series of linear regressions on transformed data.
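As a numerical sanity check of this inversion strategy, the following sketch builds the two-way $\Omega$ for the small example above and verifies that the Woodbury-based formula, which only inverts a $T \times T$ matrix, reproduces the direct inverse (the variance values are arbitrary):

```python
import numpy as np

# Small unbalanced two-way panel (from the example above): build D1, D2.
ind = np.array([1, 1, 2, 2, 2, 2, 3, 3, 3])
per = np.array([1, 2, 1, 2, 3, 4, 2, 3, 4])
D1 = (ind[:, None] == np.unique(ind)).astype(float)
D2 = (per[:, None] == np.unique(per)).astype(float)
O, T = D2.shape

s_nu2, s_eta2, s_mu2 = 1.0, 0.5, 0.3
Omega_eta = s_nu2 * np.eye(O) + s_eta2 * D1 @ D1.T   # one-way component
Omega = Omega_eta + s_mu2 * D2 @ D2.T                # full two-way matrix

# Woodbury: only a T x T matrix has to be inverted
# (Omega_eta is block-diagonal, so its inverse is cheap in practice).
A_inv = np.linalg.inv(Omega_eta)
core = np.linalg.inv(np.eye(T) / s_mu2 + D2.T @ A_inv @ D2)
Omega_inv = A_inv - A_inv @ D2 @ core @ D2.T @ A_inv

print(np.allclose(Omega_inv, np.linalg.inv(Omega)))  # True
```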
Remember that, in the balanced panel case, we used the result that natural estimators of $\sigma_\nu^2$ and $\sigma_\eta^2$ were based on the within and between quadratic forms of the errors. Feasible estimates were obtained by replacing the errors by the residuals from a consistent estimation. For the balanced case, $N(T-1)$ and $N$ were natural denominators. This is no longer the case when the panel is unbalanced, as $T_n$ is not the same for all individuals (and the number of individuals observed, $N_t$, is not the same for all time periods).
The strategy used here consists in computing the expected values of the quadratic forms in order to obtain unbiased estimators of the variance components.
Different estimators are obtained using different preliminary models to obtain the residuals, among the numerous possible choices previously seen in chapter 2.
The model and its estimation are: $$y = \alpha \iota + X\beta + \epsilon, \qquad \hat{y} = \hat\alpha \iota + X\hat\beta$$ The intercept can be removed by pre-multiplying every element of the model by $I_O - J_O/O$ (with $J_O$ the $O \times O$ matrix of ones), which subtracts from every variable its overall mean and therefore removes the intercept, as $\left(I_O - J_O/O\right)\iota = 0$.
Subtracting the expression of the model and that of its estimation, we get: $$\hat\epsilon = \epsilon - X\left(\hat\beta - \beta\right)$$ with $y$ and $X$ now measured in deviations from their overall means. The three estimators we use (OLS, within, and between) can be seen as GLS estimators of this model, with a weighting matrix $A$ being equal, respectively, to $I$, $W$, and $B$: $$\hat\beta_A = \left(X^\top A X\right)^{-1} X^\top A y$$
Using the two previous expressions, we get $\hat\epsilon = M_A \epsilon$, with: $$M_A = I - X\left(X^\top A X\right)^{-1} X^\top A$$ $M_A$ is the matrix that transforms the error vector into the residuals vector. Note that it is not a symmetric matrix, at least unless $A = I$ (which corresponds to the pooling model). The quadratic form of the residuals with a matrix $A$ is: $$q_A = \hat\epsilon^\top A \hat\epsilon = \epsilon^\top M_A^\top A M_A \epsilon$$
$q_A$ being a scalar, it is also equal to its trace: $$q_A = \mathrm{tr}\left(\epsilon^\top M_A^\top A M_A \epsilon\right)$$ Using the cyclic property of the trace operator, we get: $$q_A = \mathrm{tr}\left(M_A^\top A M_A \epsilon\epsilon^\top\right)$$ from which, taking expectations, we obtain: $$\mathrm{E}\left(q_A\right) = \mathrm{tr}\left(M_A^\top A M_A \Omega\right)$$ with $\Omega = \mathrm{E}\left(\epsilon\epsilon^\top\right)$.
Finally, replacing $\Omega$ by its expression, we get an expected value that is linear in the variance components: $$\mathrm{E}\left(q_A\right) = \sigma_\nu^2\, \mathrm{tr}\left(M_A^\top A M_A\right) + \sigma_\eta^2\, \mathrm{tr}\left(M_A^\top A M_A D_1 D_1^\top\right) + \sigma_\mu^2\, \mathrm{tr}\left(M_A^\top A M_A D_2 D_2^\top\right)$$
The most common estimators are obtained by considering the quadratic forms with the within, between-individual, and between-time matrices. Equating each of the three quadratic forms to its expected value yields a system of three linear equations in $\sigma_\nu^2$, $\sigma_\eta^2$, and $\sigma_\mu^2$, whose coefficients are the traces above; these can be evaluated using standard results on the projection matrices involved.
The estimator is obtained by equating each quadratic form to its expected value and solving the resulting system for the variance components, as in the sketch below.
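A compact illustration for the one-way case (the two-way case adds a third quadratic form and a third trace column): the expected-value coefficients are computed as traces, and the resulting linear system is solved for the variance components. The simulation setup is ours:

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated unbalanced one-way panel: sigma_eta^2 = 1, sigma_nu^2 = 0.49.
ind = np.repeat(np.arange(20), rng.integers(3, 8, size=20))
O = ind.size
D = (ind[:, None] == np.unique(ind)).astype(float)
X = np.column_stack([np.ones(O), rng.normal(size=O)])
y = X @ np.array([1.0, 2.0]) + D @ rng.normal(size=20) + 0.7 * rng.normal(size=O)

# Residuals of the preliminary (pooled OLS) estimation: e = M eps.
M = np.eye(O) - X @ np.linalg.solve(X.T @ X, X.T)
e = M @ y

B = D @ np.linalg.solve(D.T @ D, D.T)  # between-individual matrix
W = np.eye(O) - B                      # within matrix
DDt = D @ D.T

# E(e'Ae) = s_nu2 tr(M'AM) + s_eta2 tr(M'AM DD'): equate and solve.
coefs, q = [], []
for A in (W, B):
    Phi = M.T @ A @ M
    coefs.append([np.trace(Phi), np.trace(Phi @ DDt)])
    q.append(e @ A @ e)
s_nu2, s_eta2 = np.linalg.solve(np.array(coefs), np.array(q))
print(s_nu2, s_eta2)  # unbiased estimates of 0.49 and 1.0
```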
The matrices corresponding to the three most common estimators are presented in Figure 3.1.
Very often in economics, the phenomenon under investigation is not well described by a single equation but by a system of equations. This is particularly the case in the field of the micro-econometrics of consumption or production. For example, the behavior of a producer is described by a minimum cost equation along with equations of factor demand. In this case, there are two advantages in considering the whole system of equations rather than each equation in isolation.
Linear restrictions on the vector of coefficients to be estimated can be represented using a restriction matrix $R$ and a numeric vector $q$: $$R\beta = q$$ For example, if the sum of the first two coefficients must equal 1 and the first and third ones should be equal, the joint restrictions can be written as: $$\begin{pmatrix} 1 & 1 & 0 & \cdots & 0 \\ 1 & 0 & -1 & \cdots & 0 \end{pmatrix} \beta = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$$
To estimate the constrained OLS estimator, we write the Lagrangian: $$\mathcal{L} = \left(y - X\beta\right)^\top\left(y - X\beta\right) + 2\lambda^\top\left(R\beta - q\right)$$ with $\lambda$ the vector of Lagrange multipliers associated with the different constraints. The Lagrangian can also be written as: $$\mathcal{L} = y^\top y - 2\beta^\top X^\top y + \beta^\top X^\top X \beta + 2\lambda^\top\left(R\beta - q\right)$$
The first-order conditions become: $$X^\top X \beta + R^\top \lambda = X^\top y, \qquad R\beta = q$$ which can also be written in matrix form: $$\begin{pmatrix} X^\top X & R^\top \\ R & 0 \end{pmatrix} \begin{pmatrix} \beta \\ \lambda \end{pmatrix} = \begin{pmatrix} X^\top y \\ q \end{pmatrix}$$
The constrained OLS estimator can be obtained using the formula for the inverse of a partitioned matrix (see equation 2.18), applied here with the blocks $X^\top X$, $R^\top$, $R$, and $0$. The constrained estimator is then: $$\hat\beta_c = \hat\beta + \left(X^\top X\right)^{-1} R^\top \left[R \left(X^\top X\right)^{-1} R^\top\right]^{-1} \left(q - R\hat\beta\right)$$
The unconstrained estimator being $\hat\beta = \left(X^\top X\right)^{-1} X^\top y$, we finally get: $$\hat\beta_c - \hat\beta = -\left(X^\top X\right)^{-1} R^\top \left[R \left(X^\top X\right)^{-1} R^\top\right]^{-1} \left(R\hat\beta - q\right)$$ The difference between the constrained and the unconstrained estimators is then a linear combination of $R\hat\beta - q$, the amount by which the unconstrained estimator fails to satisfy the linear constraints.
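Both routes, the partitioned linear system of first-order conditions and the closed-form correction, are easy to check numerically; the data-generating process below is ours:

```python
import numpy as np

rng = np.random.default_rng(2)
n, K = 200, 3
X = rng.normal(size=(n, K))
y = X @ np.array([0.6, 0.4, 0.6]) + rng.normal(size=n)

# Restrictions: b1 + b2 = 1 and b1 = b3, i.e., R b = q.
R = np.array([[1.0, 1.0, 0.0],
              [1.0, 0.0, -1.0]])
q = np.array([1.0, 0.0])
J = R.shape[0]

# Solve the first-order conditions as one partitioned linear system.
K_mat = np.block([[X.T @ X, R.T],
                  [R, np.zeros((J, J))]])
rhs = np.concatenate([X.T @ y, q])
beta_c = np.linalg.solve(K_mat, rhs)[:K]

# Closed form: correct the unconstrained estimator for the violation R b - q.
XtX_inv = np.linalg.inv(X.T @ X)
beta_u = XtX_inv @ X.T @ y
beta_c2 = beta_u - XtX_inv @ R.T @ np.linalg.solve(R @ XtX_inv @ R.T,
                                                   R @ beta_u - q)
print(np.allclose(beta_c, beta_c2), R @ beta_c - q)  # True, ~[0, 0]
```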
We consider a system of $L$ equations denoted $y_l = X_l \beta_l + \epsilon_l$, with $l = 1, \ldots, L$. In matrix form, the system can be written as follows: $$\begin{pmatrix} y_1 \\ \vdots \\ y_L \end{pmatrix} = \begin{pmatrix} X_1 & & 0 \\ & \ddots & \\ 0 & & X_L \end{pmatrix} \begin{pmatrix} \beta_1 \\ \vdots \\ \beta_L \end{pmatrix} + \begin{pmatrix} \epsilon_1 \\ \vdots \\ \epsilon_L \end{pmatrix}$$
The covariance matrix of the errors of the system is $\Omega = \mathrm{E}\left(\epsilon\epsilon^\top\right)$. We suppose that the errors of two equations $l$ and $m$ for the same observation are correlated and that the covariance, denoted by $\sigma_{lm}$, is constant, so that $\mathrm{E}\left(\epsilon_l \epsilon_m^\top\right) = \sigma_{lm} I$. Denoting by $\Sigma$ the $L \times L$ matrix of inter-equation covariances, we have: $$\Omega = \Sigma \otimes I$$
Because of the inter-equation correlations, the efficient estimator is the GLS estimator: $\hat\beta = \left(X^\top \Omega^{-1} X\right)^{-1} X^\top \Omega^{-1} y$. This estimator, first proposed by Zellner (1962), is known by the acronym SUR, for seemingly unrelated regression. It can be obtained by applying OLS on transformed data, each variable being pre-multiplied by $\Omega^{-1/2}$. This matrix is simply $\Sigma^{-1/2} \otimes I$. Denoting by $\sigma^{lm}$ the elements of $\Sigma^{-1/2}$, the transformed response and covariates are: $$y_l^* = \sum_{m=1}^{L} \sigma^{lm} y_m, \qquad X_l^* = \sum_{m=1}^{L} \sigma^{lm} X_m$$
$\Sigma$ is a matrix that contains $L(L+1)/2$ unknown parameters, which can be estimated using the residuals of a consistent but inefficient preliminary estimator, like OLS. The efficient estimator is then obtained by estimating each equation by OLS, estimating $\Sigma$ from the resulting residuals, and applying GLS with this estimate.
$\Sigma^{-1/2}$ can conveniently be computed using the Cholesky decomposition, i.e., computing the lower-triangular matrix $C$ such that $\Sigma = C C^\top$; $C^{-1}$ is then a valid square root of $\Sigma^{-1}$, as $C^{-\top} C^{-1} = \Sigma^{-1}$.
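A minimal sketch of this two-step procedure (OLS per equation, then FGLS on the stacked system); for clarity it forms the Kronecker product explicitly rather than transforming the variables with $\Sigma^{-1/2}$:

```python
import numpy as np

def sur(ys, Xs):
    """Two-step SUR (Zellner 1962): OLS per equation, then FGLS.

    ys : list of L response vectors (each of length n)
    Xs : list of L covariate matrices
    """
    L, n = len(ys), ys[0].size
    # Step 1: OLS equation by equation, keep the residuals.
    res = np.column_stack([
        y - X @ np.linalg.lstsq(X, y, rcond=None)[0]
        for y, X in zip(ys, Xs)
    ])
    Sigma = res.T @ res / n              # estimated inter-equation covariances
    # Step 2: GLS on the stacked system, Omega^{-1} = Sigma^{-1} kron I.
    X = np.zeros((L * n, sum(Xk.shape[1] for Xk in Xs)))
    col = 0
    for k, Xk in enumerate(Xs):          # block-diagonal stacked covariates
        X[k * n:(k + 1) * n, col:col + Xk.shape[1]] = Xk
        col += Xk.shape[1]
    y = np.concatenate(ys)
    Oinv = np.kron(np.linalg.inv(Sigma), np.eye(n))
    return np.linalg.solve(X.T @ Oinv @ X, X.T @ Oinv @ y)
```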
Applying the SUR estimator to panel data is straightforward when only the between or the within variability of the data is taken into account. In this case, one just has to apply the above formula using the variables in individual means (between-SUR) or in deviations from individual means (within-SUR). Taking into account both sources of variability requires more attention and leads to the SUR error component model proposed by Avery (1977) and Baltagi (1980). The errors of the model then present two sources of correlation: the correlation between the errors of different equations for the same observation, and the correlation, within a given equation, between the errors of different observations of the same individual.
Every observation is now characterized by three indexes: $y_{lnt}$ is the observation of $y$ for equation $l$, individual $n$, and period $t$. The observations are first ordered by equation, then by individual. Denoting $\epsilon_{ln}$ the error vector for equation $l$ and individual $n$, one gets: $$\epsilon_{ln} = \eta_{ln}\, \iota + \nu_{ln}$$
The errors concerning different individuals being uncorrelated, the covariance matrix for two equations $l$ and $m$ and all individuals is: $$\mathrm{E}\left(\epsilon_l \epsilon_m^\top\right) = \sigma_{\eta lm} \left(I_N \otimes J_T\right) + \sigma_{\nu lm}\, I_{NT}$$
Finally, for the whole system of equations, denoting $\Sigma_\eta$ and $\Sigma_\nu$ the two matrices of dimension $L \times L$ containing the parameters $\sigma_{\eta lm}$ and $\sigma_{\nu lm}$, the covariance matrix of the errors is: $$\Omega = \Sigma_\eta \otimes \left(I_N \otimes J_T\right) + \Sigma_\nu \otimes I_{NT}$$
The SUR error component model may be obtained by applying OLS on transformed data, every variable being pre-multiplied by $\Omega^{-1/2}$. Writing $\Sigma_1 = \Sigma_\nu + T \Sigma_\eta$, the two required matrix square roots, $\Sigma_\nu^{-1/2}$ and $\Sigma_1^{-1/2}$, may be estimated using the Cholesky decompositions of $\Sigma_\nu$ and $\Sigma_1$ (see Kinal and Lahiri, 1990).
The two error covariance matrices being unknown, the error-component SUR estimator is obtained by first estimating every equation with a consistent preliminary estimator, then estimating $\Sigma_\eta$ and $\Sigma_\nu$ from the residuals, and finally applying GLS using these estimates.
Different choices of preliminary estimates lead to different SUR error component estimators. For example, Baltagi (1980) used the method of Amemiya (1971), while Avery (1977) chose that of Swamy and Arora (1972).
An alternative to the GLS estimator presented in the previous chapter is the maximum likelihood estimator. Contrary to the GLS estimator, the parameters are not estimated sequentially (first the variance components and then $\beta$) but simultaneously.
In order to write the likelihood of the model, the distribution of the errors must be perfectly characterized; compared with the GLS model, we must then add a hypothesis concerning the distribution of the two components of the error term, the individual and the idiosyncratic effects: we'll suppose that they are both normally distributed. The likelihood is the joint density for the whole sample, which is the product of the individual densities in the case of a random sample. This is not the case here, as the observations of a given individual are correlated because of the common individual effect. The model to be estimated is then: $$y_{nt} = \alpha + \beta^\top x_{nt} + \eta_n + \nu_{nt}$$
with $\eta_n \sim N\left(0, \sigma_\eta^2\right)$ and $\nu_{nt} \sim N\left(0, \sigma_\nu^2\right)$. For a given value of the individual effect, $\eta_n$, the density for $y_{nt}$ is: $$f\left(y_{nt} \mid \eta_n\right) = \frac{1}{\sigma_\nu \sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{y_{nt} - \alpha - \beta^\top x_{nt} - \eta_n}{\sigma_\nu}\right)^2}$$
For a given value of $\eta_n$, the distribution of $y_n$ is that of a vector of independent random deviates, and the joint conditional density is therefore the product of the individual densities: $$f\left(y_n \mid \eta_n\right) = \prod_{t=1}^{T_n} f\left(y_{nt} \mid \eta_n\right)$$
The unconditional distribution is obtained by integrating out the individual effect $\eta_n$, which means that the mean value of the density is computed over all possible values of $\eta_n$: $$f\left(y_n\right) = \int_{-\infty}^{+\infty} f\left(y_n \mid \eta\right) f(\eta)\, d\eta$$
The integrand is, up to a multiplicative constant, a Gaussian density in $\eta$; completing the square in the exponent and integrating $\eta$ out, the joint density for an individual is finally: $$f\left(y_n\right) = \left(2\pi\right)^{-T_n/2} \left|\Omega_n\right|^{-1/2} e^{-\frac{1}{2}\, \epsilon_n^\top \Omega_n^{-1} \epsilon_n}$$ with $\epsilon_n = y_n - \alpha\iota - X_n\beta$ and $\Omega_n = \sigma_\nu^2 I_{T_n} + \sigma_\eta^2 J_{T_n}$.
The contribution of the $n$-th individual to the log-likelihood function is simply the logarithm of the joint density: $$\ln L_n = -\frac{T_n}{2}\ln 2\pi - \frac{1}{2}\ln\left|\Omega_n\right| - \frac{1}{2}\, \epsilon_n^\top \Omega_n^{-1} \epsilon_n$$ The log-likelihood function is then obtained by summing over all the individuals of the panel: $$\ln L = \sum_{n=1}^{N} \ln L_n$$
or, more simply in the special case of a balanced panel, denoting $\phi^2 = \sigma_\nu^2 / \left(\sigma_\nu^2 + T \sigma_\eta^2\right)$: $$\ln L = -\frac{NT}{2}\ln\left(2\pi\sigma_\nu^2\right) + \frac{N}{2}\ln\phi^2 - \frac{1}{2\sigma_\nu^2}\sum_{n=1}^{N} \epsilon_n^\top\left(W_T + \phi^2 B_T\right)\epsilon_n$$ Note also that $\left|\Omega_n\right| = \sigma_\nu^{2T}/\phi^2$ in this case.
The first derivatives of the log-likelihood with respect to $\beta$, $\sigma_\nu^2$, and $\phi^2$ are then obtained (equations 3.9, 3.10, and 3.11).
Solving 3.9, we obtain the GLS estimator of $\beta$ for a given value of the transformation parameter $\phi$ (equation 3.12).
The estimator of $\sigma_\nu^2$ is simply obtained, using 3.10, as the residual variance of the model estimated on the transformed data (equation 3.13).
Finally, using 3.11 and 3.13, the transformation parameter $\phi$ is obtained (equation 3.14).
The estimation can be performed iteratively. Starting from an estimator of $\beta$ (for example the within estimator), we calculate $\phi$ using the formula given by 3.14. We then transform the response and the covariates using this estimator of $\phi$ and compute a second estimation of $\beta$ using 3.12. These computations are repeated until the convergence of $\beta$ and $\phi$. $\sigma_\nu^2$ is then estimated using 3.13.
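Since the intermediate formulas 3.9 to 3.14 are not reproduced here, the sketch below takes the direct route instead: it codes the unbalanced one-way log-likelihood itself (using $\ln|\Omega_n| = (T_n - 1)\ln\sigma_\nu^2 + \ln\sigma_{1n}^2$ and the within/between split of the quadratic form) and leaves the maximization to a generic optimizer; the iterative scheme of the text should converge to the same values. All names are ours, and scipy is assumed to be available:

```python
import numpy as np
from scipy.optimize import minimize

def re_loglik(params, y, X, ids):
    """Log-likelihood of the one-way random effects model
    (normal eta and nu), valid for unbalanced panels."""
    K = X.shape[1]
    beta = params[:K]
    # Parameterize the variances in logs to keep them positive.
    s_nu2, s_eta2 = np.exp(params[K]), np.exp(params[K + 1])
    ll = 0.0
    for i in np.unique(ids):
        e = y[ids == i] - X[ids == i] @ beta
        T_n = e.size
        s1_2 = s_nu2 + T_n * s_eta2
        ebar = e.mean()
        within = ((e - ebar) ** 2).sum()
        # ln|Omega_n| = (T_n - 1) ln s_nu2 + ln s1_2, and
        # e' Omega_n^{-1} e = within / s_nu2 + T_n ebar^2 / s1_2.
        ll -= 0.5 * (T_n * np.log(2 * np.pi) + (T_n - 1) * np.log(s_nu2)
                     + np.log(s1_2) + within / s_nu2 + T_n * ebar ** 2 / s1_2)
    return ll

# p0 stacks starting values for beta and the two log-variances,
# e.g. taken from the within estimator:
# fit = minimize(lambda p: -re_loglik(p, y, X, ids), p0, method="BFGS")
```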
The nested random effects model is relevant when the individuals can be put together in different groups. For example, with a panel of firms, groups may be constituted by regions or production sectors.
In this chapter, we'll restrict ourselves to panels where each individual belongs to exactly one group and where the panel is balanced within each group. The number of individuals and the length of the time series may however differ from one group to another. This is why this model, presented in Baltagi et al. (2001), is called the unbalanced nested error component model, even if its unbalancedness must be understood in the very restrictive sense we've just described.
Three effects will now be considered: the usual individual and idiosyncratic effects, but also a new one that represents the group effects, $\lambda$. Denoting by $D_g$ the matrix of group dummies, the errors write: $$\epsilon = D_1 \eta + D_g \lambda + \nu$$ $\Omega$ is block-diagonal with $G$ (the number of groups) blocks of the following shape, for a group $g$ containing $N_g$ individuals observed over $T_g$ periods: $$\Omega_g = \sigma_\nu^2 I_{N_g T_g} + \sigma_\eta^2 \left(I_{N_g} \otimes J_{T_g}\right) + \sigma_\lambda^2 J_{N_g T_g}$$
Rewriting $I_{N_g T_g}$ and the two matrices of ones in terms of within and between projections, this can be expressed as a linear combination of three symmetric, idempotent, and orthogonal matrices which sum to $I_{N_g T_g}$, with $\sigma_{1g}^2 = \sigma_\nu^2 + T_g \sigma_\eta^2$ and $\sigma_{2g}^2 = \sigma_{1g}^2 + N_g T_g \sigma_\lambda^2$: $$\Omega_g = \sigma_\nu^2 W_g + \sigma_{1g}^2 B_{1g} + \sigma_{2g}^2 B_{2g}$$ where: $$W_g = I_{N_g T_g} - I_{N_g} \otimes \bar{J}_{T_g}, \qquad B_{1g} = I_{N_g} \otimes \bar{J}_{T_g} - \bar{J}_{N_g T_g}, \qquad B_{2g} = \bar{J}_{N_g T_g}$$ with $\bar{J}$ denoting a matrix of ones divided by its dimension.
This expression makes it easy to find the expression for $\Omega_g^{-1/2}$: $$\sigma_\nu \Omega_g^{-1/2} = W_g + \frac{\sigma_\nu}{\sigma_{1g}} B_{1g} + \frac{\sigma_\nu}{\sigma_{2g}} B_{2g}$$ which finally writes: $$y_{gnt}^* = y_{gnt} - \theta_{1g}\, \bar{y}_{gn.} - \theta_{2g}\, \bar{y}_{g..}$$ with $\theta_{1g} = 1 - \sigma_\nu / \sigma_{1g}$ and $\theta_{2g} = \sigma_\nu / \sigma_{1g} - \sigma_\nu / \sigma_{2g}$.
The model can therefore be estimated by OLS on transformed variables from which part of the individual and the group means (respectively $\theta_{1g}$ and $\theta_{2g}$) have been subtracted, as in the sketch below.
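A sketch of this transformation, taking the group-specific parameters $\theta_{1g}$ and $\theta_{2g}$ as given (e.g., computed from estimated variance components as above); names and data layout are illustrative:

```python
import numpy as np

def nested_transform(v, ind, grp, theta1, theta2):
    """Quasi-difference a variable for the nested error components model.

    theta1[g] and theta2[g] are the group-specific transformation
    parameters, computed elsewhere from the estimated variances; part
    of the individual mean and of the group mean is subtracted.
    """
    out = v.astype(float)
    ind_mean = {i: v[ind == i].mean() for i in np.unique(ind)}
    grp_mean = {g: v[grp == g].mean() for g in np.unique(grp)}
    for k in range(v.size):
        g = grp[k]
        out[k] -= theta1[g] * ind_mean[ind[k]] + theta2[g] * grp_mean[g]
    return out
```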
We proceed along the lines of section 3.1.3. Using the residuals from a preliminary estimation, denoted $\hat\epsilon$, we compute a quadratic form $q_A = \hat\epsilon^\top A \hat\epsilon$ with a matrix $A$.
Replacing $\Omega$ by its expression, we obtain, as previously, an expected value $\mathrm{E}\left(q_A\right)$ that is linear in the three variance components $\sigma_\nu^2$, $\sigma_\eta^2$, and $\sigma_\lambda^2$.
The most popular estimators are obtained by computing the three quadratic forms with the within-individual, between-individual, and between-group matrices. Equating each quadratic form to its expected value yields a system of three linear equations, whose coefficients are obtained from standard results on the traces of the projection matrices involved; solving this system gives unbiased estimators of the three variance components.
Baltagi et al. (2001) have proposed variants of the Amemiya (1971) estimator (where the within estimator is used for the three quadratic forms), of the Wallace and Hussain (1969) estimator (the OLS estimator is used for the three quadratic forms), and of the Swamy and Arora (1972) estimator (the within, between-individual, and between-group estimators are used respectively for the within, between-individual, and between-group quadratic forms). The detailed formulas are presented in Figure 3.2.