To this end, the paper estimates a DCDP model in which immigrants can be in one of the following states: employed in a blue-collar occupation, employed in a white-collar occupation, attending a training course in a blue-collar occupation, attending a training course in a white-collar occupation, or unemployed. An immigrant’s language ability (which is self-reported) is assumed to evolve exogenously. In each period, individuals have some probability of receiving a white- or blue-collar job offer and of receiving a white-collar training offer. Blue-collar training is always an option for those with sufficient knowledge of Hebrew. Wage offers depend on the immigrants’ accumulated human capital, which in turn depends on accumulated experience and training, language fluency and imported skills. The model includes four unobserved types to capture unobservable heterogeneity. It is estimated by simulated maximum likelihood.

Using the model, Cohen-Goldner and Eckstein (2008) estimate the returns from job training, controlling through the decision model for selectivity into training. They find that participating in white-collar training increases mean accepted wages by 6% and blue-collar training by 9.8%. Participating in white-collar training also doubles the white-collar job offer probability. The net present value of government sponsored training to the immigrants is estimated to be 2.8-3.7%.

Models of early childhood investments

As earlier described, much of the work on understanding the sources of inequality in lifetime earnings and utility among individuals emphasizes the role of pre-market factors. A small literature in economics seeks to model in a dynamic setting how parental investments affect human capital formation during childhood and adolescent years and how parental investment levels are chosen. Bernal (2008) develops and estimates a DCDP model of employment and child care decisions of women, using the NLSY-Children dataset. She finds that mother’s employment status and child care choices affect children’s cognitive achievement outcomes, as measured by scores on the PPVT and PIAT (math and reading). Having a mother who works full-time and uses child care reduces test scores by about 2%. She uses the model to explore the effects of policy interventions that include child care subsidies and maternity leave entitlements, which she finds, on average, have adverse affects on pre-school aged children’s cognitive outcomes. For example, a 35% child care subsidy, which increases the labor supply of mothers, reduces test scores by 0.23%-1.8%, depending on the age of the child.

Recent work by Cunha et al. (2010) develops new methods for estimating the so-called “technology of human capital formation.” In particular, they develop and estimate a multistage, dynamic model of the evolution of cognitive and non-cognitive skills as determined by parental investments in different periods of a child’s life. As in Bernal (2008), they use the NLSY-Children data and measure cognitive skills by math and reading scores. Non-cognitive skills would include such factors as motivation, self-efficacy, stubbornness and temperament, for which empirical measures are also available. The paper posits a conceptual framework in which current skills are a function of previous skill levels and intervening parental investments, parental skills, and unobserved components. It is assumed that parents sequentially choose investment levels to maximize their child’s expected net present value of lifetime earnings, which implies that investment is endogenous in the estimation of the skill production technology. The paper develops an approach for addressing the endogeneity problem that jointly estimates the skill production function along with the investment policy function that can be viewed as an approximation to the decision rule from the dynamic programming problem.

A major goal of the paper is to recover substitution parameters that govern the relative important of early versus late parental investment for subsequent lifetime achievement. Cunha et al. (2010) find that investments in the early years are particularly important for the formation of adult cognitive skills and that investments in cognitive skill are much more productive if applied at younger ages. On the other hand, for non-cognitive skills, there are not dramatic differences in the productivity of investments across different life-cycle stages.

4.3.4 Summary

There is a rather consistent finding that human capital accumulation that has already occurred by the age of college attendance decisions plays a large role in subsequent labor market outcomes. Thus, policies like tuition subsidies, student loans, affirmative action, etc., aimed at affecting college attendance, may come too late in the life-cycle to remediate the main factors that lead to inequality in outcomes.159 The literature on schooling began with and has always emphasized the college attendance decision (see Walsh, 1935). If the goal is to understand the determinants of inequality and the effects of policies for reducing inequality, then the results to date strongly suggest that the focus should shift to human capital investment decisions at earlier ages.

5 Concluding Remarks—How Credible are DCDP Models?

As we have illustrated, applications of the DCDP approach have addressed challenging and important questions often involving the evaluation of counterfactual scenarios or policies. The ambitiousness of the research agenda that the DCDP approach can accommodate is a major strength. This strength is purchased at a cost. To be able to perform such counterfactual analyses in such a variety of settings, DCDP models must rely on extra-theoretic modeling choices, including functional form and distributional assumptions. It is tempting to dismiss the approach for that reason, although we see no other empirical methodology with which to replace it. All approaches fall short of an assumption-free ideal that does not and is likely never to exist. And, as we have noted, DCDP researchers have taken seriously the need to provide credible validation.

There are two approaches to model validation, stemming from different epistemological perspectives. The first is the view that knowledge is absolute, that is, there exists a “true” decision-theoretic model from which observed data are generated. This leads naturally to a model validation strategy based on testing the validity of the model’s behavioral implications and/or testing the fit of the model to the data. A model is deemed invalid if it is rejected according to some statistical criterion. Rejected or invalid models are discarded.

The second approach is based on a pragmatic epistemological view, in which it is acknowledged that all models are necessarily simplifications of agents’ actual decision-making behavior. Hypothesis testing as a means of model validation or selection is eschewed because, given enough data, all models would be rejected as true models. In this pragmatic view, there is no true decision-theoretic model, only models that perform better or worse in addressing particular questions. Models are chosen that are “best” for some specific purpose; alternative models may be valid for different purposes.

Decision-theoretic models are typically designed and estimated with the goal of predicting the impact on economic agents of changes in the economic environment. Thus, one criterion for model validation/selection that fits within the “pragmatic” view is to examine a model’s predictive accuracy, namely, how successful the model is at predicting outcomes of interest within the particular context for which the model was designed. In contrast, in the absolutist view, a model would be considered useful for prediction only if it were not rejected on statistical grounds, even though non-rejection does not necessarily imply predicted effects will be close to actual effects. Nor will non-rejected models necessarily outperform rejected models in terms of their (context-specific) predictive accuracy.

Can one provide convincing evidence about the credibility of these exercises? Put differently, how can DCDP models be validated and choices be made among competing models?

There are a number of possible approaches to model validation/selection.

1. Robustness to assumptions: One method is to check how robust the empirical findings are to alternative assumptions. Although, in principle, such a method would provide evidence on the credibility of any particular set of findings, the number of assumptions in these models, their computational burden, and disagreements among researchers as to the a priori importance of particular assumptions, has led practitioners to limit the use of this approach.160 That is not to say that evidence accumulated from the estimation of models by different researchers, each with different modeling inclinations, is not valuable.161 Indeed, contradicting findings could be very revealing.

2. Within-sample model fit: DCDP papers commonly assess model fit to the estimation sample, often, but not always, using formal statistical tests. The problem with basing validation on model fit is that, like nonstructural estimation, model building is an inductive as well as deductive exercise. The final specification results from a process in which the model structure is revised as estimation proceeds, by adding parameters and changing functional forms, as deficiencies in model fit are discovered. This process of repeated model pre-testing invalidates the application of standard formal statistical tests. Nevertheless, it is interesting to note that formal tests generally reject DCDP models. Although these models tend to have a lot of parameters, sometimes numbering into the hundreds, given the extensiveness of the data moments that these models attempt to fit, the models are actually parsimonious. In practice, researchers tend to stop well short of fitting the model to idiosyncratic features of the data just to improve model fit.

3. Out-of-sample validation: Out-of-sample validation relies on there being sample data not used in estimation, but that is assumed to come from the same underlying population. This validation sample can have a number of sources. One source for the validation sample is based on regime shifts. McFadden and Talvitie (1977), for example, estimated a random utility model (RUM) of travel demand before the introduction of the San Francisco Bay Area Rapid Transit (BART) system, obtained a forecast of the level of patronage that would ensue, and then compared the forecast to actual usage after BART’s introduction. McFadden’s model validation treats pre-BART observations as the estimation sample and post-BART observations as the validation sample. The validation exercise exploited data that were unavailable at the time of estimation to validate their model.

Some researchers have deliberately held out data to use for validation purposes. Lumsdaine et al. (1992), for example, estimated a model of the retirement behavior of workers in a single firm who were observed before and after the introduction of a temporary one-year pension window. They estimated several models on data before the window was introduced and compared the forecast of the impact of the pension window on retirement based on each estimated model to the actual impact as a means of model validation and selection. Keane and Moffitt (1998) estimated a model of labor supply and welfare program participation using data after federal legislation (OBRA 1981) that significantly changed the program rules. They used the model to predict behavior prior to that policy change. Keane and Wolpin (2007) estimated a model of welfare participation, schooling, labor supply, marriage and fertility on a sample of women from five US states and validated the model based on a forecast of those behaviors on a sixth state. The validation sample was purposely drawn from a state in which welfare benefits were significantly lower than in the estimation sample.

Randomized social experiments have also provided opportunities for model validation and selection. Wise (1985) exploited a housing subsidy experiment to evaluate a model of housing demand. In the experiment, families that met an income eligibility criterion were randomly assigned to control and treatment groups. The latter were offered a rent subsidy. The model was estimated using only control group data and was used to forecast the impact of the program on the treatment group. The forecast was compared to its actual impact. More recently, Todd and Wolpin (2006) used data from a large-scale school subsidy experiment in Mexico, where villages were randomly assigned to control and treatment groups. Using only the control villages, they estimated a behavioral model of parental decisions about child schooling and work, as well as family fertility. The validity of the model was then assessed according to how well it could forecast (predict) the behavior of households in the treatment villages.162

As should be clear from this discussion, model validation, and model building more generally, are part art and part science. For this reason, researchers will attach different priors to a model’s credibility, different weights to the validation evidence, and may, therefore, come to different conclusions about the plausibility of the results. Presumably, disagreements can be reduced as confirmatory or contradictory evidence is accumulated. Whatever empirical approach to inference is adopted, structural or nonstructural, researchers should strive to provide as much validation evidence as the data and methods permit.

References

Aguirregebaria, V., Mira, P., Dynamic discrete choice structural models: a survey. Journal of Econometrics (forthcoming)

J. Altonji. Intertemporal substitution in labor supply: evidence from micro data. Journal of Political Economy. 1986;94(Part 2):S176-S215.

S. Altug, R.A. Miller. Household choices in equilibrium. Econometrica. 1990;58:543-570.

S. Altug, R.A. Miller. The effect of work experience on female wages and labor supply. Review of Economic Studies. 1998;65:45-85.

J. Albrecht, B. Axell. An equilibrium model of search employment. Journal of Political Economy. 1984;92:824-840.

P. Arcidiacono. Affirmative action in higher education: how do admission and financial aid rules affect future earnings? Econometrica. 2005;73:1477-1524.

P. Arcidiacono, J.B. Jones. Finite mixture distributions, sequential likelihood, and the EM algorithm. Econometrica. 2003;71:933-946.

Arcidiacono, P., Miller, R.A., 2008. CCP estimation of dynamic discrete choice models with unobserved heterogeneity. Mimeo, Duke University

Attanasio, O., Meghir, C., Santiago, A., 2005. Education choices in Mexico: Using a structural model and a randomized experiment to evaluate Progresa. Mimeo, University College London

P. Bajari, A. Hortacsu. Are structural estimates of auction models reasonable? Evidence from experimental data. Journal of Political Economy. 2005;113:703-741.

R.E. Baldwin, G.C. Cain. Shifts in relative US wages: the role of trade, technology, and factor endowments. Review of Economics and Statistics. 2000;82:580-595.

Becker, G.S., 1964. Human Capital: A Theoretical and Empirical Analysis, with Special Reference to Education. National Bureau of Economic Research, New York

Becker, G.S., 1967. Human capital and the personal distribution of income: an analytical approach. Woytinsky Lecture No. 1. University of Michigan, Ann Arbor, Institute of Public Administration

R. Bellman. Dynamic Programming. Princeton: Princeton University Press; 1957.

R. Bellman, R. Kalaba, B. Kootkin. Polynomial approximation—a new computational technique in dynamic programming allocation processes. Mathematics of Computation. 1963;1:155-161.

C. Belzil, J. Hansen. Unobserved ability and the return to schooling. Econometrica. 2002;70:2075-2091.

Y. Ben-Porath. The production of human capital and the life-cycle of earnings. Journal of Political Economy. 1967;75:352-365.

R. Bernal. The effect of maternal employment and child care on children’s cognitive development. International Economic Review. 2008;49:1173-1209.

D Blau. Search for nonwage job characteristics: a test of the reservation wage hypothesis. Journal of Labor Economics. 1991;9:186-205.

R.W. Blundell, I. Walker. A life cycle consistent empirical model of labor supply using cross section data. Review of Economic Studies. 1986;53:539-558.

R.W. Blundell, A. Duncan, C. Meghir. Estimating labour supply responses using tax policy reforms. Econometrica. 1998;66:827-861.

C. Bontemps, J.-M. Robin, G.J. van den Berg. An empirical equilibrium job search model with search on the job and heterogeneous workers and firms. International Economic Review. 1999;40:1039-1074.

C. Bontemps, J.-M. Robin, G.J. van den Berg. Equilibrium search with continuous productivity dispersion: theory and nonparametric estimation. International Economic Review. 2000;41:305-358.

J. Bound, G. Johnson. Changes in the structure of wages in the 1980s: an evaluation of alternative explanations. American Economic Review. 1992;82:371-392.

K. Burdett. A theory of employee job search and quit rates. American Economic Review. 1978;68:212-220.

K. Burdett, D.T. Mortensen. Wage differentials, employer size and unemployment. International Economic Review. 1998;39:257-273.

K. Burdett, J.I. Ondrich. How changes in labor demand affect unemployed workers. Journal of labor Economics. 1985;3:1-10.

P. Cahuc, F. Postel-Vinay, J.-M. Robin. Wage bargaining with on the job search: theory and evidence. Econometrica. 2006;74:323-364.

S.V. Cameron, J.J. Heckman. Life cycle schooling and dynamic selection bias: models and evidence for five cohorts of American males. Journal of Political Economy. 1998;106:262-333.

S.V. Cameron, J.J. Heckman. Can tuition policy combat rising wage inequality? In: M. Kosters, editor. Financing College Tuition: Government Policies Social Priorities. Washington, DC: AEI Press; 1999:76-121.

D. Card. Estimating the returns to schooling: progress on some persistent econometric problems. Econometrica. 2001;69:1127-1160.

G. Chamberlain. Panel data. Z. Griliches, M. Intriligator, editors. Handbook of Econometrics, vol. 2. Amsterdam: North Holland, 1984.

Ching, A., Imai, S., Ishihara, M., Jain, N., 2010. A guide to Bayesian estimation of discrete choice models with an application to a store level reward program. Mimeo. University of Toronto

B.J. Christensen, N.M. Kiefer. The exact likelihood function for an empirical job search model. Econometric Theory. 1991;7:464-486.

B.J. Christensen, N.M. Kiefer. Economic Modeling and Inference. Princeton: Princeton University Press; 2009.

K.P. Classen. The effect of unemployment insurance on the duration of unemployment and subsequent earnings. Industrial and Labor Relations Review. 1977;30:438-444.

J. Cogan. Fixed costs and labor supply. Econometrica. 1981;49:945-964.

S. Cohen-Goldner, Z. Eckstein. Labor mobility of immigrants: training, experience, language and opportunities. International Economic Review. 2008;49:837-874.

F. Cunha, J.J. Heckman, S.M. Schennach. Estimating the technology of cognitive and noncognitive skill formation. Econometrica. 2010;78:883-931.

T. Daula, R. Moffitt. Estimating dynamic models of quit behavior: the case of military reenlistment. Journal of Labor Economics. 1995;13:499-523.

J.P. Danforth. On the role of consumption and decreasing absolute risk aversion in the theory of job search. In: S.A. Lippmand, J.J. McCall, editors. Studies in the Economics of Search. New York: North Holland; 1979:109-131.

S. Della Vigna, M.D. Paserman. Job search and impatience. Journal of Labor Economics. 2005;23:527-588.

A.P. Dempster, M. Laird, D.B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B. 1977;39:1-38.

P.A. Diamond. A model of price adjustment. Journal of Economic Theory. 1970;3:156-168.

P.A. Diamond, E. Maskin. An equilibrium analysis of search and breach of contracts I: steady states. Bell Journal of Economics. 1979;10:282-316.

J. Dominitz, C.F. Manski. Eliciting student expectations of the returns to schooling. Journal of Human Resources. 1996;31:1-26.

J. Dominitz, C.F. Manski. Using expectations data to study subjective income expectations. Journal of the American Statistical Association. 1997;92:855-867.

Z. Eckstein, E. Nagypal. The evolution of US earnings inequality: 1961-2002. Federal Reserve Bank of Minneapolis Quarterly Review. 2004;28:10-29.

Z. Eckstein, G. Vandenberg. Empirical labor search: a survey. Journal of Econometrics. 2007;136:531-564.

Z. Eckstein, K.I. Wolpin. The specification and estimation of dynamic stochastic discrete choice models. Journal of Human Resources. 1989;24:562-598.

Z. Eckstein, K.I. Wolpin. Dynamic labor force participation of married women and endogenous wage growth. Review of Economic Studies. 1989;56:375-390.

Z. Eckstein, K.I. Wolpin. Estimating a market equilibrium search model from panel data on individuals. Econometrica. 1990;58:783-808.

Z. Eckstein, K.I. Wolpin. Duration to first job and the return to schooling: estimates from a search-matching model. Review of Economic Studies. 1995;62:263-286.

Z. Eckstein, K.I. Wolpin. Why youths drop out of high school: the impact of preferences, opportunities, and abilities. Econometrica. 1999;67:1295-1339.

D. Epple, R. Romano, H. Sieg. Admission, tuition, and financial aid policies in the market for higher education. Econometrica. 2006;74:885-928.

H. Fang, D. Silverman. Time inconsistency and welfare program participation: evidence from the NLSY. International Economic Review. 2009;50:1043-1078.

C. Ferrall. Unemployment insurance eligibility and the school to work transition in Canada and the United States. Journal of Business Economics and Statistics. 1997;15:115-129.

C.J. Flinn. Minimum wage effects on labor market outcomes under search, matching and endogenous contact rates. Econometrica. 2006;74:1013-1062.

C.J. Flinn, J.J. Heckman. New methods for analyzing structural models of labor force dynamics. Journal of Econometrics. 1982;18:114-142.

M. Francesconi. A joint dynamic model of fertility and work of married women. Journal of labor Economics. 2002;20:336-380.

E. French, C. Taber. Identification of models of the labor market. In: O. Ashenfelter, D. Card, editors. Handbook of Labor Economics, vol. 4A. Amsterdam: Elsevier; 2011:537-617.

Fu, Chao, 2009. An equilibrium model of the college market: an empirical investigation of tuition, applications, admissions and enrollment. Mimeo, University of Pennsylvania

Fuchs, V.R., 1980. Economic Growth and the Rise of the Service Sector. Working paper no. 386, National Bureau of Economic Research

Gemici, A., 2007. Family migration and labor market outcomes. Mimeo, University of Pennsylvania

J. Geweke, M.P. Keane, Mariano, Schuermann. Bayesian Inference for Dynamic Discrete Choice Models without the Need for Dynamic Programming. Cambridge: Cambridge University Press, 2000.

J. Geweke, M.P. Keane. Computationally intensive methods for integration in econometrics. J.J. Heckman, E.E. Leamer, editors. Handbook of Econometrics, vol. 5. Amsterdam: Elsevier, 2001.

P. Gottschalf, R. Moffitt. The growth of earnings instability in the US labor market. Brookings Papers on Economic Activity. 1994;2:217-272.

Gotz, G.A., McCall, J.J., 1984. A dynamic retention model for air force officers: theory and estimates. RAND, R-3028-AF

Eric Gould. Marriage and career: the dynamic decisions of young men. Journal of Human Capital. 2008;2:337-378.

E. Greenberger, L. Steinberg. When Teenagers Work: The Psychological and Social Costs of Adolescent Employment. New York: Basic Books; 1986.

Z. Griliches. Estimating the returns to schooling: some persistent econometric problems. Econometrica. 1977;45:1-22.

J.J. Heckman. Shadow prices, market wages and labor supply. Econometrica. 1974;42:679-694.

J.J. Heckman. Sample selection bias as a specification error. Econometrica. 1979;47:153-162.

J.J. Heckman. The incidental parameters problem and the problem of initial conditions in estimating a discrete time—discrete data stochastic process. In: C.F. Manski, D. McFadden, editors. Structural Analysis of Discrete Data with Econometric Applications. Cambridge MA: MIT Press, 1981.

J.J. Heckman, B.E. Honore. The empirical content of the Roy model. Econometrica. 1990;58:1121-1149.

J.J. Heckman, L. Lochner, C. Taber. Explaining rising wage inequality: explorations with a dynamic general equilibrium model of earnings with heterogeneous agents. Review of Economic Dynamics. 1998;1:1-58.

J.J. Heckman, T.E. MaCurdy. A life cycle model of female labor supply. Review of Economic Studies. 1980;47:47-74.

J.J. Heckman, T.E. MaCurdy. Corrigendum on a life cycle model of female labor supply. Review of Economic Studies. 1982;49:659-660.

J.J. Heckman, S. Polachek. Empirical evidence on the functional form of the earnings-schooling relationship. Journal of the American Statistical association. 1974;69:350-354.

J.J. Heckman, G. Sedlacek. Heterogeneity, aggregation and market wage functions: an empirical model of self-selection in the labor market. Journal of Political Economy. 1985;93:1077-1125.

J.J. Heckman, B. Singer. A method for minimizing the impact of distributional assumptions in econometric models for duration data. Econometrica. 1984;52:271-320.

J.J. Heckman, R.J. Willis. A Beta-logistic model for the analysis of sequential labor force participation by married women. Journal of Political Economy. 1977;85:27-58.

A. Hornstein, P. Krusell, G.L. Violante. The effects of technical change on labor market inequalities. In: P. Aghion, S. Durlauf, editors. Handbook of Economic Growth. Amsterdam: Elsevier, 2005.

J. Hotz, R.A. Miller. Conditional choice probabilities and the estimation of dynamic models. Review of Economic Studies. 1993;60:497-530.

J. Hotz, R.A. Miller, S. Sanders, J. Smith. A simulation estimator for dynamic models of discrete choice. Review of Economic Studies. 1994;61:265-289.

H. Ichimura, C. Taber. Semi-parametric reduced form estimation of tuition subsidies. American Economic Association, Papers and Proceedings. 2002:286-292.

S. Imai, N. Jain, A. Ching. Bayesian estimation of discrete choice models. Econometrica. 2009;77:1865-1900.

S. Imai, M.P. Keane. Intertemporal labor supply and human capital accumulation. International Economic Review. 2004;45:601-641.

B. Jovanovic. Job matching and the theory of turnover. Journal of Political Economy. 1979;87:972-990.

C. Juhn, K.M. Murphy, B. Pierce. Wage inequality and the rise in returns to skill. Journal of Political Economy. 1993;101:410-442.

T. Kane. The Price of Admission: Rethinking How Americans Pay for College. Washington, DC: The Brookings Institution; 1999.

L.F. Katz, D.H. Autor. Changes in the age structure and earnings inequality. In: O. Ashenfelter, D. Card, editors. Handbook of Labor Economics, vol. 3. Amsterdam: North Holland; 1999:1463-1555.

L.F. Katz, K.M. Murphy. Changes in relative wages, 1963-1987: supply and demand factors. Quarterly Journal of Economics. 1992;107:35-78.

Keane, M.P., 2009a. Labor supply and taxes: a survey. Mimeo. University of Technology Sidney

Keane, M.P., 2009b. Income taxation in a life cycle model with human capital. Mimeo. University of Technology Sidney

M.P. Keane, R. Moffitt. A structural model of multiple welfare program participation and labor supply. International Economic Review. 1998;39:553-590.

M.P. Keane, R. Moffitt, D. Runkle. Real wages over the business cycle: estimating the impact of heterogeneity with micro data. Journal of Political Economy. 1988;96:1232-1266.

Keane, M.P., Sauer, R.M., 2009. A computationally practical simulation estimator for panel data models with unobserved endogenous state variables. International Economic Review (forthcoming)

M.P. Keane, K.I. Wolpin. The solution and estimation of discrete choice dynamic programming models by simulation: Monte Carlo evidence. Review of Economics and Statistics. 1994:648-672.

M.P. Keane, K.I. Wolpin. The career decisions of young men. Journal of Political Economy. 1997;105:473-522.

M.P. Keane, K.I. Wolpin. Eliminating race differences in school attainment and labor market success. Journal of Labor Economics. 2000;18:614-652.

M.P. Keane, K.I. Wolpin. The effect of parental transfers and borrowing constraints on educational attainment. International Economic Review. 2001;42:1051-1103.

M.P. Keane, K.I. Wolpin. Estimating welfare effects consistent with forward looking behavior: an article in two parts. Journal of Human Resources. 2002;37:570-622.

M.P. Keane, K.I. Wolpin. Exploring the usefulness of a non-random holdout sample for model validation: welfare effects on female behavior. International Economic Review. 2007;48:1351-1378.

M.P. Keane, K.I. Wolpin. Empirical applications of discrete choice dynamic programming models. Review of Economic Dynamics. 2009;12:1-22.

M.P. Keane, K.I. Wolpin. The role of labor and marriage markets, preference heterogeneity, and the welfare system in the life cycle decisions of black, Hispanic and white women. International Economic Review. 2010;51:851-892.

N.M. Kiefer, G.R. Neumann. Wage dispersion and homogeneity: the empirical equilibrium search model. In: H. Bunzel, et al, editors. Panel Data and Labor Market Dynamics. Amsterdam: North Holland, 1993.

J. Kimmel, T.J. Knieser. New evidence on labor supply: employment vs. hours elasticities by sex and marital status. Journal of Monetary Economics. 1998;42:289-301.

P. Krusell, L.E. Ohanian, J.V. Ríos-Rull, G.L. Violante. Capital-skill complementarity and inequality: A macroeconomic analysis. Econometrica. 2000;68:1029-1053.

D. Lee. An estimable dynamic general equilibrium model of school, work and occupational choice. International Economic Review. 2005;46:1-34.

D. Lee, K.I. Wolpin. Intersectoral labor mobility and the growth of the service sector. Econometrica. 2006;74:1-46.

D. Lee, K.I. Wolpin. Accounting for wage and employment changes in the US from 1968-2000: a dynamic model of labor market equilibrium. Journal of Econometrics. 2010;156:68-85.

R. Lentz. Optimal unemployment insurance in an estimated job search model with savings. Review of Economic Studies. 2009;12:37-57.

Lise, J., Seitz, S., Smith, J., 2003. Equilibrium policy experiments and the evaluation of social programs. Mimeo, Queens University

Robin L. Lumsdaine, James H. Stock, David A. Wise. Pension plan provisions and retirement: men and women, medicare, and models. D.A. Wise, editor. Studies in the Economics of Aging. Chicago: University of Chicago Press; 1992.

J.J. McCall. Economics of information and job search. Quarterly Journal of Economics. 1970;84:113-126.

T.E. MaCurdy. An empirical model of labor supply in a life cycle setting. Journal of Political Economy. 1981;89:1059-1085.

T.E. MaCurdy. A simple scheme for estimating an intertemporal model of labor supply and consumption in the presence of taxes and uncertainty. International Economic Review. 1983;24:265-289.

D. McFadden. A method of simulated moments for estimation of discrete response models with numerical integration. Econometrica. 1989;57:995-1028.

McFadden, Daniel, Talvitie, A.P., associates, 1977. Validation of disaggregate travel demand models: Some tests. Urban demand forecasting project, final report. Volume V, Institute of Transportation Studies, University of California, Berkeley

C.F. Manski. Measuring expectations. Econometrica. 2004;72:1329-1376.

R.L. Matzkin. Nonparametric and distribution-free estimation of the binary threshhold crossing and the binary choice models. Econometrica. 1993;60:239-270.

Mazzocco, M., Yamaguchi, S., 2006. Labor supply, wealth dynamics and marriage decisions. Mimeo. UCLA

B. Meyer. Unemployment insurance and unemployment spells. Econometrica. 1990;58:757-782.

R.A. Miller. Job matching and occupational choice. Journal of Political Economy. 1984;92:1086-1120.

R.A. Miller. Estimating models of dynamic optimization with microeconomic data. In: M.H. Pesaran, P. Schmidt, editors. Handbook of Applied Econometrics: Microeconomics. Basil Blackwell, 1997.

J. Mincer. Investment in human capital and personal income distribution. Journal of Political Economy. 1958;66:281-302.

Mincer, J., 1962. Labor force participation of married women: a study of labor supply. In: Aspects of Labor Economics. National Bureau of Economic Research, Princeton, NJ, pp. 63–97

R. Moffitt. An economic model of welfare stigma. American Economic Review. 1983;73:1023-1035.

D.T. Mortensen. A theory of wage and employment dynamics. In: E.S. Phelps, et al, editors. Microeconomic Foundations of Employment and Inflations Theory. New York: W.W. Horton, 1970.

D.T. Mortensen. Unemployment insurance and job search decisions. Industrial and Labor Relations Review. 1976;30:505-517.

D.T. Mortensen. The matching process as a noncooperative game. In: J.J. McCall, editor. The Economics of Information and Uncertainty. Chicago: NBER, University of Chicago Press, 1982.

D.T. Mortensen. Job search and labor market analysis. In: Handbook of Labor Economics. Amsterdam: North-Holland; 1986.

K.M. Murphy, F. Welch. The structure of wages. Quarterly Journal of Economics. 1992;107:285-326.

K.M. Murphy, F. Welch. Occupational change and the demand for skill: 1940-1990. American Economic Review. 1993;83:122-126.

A. Norets. Inference in dynamic discrete choice models with serially correlated unobserved state variables. Econometrica. 2009;77:1665-1682.

A. Pagan, A. Ullah. Nonparametric Econometrics. Cambridge: Cambridge University Press; 1999.

A. Pakes. Patents as options: some estimates of the value of holding European patent stocks. Econometrica. 1986;54:755-784.

M.D. Paserman. Job search and hyperbolic discounting: structural estimation and policy evaluation. The Economic Journal. 2008;118:1418-1452.

F. Postel-Vinay, J.-M. Robin. Equilibrium wage dispersion and with heterogeneous workers and firms. Econometrica. 2002;70:2295-2350.

S. Rendon. Job search and asset accumulation under borrowing constraints. International Economic Review. 2006;47:233-264.

S. Rosen. Human capital: a survey of empirical research. R. Ehrenberg, editor. Research in Labor Economics, vol. 1. Greenwhich, CT: JAI Press, 1977.

A. Roy. Some thoughts on the distribution of earnings. Oxford Economic Papers. 1951;3:135-146.

J. Rust. Optimal replacement of GMC bus engines: an empirical model of Harold Zurcher. Econometrica. 1987;55:999-1034.

J. Rust. Structural estimation of markov decision models. R. Engle, D. McFadden, editors. Handbook of Econometrics, vol. 4. Amsterdam: North Holland, 1993.

J. Rust. Numerical dynamic programming in economics. In: H.M. Amman, et al, editors. Handbook of Computational Economics. Amsterdam: North Holland, 1994.

J. Rust. Using randomization to break the curse of dimensionality. Econometrica. 1997;65:487-516.

R. Sauer. Job mobility and the market for lawyers. Journal of Political Economy. 1998;106:147-171.

T.J. Sargent. Dynamic Macroeconomic Theory. Cambridge, MA: Harvard University Press; 1987.

R. Sauer. Education financing and lifetime earnings. Review of Economic Studies. 2004;71:1189-1216.

K. Shaw. Life cycle labor supply with human capital accumulation. International Economic Review. 1989;30:431-456.

G. Stigler. The economics of information. Journal of Political Economy. 1961;69:213-225.

T.R. Stinebrickner. Serially correlated variables in dynamic, discrete choice models. Journal of Applied Econometrics. 2000;15:595-624.

T.R. Stinebrickner. A dynamic model of teacher labor supply. Journal of Labor Economics. 2001;19:196-230.

S. Stern. Estimating a simultaneous search model. Journal of Labor Economics. 1989;7:348-369.

P. Sullivan. A dynamic analysis of educational attainment, occupational choices and job search. International Economics Review. 2010;51:289-317.

Tartari, M., 2007. Divorce and Cognitive Achievement of Children. Mimeo. Yale University

P. Todd, K.I. Wolpin. Assessing the impact of a school subsidy program in Mexico: using a social experiment to validate a dynamic behavioral model of child schooling and fertility. American Economic Review. 2006;96:1384-1417.

P. Todd, K.I. Wolpin. Ex-ante evaluation of social programs. Annals of Economics and Statistics. 2010.

G.J. Van den Berg. Nonstationarity in job search theory. Review of Economic Studies. 1990;57:255-277.

G.J. Van den Berg, G. Ridder. An empirical equilibrium search model of the labor market. Econometrica. 1998;66:1183-1221.

W. Van der Klaauw. Female labour supply and marital status decisions: a life cycle model. Review of Economic Studies. 1996;63:199-235.

Van der Klaauw, W., 2000. On the use of expectations data in estimating structural dynamic models: an analysis of career choices. Mimeo. University of North Carolina

W. Van der Klaauw, K.I. Wolpin. Social security and the retirement and savings behavior of low income households. Journal of Econometrics. 2008;145:21-42.

Van Soest, A.I. Woittiez, A. Kapteyn. Labor supply, income taxes and hours restrictions in the Netherlands. Journal of Human Resources. 1990;25:517-558.

Y. Weiss, R. Gronau. Expected interruptions in labour force participation and sex-related differences in earnings. Review of Economic Studies. 1981;48:607-619.

J.R. Walsh. Capital concept applied to man. Quarterly Journal of Economics. 1935;49:255-285.

F. Welch. What have we learned from empirical studies of unemployment insurance? Industrial and Labor Relations Review. 1977;30:450-461.

F. Welch. Growth in womens’ relative wages and in inequality among men:one phenomenon or two. American Economic Review. 2000;90:444-449.

R. Willis, S. Rosen. Education and self-selection. Journal of Political Economy. 1979;87:S7-S36.

D. Wise. A behavioral model versus experimentation: the effects of housing subsidies on rent. P. Brucker, R. Pauly, editors. Methods of Operations Research. 1985;50:441-489.

A. Wolinsky. Matching, search and bargaining. Journal of Economic Theory. 1987;42:311-333.

K.I. Wolpin. A dynamic stochastic model of fertility and child mortality. Journal of Political Economy. 1984;92:852-874.

K.I. Wolpin. Estimating a structural search model: the transition from school to work. Econometrica. 1987;55:801-817.

K.I. Wolpin. The determinants of black-white differences in early employment careers: search, layoffs, quits and endogenous wage growth. Journal of Political Economy. 1992;100:535-560.

K.I. Wolpin. Empirical Methods for the Study of Labor Force Dynamics. Luxembourg: Harwood Academic Publishers; 1995.

K.I. Wolpin. Wage Equations and Education Policy. Drewatripont, et al. Advances in Econometrics and Economics. vol. II. Cambridge: Cambridge University Press; 2003.

1 More technical discussions can be found in the surveys by Rust (1993, 1994), Miller (1997) and Aguirregebaria and Mira (forthcoming), as well as in a number of papers cited throughout this chapter.

2 Their use has spread to areas outside of traditional economics, such as marketing, in which it is arguably now the predominant approach to empirical research.

3 Most applications of DCDP models assume that agents, usually individuals or households, solve a finite horizon problem in discrete time. For the most part, we concentrate on that case and defer discussion of infinite horizon models to the discussion of the special case of job search models. We do not discuss continuous time models except in passing.

4 The conventional approach assumes that agents have rational expectations. An alternative approach directly elicits subjective expectations (see, e.g., Dominitz and Manski, 1996, 1997; Van der Klaauw, 2000; Manski, 2004).

5 A notable omission is the literature on retirement behavior. Although that literature relies heavily on the DCDP approach, the previous Handbook of Labor Economics chapter by Lumsdaine and Mitchell provides an extensive survey up to that time. We decided to concentrate on DCDP literature that to date has not been surveyed in the Handbook.

6 As will be seen in the empirical applications we consider, there are a wide range of types of variables that would be included in X. Their common feature is that they are not directly choices of the agent, although they may be affected by prior choices or correlated with choices without being directly affected by them.

7 By forward looking, we simply mean that agents take into account the effect of their current actions on future welfare. How exactly they form expectations about the impact of those actions and about future preferences and constraints are specific modeling choices.

8 By maintaining the same joint distribution when performing the ceteris paribus change, we are assuming that the change in an observable variable does not induce a change in the joint distribution of unobservables. This assumption is not the same as assuming conditional independence.

9 The modern approach to this topic began with Mincer (1962).

10 We treat the price of child care as parametric in part to illustrate how alternative approaches to estimation are related to achieving goal 3. A more complete model would allow for a choice among alternative types of child care, for example, of varying qualities, which differ in their price and which may vary over time.

11 In this taxonomy, semi-parametric and semi-structural categories fall into the parametric (P) and structural (S) categories.

12 As before (see footnote 8), we assume that the change in an observable variable does not induce a change in the joint distribution of unobservables.

13 Results from Matzkin (1993) apply to the case where all wage offers are observed (regardless of participation). In that case, aside from normalizations, image and the joint distribution, image are nonparametrically identified.

14 Pagan and Ullah (1999), Chapter 7, provides a good introduction to semi-parametric estimation of discrete choice models.

15 The unconventional assumption of normality for the wage distribution (allowing, as it does, for negative wage offers) is adopted in order to obtain a decision rule that is linear and additive in unobservables. We present a more general formulation in later sections.

16 As we show below, the additive error image is convenient in calculating choice probabilities and is maintained for illustrative purposes. However, as we also show below, the additive structure is fragile. It is lost, for example, if the wage function takes a semi-log form or if the utility function is nonlinear in consumption. Note that the linearity and separability of consumption in the utility function implies that husband’s income does not enter image and, thus, does not affect the participation decision.

17 We call image a probability, but it is actually a mixed probability for image and a density for image. Note that the Jacobian of the transformation from the wage density to the wage error density is one.

18 Given the assumptions of the model, full independence of the joint error distribution with respect to observables is not necessary. See French and Taber (2011) for an extended discussion of identification of selection models.

19 If all wage offers were observed, it would be possible to achieve all three goals without imposing parametric assumptions or structure. With respect to the policy counterfactual (goal 3), because of the subsidy acts like a wage tax, the effect of the subsidy can be calculated by comparing participation rates of women with a given wage to women with a wage augmented by image (see Ichimura and Taber (2002) and Todd and Wolpin (2010)).

20 Another reason for adopting the P-S estimation approach is that separating out preferences from opportunities (wage offers) helps to understand important social and economic phenomena, for example, in assessing how much of the difference in labor market outcomes of black and white women is due to differences in preferences and how much to differences in wage opportunities. Such an assessment could be useful in the design of public policies aimed at ameliorating those differences.

21 The assumption that the woman’s initial work experience at the time marriage is zero, which is undoubtedly in many cases untrue, is made for ease of exposition. We discuss in a later section the complications introduced by accounting for the fact that work experience is accumulated prior to marriage and varies across women.

22 The finite horizon assumption is immaterial for the points we wish to make. If the current period utility is bounded at all image and the discount factor is less than one, then the solution to the infinite horizon problem can be approximated arbitrarily closely by the solution to a long but finite horizon problem. The essential difference between a finite and infinite horizon model in terms of the predictions about behavior is that in the finite horizon case there are implications for age patterns in behavior.

23 The terminal period of the model would be at the termination of the marriage or the retirement of the wife. Accounting for divorce, even taking it to be exogenous, would unduly complicate the model. For illustrative purposes, then, we assume that the wife retires at image. The value function at image is normalized to zero, although a more complete formulation would make the retirement decision of both spouses a choice and would, at the least, specify the determination of post-retirement income through the social security system.

24 Given the lack of separate identification of image and image, we set image to reduce notation.

25 Note that if preference or wage shocks were serially correlated, the observable and unobservable state variables would not generally be additively separable as in the second equality. The additive separability arises because, with serial independence, image, which does not include image or image can replace image in the future component of the value functions,.that is image. We discuss the case of serially correlated errors below.

26 Because of the linearity and additive separability of consumption in utility, husband’s income does not affect the participation decision. We therefore do not need to specify what is known about future husband’s income (see below). Again, this assumption is made so that the solution method can be illustrated most effectively.

27 Later, we introduce stochastic fertility, allowing for the decision model to begin at the time of marriage, when we consider an extension of the model to a multinomial choice setting.

28 Suppose we define a young child as a child under the age of six (that is, not of school age). Consider a couple who at the start of the woman’s infecund period has a 3 year old child and thus for whom image. Then, for that couple, image and image.

29 This expression uses the fact that for any two random variables image and image,


image

30 Although image would surely be zero at some point, we carry it along to emphasize its perfect foresight property.

31 In solving for the latent variable functions, we could thus set image (or any other arbitrary value) for all image.

32 If the structure does not yield an additive (composite) error, the latent variable function becomes image. Calculating the joint regions of image, image that determine the probabilities that enter the likelihood function and that are used to calculate the image function must, in that case, be done numerically. We address this more general case below.

33 As in the static case, the Jacobian of the transformation from the density of the wage offer to the density of image is one.

34 In the current example, couples are assumed to know the full structure of the model and to use it in forming their forecasts of future wage offers and their future preferences.

35 It is possible that in some models additional parameters might enter image, say through the transition functions of state variables (see below for an example). While the same heuristic argument would apply, its validity would be less apparent.

36 More generally, if agents have beliefs about future policies (or policy changes), such beliefs should be incorporated into the solution and estimation of the decision model.

37 We ignore the possibility that the husband’s type also affects his earnings because, in the model as specified, his earnings has no effect on the participation decision. In a more general specification, one would probably add this source of heterogeneity.

38 There are obviously restrictions across the husband and wife individual type probabilities and couple type probabilities.

39 We could combine the permanent-transitory scheme with the AR(1) scheme by allowing the image and image shocks in (41) and (42) to be AR(1).

40 Alternatively as noted, we could assume, unrealistically as well, that the experience that women have at the start of marriage is exogenous with respect to future participation decisions.

41 We would also need to include any other initial conditions that affect wage offers (image’s) or preferences (image’s), for example, completed schooling.

42 If there is both unobserved permanent heterogeneity and serial correlation, and letting image be the exogenous initial conditions at the time of marriage, then in the likelihood function (43), image would be replaced with image. Note that image must contain a variable other than image and image in order to identify the effect of image on a couple’s type.

43 Note that we do not discuss methods like Hotz and Miller (1993) here. They propose a method to circumvent having to obtain a full solution of the DP problem while still obtaining parameter estimates, not a method for solving the DP problem (see below).

44 There is no formal proof of this proposition, though, as noted, Keane and Wolpin (1994) provide Monte Carlo evidence for a particular model that supports the intuition.

45 Geweke and Keane (2001) give an example where the curse of dimensionality is broken. This is when the Emax can be expressed as a function of the expected value of each alternative. (That is, these expected values are a sufficient statistic for all the state variables that determine them.) The size of this set of variables remains fixed at image, where image is the number of alternatives, even as the state space grows larger.

46 Technically this is not quite enough, as convergence must be uniform and not just pointwise.

47 Because image is now an unobserved component of the state space, estimation of image must be carried out jointly. This would require a distributional assumption for image and raises issues of the separate identification of image and of the effect of image on wages.

48 This is analogous to the fact that the asymptotic properties of competing estimators (under the hypothetical scenario of increasing sample size) do not reveal which will perform best given finite samples.

49 Stinebrickner (2000) compares several approximation methods in the context of a DCDP model with serially correlated shocks.

50 The Ben-Porath (1967) model of human capital accumulation leads to a semi-log form and Heckman and Polachek (1974) show using a Box-Cox transformation that a semi-log form is not rejected by the data.

51 Approximations to DCDP model decision rules were first discussed in Heckman (1981) and Wolpin (1984). For an empirical application in the labor economics literature, see Keane and Wolpin (2001).

52 In theory, the period length should correspond to the frequency of decision-making, which, in principle, may differ among choice variables. Like the specification of the model structure (including assumptions about expectations formation and optimization), the discrete time framework is adopted as an approximation. A continuous time framework would be more general, but would require assumptions about the joint process generating decision times for the choice variables.

53 To the extent that variations in hours worked within those categories represents differences in the choice of optimal hours, the discretization of hours induces measurement error. In the data, the mean and standard deviation of hours based on the categorization (where the categories are assigned 0, 1000, 2000 and 3000 hours) are almost identical to that based on actual annual hours worked. The standard deviation of hours within the categories is 145, 286, 224 and 429.

54 We have not, however, in this exploratory stage allowed for serially correlated unobservables either through permanent unobserved heterogeneity or serially correlated shocks.

55 The constant term in the contraceptive cost function, say image cannot be separately identified from image, that is, the goods cost of a newborn (a child age 0-1). Note that image.

56 One could instead allow for a choice of whether to contracept or not with pregnancy being an uncertain outcome. We ignore this extension for ease of presentation.

57 In Ben-Porath’s (1967) model of the production of human capital, an individual’s wage was given by the product of a human capital per-unit rental price times the individual’s human capital stock. Griliches (1977) operationalized the human capital production function as depending on arguments such as schooling, work experience and ability.

58 Husband’s are assumed to work full-time, which implies that, given schooling, age and work experience are isomorphic.

59 For convenience, image is in 1000 hour units.

60 image is obtained after substituting for the wife’s wage and the husband’s earnings in the budget constraint and then substituting for consumption in the utility function.

61 Keane and Wolpin (1994) discuss various specifications of the regression function.

62 In the labor force participation model, the total number of potential state points increases in image as feasible work experience and numbers of children increase. A researcher might, as the notation indicates, vary the number of randomly drawn state points with image.

63 Interpolating functions should be chosen with great care. To avoid overfitting, it is useful to solve the model at more state points than used in estimating the interpolating function and use the additional points for cross-validation. For example, we might solve the model at 4000 state points, estimate the interpolating function on 2000 points and check the fit, say the image, using the other 2000 points.

64 Note that the type-specific parameters, image’s, are essentially the constant terms in the image production function and cannot be separately identified from the skill rental prices.

65 Note that in writing (82) we are implicitly assuming that the state space evolves deterministically, conditional on the current state and current choice. Otherwise (82) would require a double sum, where the inner sum is over states that could potentially be reached from image given the choice image. Norets (2009) handles the stochastically evolving state space case. See also Ching et al. (2010).

66 Of course, providing such a distribution is possible without adopting a Bayesian approach, although it can be computationally burdensome.

67 The main insight in the multinomial setting is the same and the extension is straightforward.

68 Lee and Wolpin (2006); Lentz (2009) allow for (equilibrium) skill prices to change with calendar time due to technical change.

69 In recent work, Arcidiacono and Miller (2008) have developed methods for extending the HM approach to allow for unobserved state variables. However, there has as yet been no empirical implementation of that approach to a model as rich as those found in the literature.

70 Indeed, to our knowledge the only paper that has attempted to do so is Keane and Wolpin (2001). That paper models the labor supply and human capital investment decisions of young men, who often have low participation rates.

71 For men, strict application of the DCDP approach would require discretization of hours as an approximation to the choice set. In that case, the parallel to the multinomial choice problem considered above is exact. However, the main insight of the DCDP approach to estimation applies as well to continuous choices and to discrete-continuous choices in which the underlying dynamic programming problem is solved based on first-order conditions or Kuhn-Tucker conditions. That insight was simply the observation that because the continuation value (the image max function) is a deterministic function of state variables, the static model and the dynamic programming model have a common empirical structure.

72 Their approach builds on the seminal work of MaCurdy (1981) on the labor supply of men.

73 image is set at 8760 hours.

74 If a woman never works, the likelihood of that event is maximized by setting the fixed effect to image. Adjustments for this sample selection made little difference to the estimates.

75 Although the effect of the wife’s age may be interpreted as an estimate of image, it may also reflect changing preferences for leisure with age.

76 Note that this is still a rather large increase, consistent with a Marshallian elasticity of 13. 6/15 = 0.90.

77 The only exceptions we have come across are Van Soest et al. (1990) and Keane and Moffitt (1998). Both papers note that it is rare to observe people working very low levels of hours (the former paper looking at men, the latter looking at single mothers). Van Soest et al. (1990) capture this by building in a job offer distribution where few jobs with low levels of hours are available. Keane and Moffitt (1998) build in actual measures of fixed costs of working (e.g., estimates of child care costs).

78 See Altug and Miller (1998), Eqs. (6.8) and (6.9), which give the final simple expressions for the labor supply and participation equations. Hotz et al. (1994) develop a simulation method for implementing the Hotz and Miller (1993) conditional choice probability approach.

79 Note it is important not to include the aggregate prices image and image in these regressions. Agents are assumed not to know the future realizations of these variables and so cannot condition on them when forming expected future payoffs.

80 An alternative computational approach to taking out group and time means is to regress the group mean of hours on the group mean of wages and a complete set of time and group dummies. Then the wage effect is identified purely from the wage variation not explained by time or group. The advantage of the more involved two-step procedure is that the coefficient on the residual provides a test of exogeneity of wages.

81 Annual earnings if the woman works are assumed to equal 2000 times the hourly wage rate, regardless of how many hours the woman actually works. This is necessitated by the 1/0 nature of the work decision.

82 Eckstein and Wolpin (1989b) also assume that husband’s earnings is a deterministic function of husband’s age, a fixed effect, and a schooling/age interaction. If there were taste shocks or shocks to husband’s earnings they would have to be integrated out in solving the DP problem.

83 Note that the measurement error in wages cannot be estimated using wage data alone. But joint estimation of a wage equation and a labor supply model does allow measurement error to be estimated, as true wage variation affects behavior while measurement error does not. Of course, any estimate of the extent of measurement error so obtained will be contingent on the behavioral model.

84 This method is not necessarily more parsimonious than modeling a variable as a choice, trading off an additional choice variable (whether to have a child in this case with the corresponding utility and cost parameters ) against additional parameters governing the stochastic outcome (the probability of having a child). A limitation of this method is that it does not allow for effects of contemporaneous shocks, for example a high wage draw for the female, on the probability of having a child.

85 An alternative approach would be to assume the four errors follow a generalized extreme value distribution (see Arcidiacono (2005)).

86 The estimates imply that a married woman who works receives 34% of husband income. Unfortunately, the share if she does not work is not identified. As can be seen from (118), if a married woman does not work her utility from consumption is image times her share of husband income. Only this product is identified in the model.

87 Van der Klaauw (1996) simulates that a $1000 (or 5%) increase in husband offer wages would reduce average duration to first marriage by 1 year, increase average years of marriage (by age 35) by 2.3 years, and reduce average years of work by 2.6 years, or 27%. These are very large income effects, but they are not comparable to standard income effect measures, as they refer to changes in husband offer wages as opposed to changes in actual husband wages (or changes in some other type of non-labor income). Furthermore, it is not clear how much credence we can give to these figures since, as noted earlier, all permanent differences in husband income in the model are generated by differences in the wife’s own characteristics.

88 There are separate part- and full-time wage functions.

89 This is a sub-sample of a group of 1,783 women who were married at least once during the period (the larger sample including women who leave a partner during the sample period).

90 Neither model captures the sharp decline in participation in their 60’s due to retirement. But to be fair neither model incorporates any features designed to explain retirement behavior (such as pensions or Social Security).

91 The choice set differs across women for a number of reasons. For instance, only unmarried women with children under 18 have the option to participate in welfare, and working while on welfare is not an option if the offer wage rate is high enough that income would exceed the eligibility level. Also, girls under 16 cannot choose marriage.

92 Childcare time is, in turn, a weighted sum of time required to care for children in different age ranges.

93 The utility function includes some miscellaneous additional terms that were added to capture some specific features of the data. Full and part-time work are interacted with school to capture the fact that people who work while in school tend to have a strong preference for part-time over full-time work. Work variables are also interacted with a school less than 12 dummy to capture that part-time work is far more prevalent among high school students. Pregnancy is interacted with school to capture that women rarely go to school while pregnant. Tastes for school, marriage and pregnancy are also allowed to shift at certain key ages (16, 18 and 21). And there is a linear time trend (across cohorts) in tastes for marriage.

94 Keane and Wolpin (2002), which presents a nonstructural analysis of the same data, provides a more detailed description.

95 Note that this is an elasticity for hours conditional on working. It is unfortunate that Cogan does not report a participation elasticity, as, given his estimates, this would presumably have been much larger.

96 For recent attempts see Mazzocco and Yamaguchi (2006) and Tartari (2007).

97 Imai and Keane (2004) actually assume a much more complex process, designed to capture patterns of complementarity between human capital and hours of work in the human capital production function. But use of this simpler form helps to clarify the key points.

98 Although Imai and Keane assume a constant rental price, allowing for time varying rental rates is fairly straightforward.

99 Adding a bequest motive to the model, as in Imai and Keane (2004) is straightforward. This extension can be accommodated by adding a terminal value function, say image to image.

100 These equations may have multiple solutions. If there are, then one would need to check second order conditions or calculate image.

101 Keane and Wolpin (2001) first developed this approach to forming the likelihood in DCDP models. Keane and Sauer (2009) extended the approach to nonstructural panel data models.

102 Van der Klaauw and Wolpin (2008) estimate a collective model of the joint labor supply decisions of a married couple nearing retirement They allow for savings and human capital accumulation, incorporating as well a detailed representation of US social security system.rules. As noted in the introduction, we do not review the DCDP retirement literature in this chapter.

103 The estimate of image in Van der Klaauw and Wolpin (2008) is −0.6, which is also in line with other estimates from the retirement literature. They also include liquidity constraints.

104 Wages would grow deterministically if image contains age or job tenure.

105 A number of these earlier papers appeared in a 1977 symposium volume of the Industrial and Labor Relations Review. Most relevant in that volume are papers by Classen, and the comment on them by Welch (1977).

106 Burdett (1978) extended the standard unemployment search model to allow for search on the job.

107 It is a common theme in the structural literature to build upon and extend the theoretical literature in developing estimable models. This is the case with Rendon’s (2006) paper, which builds on the earlier work of Danforth (1979). Lentz (2009) also structurally estimates a sequential search model with savings. Unlike the standard model, the wage offer distribution is taken to be degenerate and agents choose their search intensity, which affects the rate at which job offers are received. Lentz uses the model to empirically determine the optimal unemployment insurance scheme.

108 Examples of papers based on wage posting models include Eckstein and Wolpin (1990), Kiefer and Neumann (1993), Van den Berg and Ridder (1998), Bontemps et al. (1999, 2000) and Postel-Vinay and Robin (2002). Those based on search-matching-bargaining.models include, among others, Eckstein and Wolpin (1995), Cahuc et al. (2006) and Flinn (2006).

109 See Mortensen (1986) for the continuous time case.

110 The LHS is linearly increasing in image and passes through the origin. The RHS is monotonically decreasing in image until it reaches image, and is then constant. There will be a unique intersection, and a unique image, as long as image.

111 In a continuous time model in which the arrival of offers follows a Poisson process with parameter image, the implicit reservation wage equation is identical except that the instantaneous arrival rate image replaces the offer probability, image.

112 Given a distributional assumption for image, the solution for the reservation wage involves numerically (if, as is for most distributions the case, there is no closed form solution) solving a nonlinear equation. An alternative solution method would be to start from the reservation wage for the final period of a finite horizon problem (see below) and iterate on the reservation wage until it converges. Convergence is assured because the value function is a contraction mapping (see Sargent (1987)).

113 Wolpin (1995) provides a proof.

114 The distribution of accepted wages is the truncated distribution of wage offers, namely, image.

115 An alternative would be to assume that once the terminal period is reached, the individual accepts the next offer that arrives, in which case the reservation wage at image is zero.

116 See Wolpin (1987), for the particular case in which image is either normal or log normal.

117 There are statistical issues better handled by specifying the hazard function, such as dealing with incomplete spells and time-varying regressors. See Meyer (1990) for an example of this approach. The issues we raise, however, are easier to demonstrate in a regression framework, but hold in the hazard framework as well.

118 The assumption that new unemployment spells are renewal processes rules out any structural connection between spells; for example, it rules out that the benefit level depends on the pre-unemployment wage.

119 As we noted, it is also usual to include some aggregate labor market statistic like the local unemployment rate. The idea is that the aggregate statistic reflects labor market demand and so will affect the offer rate or the wage offer distribution. As shown in Wolpin (1995), because the aggregate statistic is simply the aggregation of the search decisions over the unemployed population, it does not reflect solely demand conditions and estimates of UI benefit effects suffer from proxy variable bias.

120 This section follows the development in Flinn and Heckman (1982) and Wolpin (1995).

121 The minimum observed wage is a superconsistent estimator of the reservation wage in that it converges at rate image. This leads to nonstandard asymptotics in the likelihood estimation of the search model (see Flinn and Heckman (1982) and Christensen and Kiefer (1991)).

122 Setting image and image yields the infinite horizon implicit reservation wage function.

123 The implicit reservation wage Eq. (138) would no longer hold in this case. In particular, the integration would have be taken also over the distribution of the unobserved cost of search, recognizing that the reservation wage would be a function of that cost.

124 In some instances, wage rates that are directly reported in hours or weeks are inaccurate. In other cases, wage rates are derived from a division of earnings, reported over a longer period, say annually, and hours worked reported over that period. The inaccuracy arises from a seeming mismatch in the time period between earnings and hours.

125 Theoretical models in which job searchers faced borrowing constraints appeared much earlier, starting with Danforth (1979). However, formal empirical implementation did not become feasible until the development of estimation methods for DCDP models.

126 The search model does not have to incorporate savings for it to be optimal for individuals to quit into unemployment. It is sufficient that there be a finite horizon (retirement) and either that the offer rate be higher in unemployment or that there be a wage return to general experience (Wolpin, 1992).

127 This procedure for reducing the size of the state space follows Wolpin (1992).

128 Assets are only observed annually. The data are from the NLSY79.

129 Note that the cost of search in his model is isomorphic to the probability of receiving an offer.

130 The reservation wage, mean accepted wage and hazard rate are all functions of image. They can be estimated as nonparametric functions of image Paserman also models search after the exhaustion of benefits. In that case, it is assumed that individuals solve an infinite horizon problem.

131 The model was estimated both for a normal and log normal wage offer distribution. As found by others (for example,Wolpin (1987), the normal distribution assumption proved problematic. Paserman also allows for measurement error for the reasons previously discussed.

132 Ferrall also estimates a model for the US, but, because the UI system varies from state to state and is much less generous than the Canadian system, he does not incorporate UI benefits into the analysis. We focus on the Canadian data to highlight the fact that the DCDP approach allows for a detailed representation of UI policy.

133 Actually, this waiting period is only for those who are unemployed through layoff. Those who quit or were fired had a waiting period of five weeks. Ferrall assumes the waiting period to be two weeks independent of the reason for the unemployment spell.

134 As previously noted, the lower bound of the support for the Pareto distribution cannot be identified. Ferrall fixes that value. Christensen and Kiefer (2009) also use the Pareto distribution and impose, based on the wage posting model of Burdett and Mortensen (1998), the individual’s reservation wage as the lower bound.

135 UI system parameters depend on region and some of the structural parameters are allowed to differ by education.

136 Walsh (1935) is cited by Becker (1964), which is perhaps the most influential work in the development of the human capital literature.

137 The sources include a survey of 15,000 former members of the Alpha Kappa Psi fraternity on the education and earnings, a survey of Land Grant colleges made by the US Department of Interior, published figures on the earnings of physician and doctors, and a survey of Harvard Law graduates.

138 Heckman and Honore (1990) derive additional implications for earnings distributions and provide identification results in the case of non-normal distributions.

139 Much of the empirical literature prior to Willis and Rosen (1979) was concerned with estimating rates of return to schooling. A considerable amount of effort was (and still is) devoted to accounting for bias in the schooling coefficient due to omitted ability in a Mincer-style earnings function. See Card (2001) and Wolpin (2003) for an assessment of that literature.

140 Specifically, pay grades and promotions are assumed to be probabilistic functions of observable state variables.

141 A related study by Daula and Moffitt (1995) uses a similar DCDP model to analyze the effect of financial incentives on Army infantry reenlistments.

142 The model is solved using the Gittens index, a simplification in the solution of the dynamic programming problem that arises because tenure in one job does not affect the rate of learning in other jobs.

143 Work experience, image.

144 Keane and Wolpin (2001) also find that, on average, youths receive a transfer from their parents in excess of what is received when not attending college, sufficient to fully subsidize college tuition costs. The subsidy ranges from about one-half of the tuition cost for youths whose parents are the least educated (neither a high school graduate) to almost twice the tuition cost for youths whose parents are the most educated (at least one parent a college graduate). It might appear that it is because of the largesse of parents that relaxing borrowing constraints has only a minimal impact on college attendance. However, simulating the impact of relaxing the borrowing constraint in a regime where parents are assumed to provide no additional transfers to children who attend college leads to the same result. Transfers do, however, have a non-negligible effect on school attendance. If transfers are equalized across children, high SES children go to school less, but low SES children do not increase their attendance by much.

145 The approach taken by Cameron and Heckman (1999) can be interpreted as estimating the approximate decision rules from a DCDP model.

146 All individuals are assumed to complete at least six years.

147 In the model, there is also a probability of experiencing a so-called interruption, which is a decision period when no decision is made and the stock of accumulated human capital remains fixed, intended to capture an event such as illness or academic failure that lasts one period.

148 See Belzil and Hansen (2002) Table VIII. The previously described Keane and Wolpin (1997) model did not allow for nonlinearities in returns to education but did allow the return to differ between white and blue collar occupations and found a much higher return in the white collar occupation, which could be viewed as consistent with Belzil and Hansen’s (2002) finding of high returns at higher education levels.

149 The model is life cycle rather than dynamic in the sense that new information is not revealed to the agent in each decision period.

150 An earlier paper by Lee and Wolpin (2006) develops a similar model with a focus on examining the relative importance of labor demand and supply factors in explaining the expansion of service sector employment

151 The general rise in inequality and the college premium have often been linked (for example, Murphy and Welch (1992)), but not together with the rise in female-male wages and the growth of the service sector. The growth in the service sector has also been linked with the rise in female employment Fuchs, 1980; Welch, 2000 draws a link between the rise in wage inequality among men and the reduction in the gender wage gap.

152 For a review of the larger literature, see Katz and Autor (1999). The papers they survey include Bound and Johnson (1992), Gottschalf and Moffitt (1994), Juhn et al. (1993), Katz and Murphy (1992), Krusell et al. (2000) and Murphy and Welch (1992, 1993). Recent contributions to this literature include Baldwin and Cain (2000), Eckstein and Nagypal (2004), Hornstein et al. (2005) and Welch (2000).

153 There is a large labor economics literature on interindustry wage differentials among otherwise observably identical workers. Frictions to switching sectors are sometimes proposed to explain these differentials.

154 The maximum likelihood approach, developed by Keane and Wolpin (2001) and extended in Keane and Sauer (2009) allows for classification error and for missing state variables.

155 These results are similar to those Keane and Wolpin (2001) obtain from simulations of the effect of reducing borrowing constraints on enrollment.

156 See Greenberger and Steinberg (1986).

157 Recent work that specifies and estimates equilibrium models of the college market include Epple et al. (2006) and Fu (2009).

158 Estimation is based on the EM algorithm developed in Dempster et al. (1977) and adapted to DCDP models by Arcidiacono and Jones (2003).

159 The DCDP schooling models described previously take endowments at college entry ages as given. It is still an open question whether college subsidies would induce an increase in parental investments at younger ages and thus affect the endowments.

160 Of course, robustness by itself cannot be conclusive; all of the models could give similarly biased results.

161 Such prejudices are revealed by the contrast between the structure of the DCDP model that Todd and Wolpin (2006) used to evaluate a conditional cash transfer program in Mexico and the model used by Attanasio et al. (2005). As another example, there are several applications of DCDP models applied to traditional topics that take a behavioral economics view. As seen, Paserman (2008) studies a job search model. In addition, Fang and Silverman (2009) study a model of women’s welfare participation assuming that agents use hyperbolic discounting.

162 Similarly, Lise et al. (2003) used data from a Canadian experiment designed to move people off of welfare and into work to validate a calibrated search-matching model of labor market behavior. Bajari and Hortacsu (2005) employ a similar validation methodology in the case of a laboratory auction experiment.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.222.108.185