Swapping the Nested Fixed-Point Algorithm: a Class of Estimators for Discrete Markov Decision Models

This paper proposes a procedure for the estimation of discrete Markov decision models and studies its statistical and computational properties. Our method is similar to Rust's Nested Fixed-Point algorithm (NFXP), but the order of the two nested algorithms is swapped. First, we prove that this method produces the maximum likelihood estimator under the same conditions as NFXP. However, our procedure requires significantly fewer policy iterations than NFXP. Second, based on this algorithm, we define a class of sequential consistent estimators, K -stage Policy Iteration (PI) estimators, that encompasses MLE and Holz-Miller, and we obtain a recursive expression for their asymptotic covariance matrices. This presents the researcher with a 'menu' of sequential estimators reflecting a trade-off between efficiency and computational cost. Using actual and simulated data we compare the relative performance of these estimators. In all our experiments, the benefits in efficiency of using a two-stage PI estimator instead of a one-stage estimator (i.e., Hotz-Miller) are very significant. More interestingly, the benefits of MLE relative to two-stage PI are small.

[1]  V. J. Hotz,et al.  Conditional Choice Probabilities and the Estimation of Dynamic Models , 1993 .

[2]  Kenneth I. Wolpin,et al.  The Solution and Estimation of Discrete Choice Dynamic Programming Models by Simulation and Interpol , 1994 .

[3]  V. J. Hotz,et al.  A Simulation Estimator for Dynamic Models of Discrete Choice , 1994 .

[4]  Steven T. Berry Estimating Discrete-Choice Models of Product Differentiation , 1994 .

[5]  John Rust Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher , 1987 .

[6]  Victor Aguirregabiria THE DYNAMICS OF MARKUPS AND INVENTORIES IN RETAILING FIRMS , 1999 .

[7]  Herman J. Bierens,et al.  Uniform Consistency of Kernel Estimators of a Regression Function under Generalized Conditions , 1983 .

[8]  A. Pakes Dynamic Structural Models: Problems and Prospects. Mixed Continuous Discrete Controls and Market Interactions , 1991 .

[9]  Miguel A. Delgado,et al.  On asymptotic inferences in non-parametric and semiparametric models with discrete and mixed regressors , 1995 .

[10]  M. Keane,et al.  The Career Decisions of Young Men , 1997, Journal of Political Economy.

[11]  K. Judd Numerical methods in economics , 1998 .

[12]  M. Slade Optimal Pricing with Costly Adjustment: Evidence from Retail-Grocery Prices , 1998 .

[13]  Steven Stern,et al.  Job Exit Behavior of Older Men , 1991 .

[14]  Sumru Altug,et al.  The Effect of Work Experience on Female Wages and Labour Supply , 1998 .

[15]  John Rust,et al.  Estimation of Dynamic Structural Models: Problems and Prospects , 1991 .

[16]  Robert A. Miller Job Matching and Occupational Choice , 1984, Journal of Political Economy.

[17]  John Rust Maximum likelihood estimation of discrete control processes , 1988 .

[18]  Namkee Ahn,et al.  Measuring the Value of Children by Sex and Age Using a Dynamic Programming Model , 1995 .

[19]  Donna B. Gilleskie,et al.  A Dynamic Stochastic Model of Medical Care Use and Work Absence , 1998 .

[20]  John Rust Using Randomization to Break the Curse of Dimensionality , 1997 .

[21]  D. McFadden Econometric Models of Probabilistic Choice , 1981 .

[22]  C. Manski Dynamic choice in social settings: Learning from the experiences of others , 1993 .

[23]  Miguel A. Delgado,et al.  Nonparametric and Semiparametric Estimation with Discrete Regressors , 1995 .

[24]  John Rust Numerical dynamic programming in economics , 1996 .

[25]  John Rust A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-Random, and Deterministic Discretizations , 1997 .

[26]  J. Kadane Structural Analysis of Discrete Data with Econometric Applications , 1984 .

[27]  Steven T. Berry,et al.  Automobile Prices in Market Equilibrium , 1995 .

[28]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[29]  Christian Gourieroux,et al.  Statistics and econometric models , 1995 .

[30]  John Rust,et al.  Structural estimation of markov decision processes , 1986 .

[31]  Kenneth I. Wolpin,et al.  Public-Policy Uses of Discrete-Choice Dynamic Programming Models , 1996 .

[32]  John Rust,et al.  How Social Security and Medicare affect retirement behavior in a world of incomplete markets , 1994 .