A Monte Carlo experiment to analyze the curse of dimensionality in estimating random coefficients models with a full variance-covariance matrix

When the dimension of the vector of estimated parameters increases, simulation based methods become impractical, because the number of draws required for estimation grows exponentially with the number of parameters. In simulation methods, the lack of empirical identification when the number of parameters increases is usually known as the “curse of dimensionality” in the simulation methods. We investigate this problem in the case of the random coefficients Logit model. We compare the traditional Maximum Simulated Likelihood (MSL) method with two alternative estimation methods: the Expectation–Maximization (EM) and the Laplace Approximation (HH) methods that do not require simulation. We use Monte Carlo experimentation to investigate systematically the performance of the methods under different circumstances, including different numbers of variables, sample sizes and structures of the variance–covariance matrix. Results show that indeed MSL suffers from lack of empirical identification as the dimensionality grows while EM deals much better with this estimation problem. On the other hand, the HH method, although not being simulation-based, showed poor performance with large dimensions, principally because of the necessity of inverting large matrices. The results also show that when MSL is empirically identified this method seems superior to EM and HH in terms of ability to recover the true parameters and estimation time.

[1]  C. Bhat Simulation estimation of mixed discrete choice models using randomized and scrambled Halton sequences , 2003 .

[2]  J. Cramer,et al.  Robustness of Logit Analysis: Unobserved Heterogeneity and Mis-specified Disturbances* , 2007 .

[3]  J. Halton On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals , 1960 .

[4]  V. Hajivassiliou,et al.  Smooth unbiased multivariate probability simulators for maximum likelihood estimation of limited dependent variable models , 1993 .

[5]  Andrew T. A. Wood,et al.  Laplace approximations for hypergeometric functions with matrix argument , 2002 .

[6]  Zsolt Sándor,et al.  Quasi-random simulation of discrete choice models , 2004 .

[7]  David A. Hensher,et al.  The Mixed Logit Model: the State of Practice and Warnings for the Unwary , 2001 .

[8]  C. A. Guevara,et al.  Estimating Random Coefficient Logit Models with Full Covariance Matrix , 2009 .

[9]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[10]  Elisabetta Cherchi,et al.  Empirical Identification in the Mixed Logit Model: Analysing the Effect of Data Richness , 2008 .

[11]  John M. Rose,et al.  Stated Preference Experimental Design Strategies , 2007 .

[12]  A. Tversky Elimination by aspects: A theory of choice. , 1972 .

[13]  Stephane Hess,et al.  On the use of a Modified Latin Hypercube Sampling (MLHS) method in the estimation of a Mixed Logit Model for vehicle choice , 2006 .

[14]  David A. Hensher,et al.  Handbook of Transport Modelling , 2000 .

[15]  Juan de Dios Ortúzar,et al.  Can mixed logit reveal the actual data generating process? Some implications for environmental assessment , 2010 .

[16]  Joan L. Walker Mixed Logit (or Logit Kernel) Model: Dispelling Misconceptions of Identification , 2002 .

[17]  Paul A. Ruud,et al.  Approximation and Simulation of the Multinomial Probit Model : An Analysis of Covariance Matrix Estimation , 1996 .

[18]  H. Williams,et al.  Behavioural theories of dispersion and the mis-specification of travel demand models☆ , 1982 .

[19]  Juan de Dios Ortúzar,et al.  On the Treatment of Repeated Observations in Panel Data: Efficiency of Mixed Logit Parameter Estimates , 2011 .

[20]  Chandra R. Bhat,et al.  The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models , 2011 .

[21]  Kenneth Train,et al.  EM algorithms for nonparametric estimation of mixing distributions , 2008 .

[22]  Erik Meijer,et al.  Measuring Welfare Effects in Models with Random Coefficients , 2000 .

[23]  Kenneth Train,et al.  A Recursive Estimator for Random Coefficient Models , 2007 .

[24]  Lesley Chiou,et al.  Masking Identification of Discrete Choice Models Under Simulation Methods , 2007 .

[25]  William N. Venables,et al.  Modern Applied Statistics with S-Plus. , 1996 .

[26]  C. Bhat Quasi-random maximum simulated likelihood estimation of the mixed multinomial logit model , 2001 .

[27]  Chandra R. Bhat,et al.  An Endogenous Segmentation Mode Choice Model with an Application to Intercity Travel , 1997, Transp. Sci..

[28]  K. Train,et al.  Mixed Logit with Repeated Choices: Households' Choices of Appliance Efficiency Level , 1998, Review of Economics and Statistics.

[29]  Joan L. Walker Extended discrete choice models : integrated framework, flexible error structures, and latent variables , 2001 .

[30]  D. McFadden A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration , 1989 .

[31]  Russel E. Caflisch,et al.  Quasi-Random Sequences and Their Discrepancies , 1994, SIAM J. Sci. Comput..

[32]  M. Harding,et al.  Using a Laplace Approximation to Estimate the Random Coefficients Logit Model By Nonlinear Least Squares , 2006 .

[33]  Chandra R. Bhat,et al.  A simulation evaluation of the maximum approximate composite marginal likelihood (MACML) estimator for mixed multinomial probit models , 2011 .

[34]  K. Train Recreation Demand Models with Taste Differences Over People , 1998 .

[35]  M. Bliemer,et al.  Construction of experimental designs for mixed logit models allowing for correlation across choice observations , 2010 .