Minorization-Maximization (MM) algorithms for semiparametric logit models: Bottlenecks, extensions, and comparisons

Motivated by the promising performance of alternative estimation methods for mixed logit models, in this paper we derive, implement, and test minorization-maximization (MM) algorithms to estimate the semiparametric logit-mixed logit (LML) and mixture-of-normals multinomial logit (MON-MNL) models. In particular, we show that the reported computational efficiency of the MM algorithm is actually lost for large choice sets. Because the logit link that represents the parameter space in LML is intrinsically treated as a large choice set, the MM algorithm for LML actually becomes unfeasible to use in practice. We thus propose a faster MM algorithm that revisits a simple step-size correction. In a Monte Carlo study, we compare the maximum simulated likelihood estimator (MSLE) with the algorithms that we derive to estimate LML and MON-MNL models. Whereas in LML estimation alternative algorithms are computationally uncompetitive with MSLE, the faster-MM algorithm appears emulous in MON-MNL estimation. Both algorithms – faster-MM and MSLE – could recover parameters as well as standard errors at a similar precision in both models. We further show that parallel computation could reduce estimation time of faster-MM by 45% to 80%. Even though faster-MM could not surpass MSLE with analytical gradient (because MSLE also leveraged similar computational gains), parallel faster-MM is a competitive replacement to MSLE for MON-MNL that obviates computation of complex analytical gradients, which is a very attractive feature to integrate it into a flexible estimation software. We also compare different algorithms in an empirical application to estimate consumer’s willingness to adopt electric motorcycles in Solo, Indonesia. The results of the empirical application are consistent with those of the Monte Carlo study.

[1]  Nada Wasi,et al.  COMPARING ALTERNATIVE MODELS OF HETEROGENEITY IN CONSUMER CHOICE BEHAVIOR , 2012 .

[2]  Chandra R. Bhat,et al.  An Endogenous Segmentation Mode Choice Model with an Application to Intercity Travel , 1997, Transp. Sci..

[3]  B. Lindsay,et al.  Monotonicity of quadratic-approximation algorithms , 1988 .

[4]  Paul A. Ruud,et al.  Extensions of estimation methods using the EM algorithm , 1991 .

[5]  Christopher R. Cherry,et al.  Use characteristics and mode choice behavior of electric bike users in China , 2007 .

[6]  Kenneth Train,et al.  Mixed logit with a flexible mixing distribution , 2016 .

[7]  MM Algorithm for General Mixed Multinomial Logit Models , 2017 .

[8]  Michel Bierlaire,et al.  PythonBiogeme: a short introduction , 2016 .

[9]  David A. Hensher,et al.  Revealing additional dimensions of preference heterogeneity in a latent class mixed multinomial logit model , 2010 .

[10]  Xiao-Li Meng,et al.  Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm , 1991 .

[11]  Kenneth Train,et al.  EM algorithms for nonparametric estimation of mixing distributions , 2008 .

[12]  Christopher R. Cherry,et al.  The effect of incentives and technology on the adoption of electric motorcycles: A stated choice experiment in Vietnam , 2013 .

[13]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[14]  Prateek Bansal,et al.  Extending the logit-mixed logit model for a combination of random and fixed parameters , 2017, Journal of Choice Modelling.

[15]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  I. Meilijson A fast improvement to the EM algorithm on its own terms , 1989 .

[18]  Cristian Angelo Guevara,et al.  A Monte Carlo experiment to analyze the curse of dimensionality in estimating random coefficients models with a full variance-covariance matrix , 2012 .

[19]  Jeremy T. Fox,et al.  A simple estimator for the distribution of random coefficients , 2011 .

[20]  Assessing User Benefits with Discrete Choice Models , 2005 .

[21]  Chandra R. Bhat,et al.  A new mixed MNP model accommodating a variety of dependent non-normal coefficient distributions , 2017 .

[22]  K. Train Discrete Choice Methods with Simulation , 2003 .

[23]  Ricardo A. Daziano,et al.  Conditional-logit Bayes estimators for consumer valuation of electric vehicle driving range , 2013 .

[24]  D. Hunter,et al.  Optimization Transfer Using Surrogate Objective Functions , 2000 .

[25]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[26]  Rico Krueger,et al.  Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions , 2017 .

[27]  R. Jennrich,et al.  Standard errors for EM estimation , 2000 .

[28]  Mogens Fosgerau,et al.  Investigating the distribution of the value of travel time savings , 2006 .

[29]  Jeremy T. Fox,et al.  Linear Regression Estimation of Discrete Choice Models with Nonparametric Distributions of Random Coefficients , 2007 .

[30]  Prateek Bansal,et al.  Comparison of parametric and semiparametric representations of unobserved preference heterogeneity in logit models , 2018, Journal of Choice Modelling.

[31]  Erick Guerra,et al.  Electric vehicles, air pollution, and the motorcycle city: A stated preference survey of consumers’ willingness to adopt electric motorcycles in Solo, Indonesia , 2017, Transportation Research Part D: Transport and Environment.

[32]  Keemin Sohn,et al.  An Expectation-Maximization Algorithm to Estimate the Integrated Choice and Latent Variable Model , 2017, Transp. Sci..

[33]  David A. Hensher,et al.  The Mixed Logit Model: the State of Practice and Warnings for the Unwary , 2001 .

[34]  Liberato Camilleri Bias of Standard Errors in Latent Class Model Applications Using Newton-Raphson and EM Algorithms , 2009, J. Adv. Comput. Intell. Intell. Informatics.

[35]  Michel Bierlaire,et al.  A practical test for the choice of mixing distribution in discrete choice models , 2005 .

[36]  Riccardo Scarpa,et al.  A Monte Carlo Evaluation of the Logit-Mixed Logit under Asymmetry and Multimodality , 2017 .

[37]  Adam Domanski,et al.  Estimation and welfare analysis from mixed logit models with large choice sets , 2018, Journal of Environmental Economics and Management.