Prediction and behavioral analysis of travel mode choice: A comparison of machine learning and logit models

Abstract Some recent studies have shown that machine learning can achieve higher predictive accuracy than logit models. However, existing studies rarely examine behavioral outputs (e.g., marginal effects and elasticities) that can be derived from machine-learning models and compare the results with those obtained from logit models. In other words, there has not been a comprehensive comparison between logit models and machine learning that covers both prediction and behavioral analysis, two equally important subjects in travel-behavior study. This paper addresses this gap by examining the key differences in model development, evaluation, and behavioral interpretation between logit and machine-learning models for mode-choice modeling. We empirically evaluate the two approaches using stated-preference survey data. Consistent with the literature, this paper finds that the best-performing machine-learning model, random forest, has significantly higher predictive accuracy than multinomial logit and mixed logit models. The random forest model and the two logit models largely agree on several aspects of the behavioral outputs, including variable importance and the direction of association between independent variables and mode choice. However, we find that the random forest model produces behaviorally unreasonable arc elasticities and marginal effects when these behavioral outputs are computed from a standard approach. After the introduction of some modifications that overcome the limitations of tree-based models, the results are improved to some extent. There appears to be a tradeoff between predictive accuracy and behavioral soundness when choosing between machine learning and logit models in mode-choice modeling.

[1]  Rodrigo Acuna-Agost,et al.  Airline itinerary choice modeling using machine learning , 2019, Journal of Choice Modelling.

[2]  David A. Hensher,et al.  The Mixed Logit Model: the State of Practice and Warnings for the Unwary , 2001 .

[3]  Anjali Awasthi,et al.  Prediction of Individual Travel Mode with Evidential Neural Network Model , 2013 .

[4]  Sendhil Mullainathan,et al.  Machine Learning: An Applied Econometric Approach , 2017, Journal of Economic Perspectives.

[5]  Catherine L. Ross,et al.  Machine Learning Travel Mode Choices: Comparing the Performance of an Extreme Gradient Boosting Model with a Multinomial Logit Model , 2018 .

[6]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[7]  M. Gevrey,et al.  Review and comparison of methods to study the contribution of variables in artificial neural network models , 2003 .

[8]  Pascal Van Hentenryck,et al.  Benders Decomposition for the Design of a Hub and Shuttle Public Transit System , 2015, Transp. Sci..

[9]  S. Menard Six Approaches to Calculating Standardized Logistic Regression Coefficients , 2004 .

[10]  K. Train Discrete Choice Methods with Simulation , 2003 .

[11]  Nima Golshani,et al.  Modeling travel mode and timing decisions: Comparison of artificial neural networks and copula-based joint model , 2018 .

[12]  Mark Wardman,et al.  Meta-analysis of UK values of travel time: An update , 2011 .

[13]  Chi Xie,et al.  WORK TRAVEL MODE CHOICE MODELING USING DATA MINING: DECISION TREES AND NEURAL NETWORKS , 2002 .

[14]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[15]  Yunlong Zhang,et al.  Travel Mode Choice Modeling with Support Vector Machines , 2008 .

[16]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[17]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[18]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[19]  Julian Hagenauer,et al.  A comparative study of machine learning classifiers for modeling travel mode choice , 2017, Expert Syst. Appl..

[20]  D. McFadden,et al.  MIXED MNL MODELS FOR DISCRETE RESPONSE , 2000 .

[21]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Gavin C. Cawley,et al.  On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[23]  Hjp Harry Timmermans,et al.  Using ensembles of decision trees to predict transport mode choice decisions: effects on predictive success and uncertainty estimates , 2014 .

[24]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[25]  Cinzia Cirillo,et al.  Validation and Forecasts in Models Estimated from Multiday Travel Survey , 2010 .

[26]  Trevor Hastie,et al.  Causal Interpretations of Black-Box Models , 2019, Journal of business & economic statistics : a publication of the American Statistical Association.

[27]  Hiroyuki Iseki,et al.  Not All Transfers Are Created Equal: Towards a Framework Relating Transfer Connectivity to Travel Behaviour , 2009 .

[28]  Long Cheng,et al.  Applying a random forest method approach to model travel mode choice behavior , 2019, Travel Behaviour and Society.

[29]  André Luiz Cunha,et al.  Estimating motorized travel mode choice using classifiers: An application for high-dimensional multicollinear data , 2017 .

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[32]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[33]  Subeh Chowdhury,et al.  Transfer penalties in multimodal public transport networks , 2018, Transportation Research Part A: Policy and Practice.

[34]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[35]  Jonathan Levine,et al.  Integrating ridesourcing services with public transit: An evaluation of traveler responses combining revealed and stated preference data , 2019, Transportation Research Part C: Emerging Technologies.

[36]  Emil Pitkin,et al.  Peeking Inside the Black Box: Visualizing Statistical Learning With Plots of Individual Conditional Expectation , 2013, 1309.6392.

[37]  Hichem Omrani,et al.  Predicting Travel Mode of Individuals by Machine Learning , 2015 .

[38]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[39]  Susan Athey,et al.  Beyond prediction: Using big data for policy problems , 2017, Science.

[40]  John M. Rose,et al.  Applied Choice Analysis: A Primer , 2005 .

[41]  Donald Eugene. Farrar,et al.  Multicollinearity in Regression Analysis; the Problem Revisited , 2011 .

[42]  Chih-Jen Lin,et al.  Probability Estimates for Multi-class Classification by Pairwise Coupling , 2003, J. Mach. Learn. Res..

[43]  Guillaume-Alexandre Bilodeau,et al.  Discriminative conditional restricted Boltzmann machine for discrete choice and latent variable modelling , 2017, ArXiv.

[44]  H. Allen Klaiber,et al.  Do Random Coefficients and Alternative Specific Constants Improve Policy Analysis? An Empirical Investigation of Model Fit and Prediction , 2019 .

[45]  K. Train,et al.  Forecasting new product penetration with flexible substitution patterns , 1998 .

[46]  Xiqun Chen,et al.  Understanding ridesplitting behavior of on-demand ride services: An ensemble learning approach , 2017 .

[47]  Mark Last,et al.  Improving Stability of Decision Trees , 2002, Int. J. Pattern Recognit. Artif. Intell..