Machine learning applications in activity-travel behaviour research: a review

ABSTRACT This paper reviews the activity-travel behaviour literature that employs Machine Learning (ML) techniques for empirical analysis and modelling. Machine Learning algorithms, which attempt to build intelligence utilizing the availability of large amounts of data, have emerged as powerful tools in the fields of pattern recognition and big data analysis. These techniques have been applied in activity-travel behaviour studies since the early ’90s when Artificial Neural Networks (ANN) were employed to model mode choice decisions. AMOS, an activity-based modelling system developed in the mid-’90s, has ANN at its core to model and predict individual responses to travel demand management measures. In the dawn of 2000, ALBATROSS, a comprehensive activity-based travel demand modelling system, was proposed by Arentze and Timmermans using Decision Trees. Since then researchers have been exploring ML techniques like Support Vector Machines (SVM), Decision Trees (DT), Neural Networks (NN), Bayes Classifiers, and more recently, Ensemble Learners to model and predict activity-travel behaviour. A large number of publications over the years and an upward trend in the number of published articles over time indicate that Machine Learning is a promising tool for activity-travel behaviour analysis and prediction. This article, first of its kind in the literature, reviews these studies and explores the trends in activity-travel behaviour research that apply ML techniques. The review finds that mode choice decisions have received wide attention in the literature on ML applications. It was observed that most of the studies identify the lack of interpretability as a serious shortcoming in ML techniques. However, very few studies have attempted to improve the interpretability of the models. Further, some studies report the importance of feature engineering in ML-based studies, but very few studies adopt feature engineering before model development. Spatiotemporal transferability of models is another issue that has received minimal attention in the literature. In the end, the paper discusses possible directions for future research in the area of activity-travel behaviour modelling using ML techniques.

[1]  Tao Cheng,et al.  Inferring hybrid transportation modes from sparse GPS data using a moving window SVM classification , 2012, Comput. Environ. Urban Syst..

[2]  Hjp Harry Timmermans,et al.  A learning-based transportation oriented simulation system , 2004 .

[3]  Wei Wang,et al.  Evaluating staggered working hours using a multi-agent-based Q-learning model , 2014 .

[4]  Qiuping Wang,et al.  A Bayesian Network Model on the Public Bicycle Choice Behavior of Residents: A Case Study of Xi’an , 2017 .

[5]  Yalin Baştanlar,et al.  Introduction to machine learning. , 2014, Methods in molecular biology.

[6]  I-Cheng Yeh,et al.  First and second order sensitivity analysis of MLP , 2010, Neurocomputing.

[7]  C. S. Pitombo,et al.  An exploratory analysis of relationships between socioeconomic, land use, activity participation variables and travel patterns , 2011 .

[8]  Shuo Yang,et al.  The research on prediction models for urban family member trip generation , 2016 .

[9]  Tai-Yu Ma,et al.  Bayesian networks for constrained location choice modeling using structural restrictions and model averaging , 2018, European Journal of Transport and Infrastructure Research.

[10]  San Cristóbal Mateo,et al.  The Lack of A Priori Distinctions Between Learning Algorithms , 1996 .

[11]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[12]  Ondřej Přibyl,et al.  Simulation of Daily Activity Patterns Incorporating Interactions Within Households: Algorithm Overview and Performance , 2005 .

[13]  Marco Diana,et al.  Classification of Tours in the U.S. National Household Travel Survey through Clustering Techniques , 2016 .

[14]  Geert Wets,et al.  Nonlinear Models for Determining Mode Choice , 2007, EPIA Workshops.

[15]  Shangguang Wang,et al.  Learning Transportation Annotated Mobility Profiles from GPS Data for Context-Aware Mobile Services , 2016, 2016 IEEE International Conference on Services Computing (SCC).

[16]  Matthew J. Roorda,et al.  Prototype Model of Household Activity-Travel Scheduling , 2003 .

[17]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[18]  Hing-Po Lo,et al.  Lifestyle classifications with and without activity-travel patterns , 2009 .

[19]  王炜,et al.  Multiagent-Based Simulation of Temporal-Spatial Characteristics of Activity-Travel Patterns Using Interactive Reinforcement Learning , 2014 .

[20]  Philip S. Yu,et al.  Transportation mode detection using mobile phones and GIS information , 2011, GIS.

[21]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[22]  Ta Theo Arentze,et al.  Reinduction of Albatross Decision Rules with Pooled Activity-Travel Diary Data and an Extended Set of Land Use and Cost-Related Condition States , 2003 .

[23]  A. Bazzan,et al.  Reinforcement learning for route choice in an abstract traffic scenario , 2012 .

[24]  Davy Janssens,et al.  Improving the Performance of a Multi-Agent Rule-Based Model for Activity Pattern Decisions Using Bayesian Networks , 1997 .

[25]  Jian-Min Xu,et al.  A dynamic route guidance arithmetic based on reinforcement learning , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[26]  André Luiz Cunha,et al.  Estimating motorized travel mode choice using classifiers: An application for high-dimensional multicollinear data , 2017 .

[27]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[28]  Cody Rose,et al.  A FRAMEWORK FOR MODELING OCCUPANCY SCHEDULES AND LOCAL TRIPS BASED ON ACTIVITY BASED SURVEYS , 2014 .

[29]  Harry Timmermans,et al.  Spatial Transferability of the Albatross Model System: Empirical Evidence from Two Case Studies , 2002 .

[30]  Giulio Erberto Cantarella,et al.  Multilayer Feedforward Networks for Transportation Mode Choice Analysis: An Analysis and a Comparison with Random Utility Models , 2005 .

[31]  Matthew J. Roorda,et al.  Incorporating Within-Household Interactions into Mode Choice Model with Genetic Algorithm for Parameter Estimation , 2006 .

[32]  Abolfazl Mohammadian,et al.  Application of Artificial Neural Network Models to Activity Scheduling Time Horizon , 2003 .

[33]  Jean-Claude Thill,et al.  Tree Induction of Spatial Choice Behavior , 1999 .

[34]  Tai-Yu Ma,et al.  Bayesian Networks for Multimodal Mode Choice Behavior Modelling: A Case Study for the Cross Border Workers of Luxembourg , 2015 .

[35]  Wei Yu,et al.  Making pervasive sensing possible: Effective travel mode sensing based on smartphones , 2016, Comput. Environ. Urban Syst..

[36]  Joseph Y. J. Chow,et al.  Causal structure learning for travel mode choice using structural restrictions and model averaging algorithm , 2017 .

[37]  Na Chen,et al.  Effects of neighborhood types & socio-demographics on activity space , 2016 .

[38]  Dan Zhao,et al.  Application of wavelet neural networks for trip chaining recognition , 2010, 2010 Sixth International Conference on Natural Computation.

[39]  Anjali Awasthi,et al.  Prediction of Individual Travel Mode with Evidential Neural Network Model , 2013 .

[40]  Baher Abdulhai,et al.  Using Smartphones and Sensor Technologies to Automate Collection of Travel Data , 2013 .

[41]  Harry J. P. Timmermans,et al.  Simulating the Influence of Life Trajectory Events on Transport Mode Behavior in an Agent-based System , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[42]  Byungkyu Park,et al.  Route choice modeling with Support Vector Machine , 2017 .

[43]  Ta Theo Arentze,et al.  Experiences with developing ALBATROSS: a learning-based transportation oriented simulation system , 1998 .

[44]  Soora Rasouli,et al.  Activity-based models of travel demand: promises, progress and prospects , 2014 .

[45]  Geert Wets,et al.  Identifying Decision Structures Underlying Activity Patterns: An Exploration of Data Mining Algorithms , 2000 .

[46]  Nedal T. Ratrout,et al.  Mode choice behavior of high school goers: Evaluating logistic regression and MLP neural networks , 2018, Case Studies on Transport Policy.

[47]  Xian Li,et al.  A Bayesian Network Modeling for Departure Time Choice: A Case Study of Beijing Subway , 2018 .

[48]  Ana Aguiar,et al.  Impact of Crowdsourced Data Quality on Travel Pattern Estimation , 2017, CrowdSenSys@SenSys.

[49]  Vibhav Gogate,et al.  Modeling Transportation Routines using Hybrid Dynamic Mixed Networks , 2005, UAI.

[50]  T. Arentze,et al.  The Impact of Simplification in a Sequential Rule-Based Model of Activity-Scheduling Behavior , 2005 .

[51]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[52]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[53]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[54]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[55]  Martin F. Lambert,et al.  A probabilistic method for assisting knowledge extraction from artificial neural networks used for hydrological prediction , 2006, Math. Comput. Model..

[56]  Yoshihide Sekimoto,et al.  Trip reconstruction and transportation mode extraction on low data rate GPS data from mobile phone , 2013 .

[57]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[58]  Xiaolei Ma,et al.  Mining smart card data for transit riders’ travel patterns , 2013 .

[59]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[60]  Una-May O'Reilly,et al.  Machine learning or discrete choice models for car ownership demand estimation and prediction? , 2017, 2017 5th IEEE International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS).

[61]  Fei Yang,et al.  Multimode trip information detection using personal trajectory data , 2016, J. Intell. Transp. Syst..

[62]  Nima Golshani,et al.  Modeling travel mode and timing decisions: Comparison of artificial neural networks and copula-based joint model , 2018 .

[63]  C. Pronello,et al.  Travellers’ profiles definition using statistical multivariate analysis of attitudinal variables , 2011 .

[64]  D. Ragland,et al.  Bicycle commuting market analysis using attitudinal market segmentation approach , 2013 .

[65]  Ning Jia,et al.  A Day-to-Day Route Choice Model Based on Reinforcement Learning , 2014 .

[66]  Chenfeng Xiong,et al.  A mixed Bayesian network for two-dimensional decision modeling of departure time and mode choice , 2018 .

[67]  Lei Zhang,et al.  Spatial Transferability of Neural Network Models in Travel Demand Modeling , 2018, J. Comput. Civ. Eng..

[68]  S. Travis Waller,et al.  Developing a disaggregate travel demand system of models using data mining techniques , 2017 .

[69]  Jean-Claude Thill,et al.  TRIP DISTRIBUTION FORECASTING WITH MULTILAYER PERCEPTRON NEURAL NETWORKS: A CRITICAL EVALUATION , 2000 .

[70]  Davy Janssens,et al.  Integrating Bayesian networks and decision trees in a sequential rule-based transportation model , 2006, Eur. J. Oper. Res..

[71]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[72]  Zhicai Juan,et al.  Travel Mode Detection Using GPS Data and Socioeconomic Attributes Based on a Random Forest Classifier , 2018, IEEE Transactions on Intelligent Transportation Systems.

[73]  John D. Kelleher,et al.  Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies , 2015 .

[74]  Q. Tu,et al.  Modeling Mode Choice Behaviors for Public Transport Commuters in Beijing , 2018, Journal of urban planning and development.

[75]  C. S. Pitombo,et al.  An Application of Exploratory Multivariate Data Analysis Techniques in a Peer Study of Land Use Influence on Individual Destination Choices , 2008, 2008 11th IEEE International Conference on Computational Science and Engineering - Workshops.

[76]  Raúl Rojas,et al.  Neural Networks - A Systematic Introduction , 1996 .

[77]  Toshiyuki Yamamoto,et al.  Drivers’ Route Choice Behavior: Analysis by Data Mining Algorithms , 2002 .

[78]  Kenji Kato,et al.  Microsimulation for Commuters' Mode and Discretionary Activities by Using Neural Networks , 2002 .

[79]  K. Nagel,et al.  Generating complete all-day activity plans with genetic algorithms , 2005 .

[80]  Andrei Lobov,et al.  Travel mode estimation for multi-modal journey planner , 2017 .

[81]  Julian Hagenauer,et al.  A comparative study of machine learning classifiers for modeling travel mode choice , 2017, Expert Syst. Appl..

[82]  Yingling Fan,et al.  Trip chain extraction using smartphone-collected trajectory data , 2019 .

[83]  Pei-Fen Kuo,et al.  Variable Selection of Travel Demand Models for Paratransit Service: A Data Mining Approach , 2015 .

[84]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[85]  Will Recker,et al.  Activity Pattern Recognition by Using Support Vector Machines with Multiple Classes , 2013 .

[86]  Abolfazl Mohammadian,et al.  Investigating Transferability of National Household Travel Survey Data , 2007 .

[87]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[88]  Chi Xie,et al.  WORK TRAVEL MODE CHOICE MODELING USING DATA MINING: DECISION TREES AND NEURAL NETWORKS , 2002 .

[89]  Lei Liu,et al.  A time-use activity-pattern recognition model for activity-based travel demand modeling , 2019 .

[90]  David A. Hensher,et al.  A comparison of the predictive potential of artificial neural networks and nested logit models for commuter mode choice , 1997 .

[91]  Eric J. Miller,et al.  Nested Logit Models and Artificial Neural Networks for Predicting Household Automobile Choices: Comparison of Performance , 2002 .

[92]  Yunlong Zhang,et al.  Travel Mode Choice Modeling with Support Vector Machines , 2008 .

[93]  Hjp Harry Timmermans,et al.  Representing mental maps and cognitive learning in micro-simulation models of activity-travel choice dynamics , 2005 .

[94]  Davy Janssens,et al.  Simulation of sequential data: An enhanced reinforcement learning approach , 2009, Expert Syst. Appl..

[95]  Ilan Salomon,et al.  Neural network analysis of travel behavior : Evaluating tools for prediction , 1996 .

[96]  Yang Liu,et al.  Applied Gaussian Process in Optimizing Unburned Carbon Content in Fly Ash for Boiler Combustion , 2017 .

[97]  Muhamad Nazri Borhan,et al.  Analysis of transportation mode choice using a comparison of artificial neural network and multinomial logit models , 2017 .

[98]  Hjp Harry Timmermans,et al.  Comparison of advanced imputation algorithms for detection of transportation mode and activity episode using GPS data , 2016 .

[99]  K. Goulias,et al.  Immigration, residential location, car ownership, and commuting behavior: a multivariate latent class analysis from California , 2008 .