Learning Daily Activity Sequences of Population Groups using Random Forest Theory

The choice of daily activity sequences differs between individuals based on their socio-demographic characteristics and their health and/or mobility status. The aim of this paper is to provide an improved methodology for learning and modeling the daily activity engagement patterns of individuals using a state-of-the-art machine learning algorithm. The dependencies between activity type, activity frequency, activity sequence, and socio-demographic characteristics of individuals are taken into account by employing a random forest model. In order to capture the heterogeneity and diversity among the predictor variables, we employed two different methods for split selection in the random forest algorithm: Classification and Regression Tree (CART) and curvature search. These two methods were examined under two different layer settings. In the first setting, the algorithm grows trees using all alternative predictor variables, whereas in the second setting the importance of the predictor variables is estimated and then the algorithm grows trees using only high-score predictor variables. The models were applied to time use data from the large Halifax Space-Time Activity Research (STAR) household travel diary survey. We evaluated the estimation accuracy of the proposed models using confusion matrix, transition matrix, and sequential alignment techniques. Results show that the random forest model with CART split selection using the first layer setting has the best accuracy in replicating activity agendas and activity sequences of individuals. The results of this paper are expected to be implemented within the activity-based travel demand model, Scheduler for Activities, Locations, and Travel (SALT).

[1]  Soora Rasouli,et al.  Activity-based models of travel demand: promises, progress and prospects , 2014 .

[2]  Mohammad Hesam Hafezi,et al.  Daily activity and travel sequences of students, faculty and staff at a large Canadian university , 2018 .

[3]  Hjp Harry Timmermans,et al.  A learning-based transportation oriented simulation system , 2004 .

[4]  Chinh Q. Ho,et al.  Tour-based mode choice of joint household travel patterns on weekend and weekday , 2013, Transportation.

[5]  D. McFadden Econometric Models for Probabilistic Choice Among Products , 1980 .

[6]  Hugh Millward,et al.  Time use, travel behavior, and the rural–urban continuum: results from the Halifax STAR project , 2011 .

[7]  Moshe Ben-Akiva,et al.  Activity Based Travel Demand Model Systems , 1998 .

[8]  Eiji Hato,et al.  Use of acceleration data for transportation mode prediction , 2015 .

[9]  Torsten Hägerstraand WHAT ABOUT PEOPLE IN REGIONAL SCIENCE , 1970 .

[10]  Lei Liu,et al.  Modeling Activity Scheduling Behavior of Travelers for Activity-Based Travel Demand Models , 2018 .

[11]  Joshua Auld,et al.  Framework for the development of the Agent-based Dynamic Activity Planning and Travel Scheduling (ADAPTS) model , 2009 .

[12]  Ryuichi Kitamura,et al.  Generation of Synthetic Daily Activity-Travel Patterns , 1997 .

[13]  Jingqiu Guo,et al.  Real-time crash prediction on freeways using data mining and emerging techniques , 2017, Journal of Modern Transportation.

[14]  Ta Theo Arentze,et al.  Parametric Action Decision Trees: Incorporating Continuous Attribute Variables Into Rule-Based Models of Discrete Choice , 2007 .

[15]  Chandra R. Bhat,et al.  Comprehensive Econometric Microsimulator for Daily Activity-Travel Patterns , 2004 .

[16]  Ryuichi Kitamura,et al.  Micro-simulation of daily activity-travel patterns for travel demand forecasting , 2000 .

[17]  Mohammad Hesam Hafezi,et al.  Understanding and Modeling the Activity-Travel Behavior of University Commuters at a Large Canadian University , 2018, Journal of Urban Planning and Development.

[18]  Chandra R. Bhat,et al.  A household-level activity pattern generation model with an application for Southern California , 2013 .

[19]  Shan Suthaharan,et al.  Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning , 2015 .

[20]  Will Recker,et al.  Activity Pattern Recognition by Using Support Vector Machines with Multiple Classes , 2013 .

[21]  A. M. Lockwood,et al.  Exploratory Analysis of Weekend Activity Patterns in the San Francisco Bay Area, California , 2005 .

[22]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[23]  Lei Liu,et al.  Identification of Representative Patterns of Time Use Activity Through Fuzzy C-Means Clustering , 2017 .

[24]  Harry J. P. Timmermans,et al.  Unconditional and conditional competing risk models of activity duration and activity sequencing decisions: An empirical comparison , 2002, J. Geogr. Syst..

[25]  S. Rigatti Random Forest. , 2017, Journal of insurance medicine.

[26]  Erwan Scornet,et al.  A random forest guided tour , 2015, TEST.

[27]  Moshe Ben-Akiva,et al.  PII: S0965-8564(99)00043-9 , 2000 .

[28]  Lei Liu,et al.  A time-use activity-pattern recognition model for activity-based travel demand modeling , 2017, Transportation.

[29]  J. Spinney,et al.  Weather impacts on leisure activities in Halifax, Nova Scotia , 2011, International journal of biometeorology.

[30]  Lei Liu,et al.  Trip chaining and tour mode choice of non-workers grouped by daily activity patterns , 2018 .