Travel Mode Detection Using GPS Data and Socioeconomic Attributes Based on a Random Forest Classifier

The past few years have witnessed the rapid growth in the collection of large-scale GPS data via smartphone-based travel surveys around the world, following which transportation modes detection received significant attention. A mass of methods varying from Criteria-based rules to Machine Learning technology were employed to recognize the travel modes. However, the limited sample size, deficient feature selection and the less emphasis on addressing confusion modes, which leave room for improvement. This paper therefore sought to develop and evaluate a Random Forest classifier combined with a rule-based method to detect six travel modes (subway, walking, bicycle, e-bike, bus and car). Seven GPS-related variables are selected as feature set from the initial list of 22 variables. Consequently, more than 98% subway trips were correctly identified and the overall accuracy of the rest five modes classification is obtained as high as 93.11%. More than 85% trips were successfully identified for each mode except for the bus. More importantly, results show that socioeconomic attributes data could significantly improve the prediction of e-bike and address the confusion between bus and car modes. The employment of ROC curve provides a statistical proof to the excellent classification capacity of Random Forest in this study. Besides, the comparison with two representative classifiers demonstrates the applicability of Random Forest classifier for travel modes detection incorporating multi-source attributes.

[1]  Tao Cheng,et al.  Inferring hybrid transportation modes from sparse GPS data using a moving window SVM classification , 2012, Comput. Environ. Urban Syst..

[2]  Xing Xie,et al.  Understanding transportation modes based on GPS data for web applications , 2010, TWEB.

[3]  Tom M Mitchell,et al.  Mining Our Reality , 2009, Science.

[4]  Peter R. Stopher,et al.  Search for a global positioning system device to measure person travel , 2008 .

[5]  Kees Maat,et al.  Deriving and validating trip purposes and travel modes for multi-day GPS-based travel surveys: A large-scale application in the Netherlands , 2009 .

[6]  Toshiyuki Yamamoto,et al.  Deriving Personal Trip Data from GPS Data: A Literature Review on the Existing Methodologies , 2014 .

[7]  Henry A. Kautz,et al.  Inferring High-Level Behavior from Low-Level Sensors , 2003, UbiComp.

[8]  Zhicai Juan,et al.  A trip detection model for individual smartphone-based GPS records with a novel evaluation method , 2017 .

[9]  Catherine T. Lawson,et al.  Evaluating the feasibility of a passive travel survey collection in a complex urban environment: Lessons learned from the New York City case study , 2010 .

[10]  Henry A. Kautz,et al.  Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields , 2007, Int. J. Robotics Res..

[11]  Hjp Harry Timmermans,et al.  Transportation mode recognition using GPS and accelerometer data , 2013 .

[12]  Amer Shalaby,et al.  Enhanced System for Link and Mode Identification for Personal Travel Surveys Based on Global Positioning Systems , 2006 .

[13]  Baher Abdulhai,et al.  Impact of Sampling Rate of GPS-Enabled Cell Phones on Mode Detection and GIS Map Matching Performance , 2007 .

[14]  Patrick Tracy McGowen,et al.  Evaluating the Potential To Predict Activity Types from GPS and GIS Data , 2007 .

[15]  Wei Yu,et al.  Making pervasive sensing possible: Effective travel mode sensing based on smartphones , 2016, Comput. Environ. Urban Syst..

[16]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[17]  Sasu Tarkoma,et al.  Accelerometer-based transportation mode detection on smartphones , 2013, SenSys '13.

[18]  Monika Sester,et al.  Multi-stage approach to travel-mode segmentation and classification of gps traces , 2012 .

[19]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[20]  Guangnian Xiao,et al.  Inferring Trip Ends from GPS Data Based on Smartphones in Shanghai , 2014 .

[21]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[22]  Tao Cheng,et al.  A spatio-temporal approach for identifying the sample size for transport mode detection from GPS-based travel surveys: A case study of London's road network , 2014 .

[23]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[24]  Jianhe Du,et al.  Increasing the accuracy of trip rate information from passive multi-day GPS travel datasets: Automatic trip end identification issues , 2007 .

[25]  P R Stopher,et al.  Use of an activity-based diary to collect household travel data , 1992 .

[26]  Christian Rudloff,et al.  Detecting Travel Modes and Profiling Commuter Habits Solely Based on GPS Data , 2010 .

[27]  Catherine T. Lawson,et al.  A GPS/GIS method for travel mode detection in New York City , 2012, Comput. Environ. Urban Syst..

[28]  Mahmoud Mesbah,et al.  Developing and Validating a Statistical Model for Travel Mode Identification on Smartphones , 2016, IEEE Transactions on Intelligent Transportation Systems.

[29]  Otto Anker Nielsen,et al.  Improved methods to deduct trip legs and mode from travel surveys using wearable GPS devices: A case study from the Greater Copenhagen area , 2015, Comput. Environ. Urban Syst..

[30]  Miguel A. Labrador,et al.  Automating mode detection for travel behaviour analysis by using global positioning systemsenabled mobile phones and neural networks , 2010 .