An automated approach from GPS traces to complete trip information

Abstract Recent advances in communication technologies have enabled researchers to collect travel data based on ubiquitous and location-aware smartphones. These advances hold out the promise of allowing the automatic detection of the critical aspects (mode of transport, purpose, etc.) of people’s trips. Until now, efforts have concentrated on one aspect of trips (e.g. mode) at a time. Such methods have typically been developed on small data sets, often with data collected by researchers themselves and not in large-scale real-world data collection initiatives. This research develops a machine learning-based framework to identify complete trip information based on smartphone location data as well as online data from GTFS (General Transit Feed Specification) and Foursquare data. The framework has the potential to be integrated with smartphone travel surveys to produce all trip characteristics traditionally collected through household travel surveys. We use data from a recent, large-scale smartphone travel survey in Montreal, Canada. The collected smartphone data, augmented with GTFS and Foursquare data are used to train and validate three random forest models to predict mode of transport, transit itinerary as well as trip purpose (activity). According to cross-validation analysis, the random forest models show prediction accuracies of 87%, 81% and 71% for mode, transit itinerary and activity, respectively. The results compare favorably with previous studies, especially when taking the large, real-world nature of the dataset into account. Furthermore, the cross validation results show that the machine learning-based framework is an effective and automated tool to support trip information extraction for large-scale smartphone travel surveys, which have the potential to be a reliable and efficient (in terms of cost and human resources) data collection technique.

[1]  Jun Wu,et al.  Automated time activity classification based on global positioning system (GPS) tracking data , 2011, Environmental health : a global access science source.

[2]  Randall Guensler,et al.  Elimination of the Travel Diary: Experiment to Derive Trip Purpose from Global Positioning System Travel Data , 2001 .

[3]  M. Kwan,et al.  Gendered Space-Time Constraints, Activity Participation and Household Structure: A Case Study Using A GPS-Based Activity Survey in Suburban Beijing, China , 2016 .

[4]  Richard Bellman,et al.  ON A ROUTING PROBLEM , 1958 .

[5]  Yang Wang,et al.  Identifying Different Transportation Modes from Trajectory Data Using Tree-Based Ensemble Classifiers , 2017, ISPRS Int. J. Geo Inf..

[6]  Darren M. Scott,et al.  Making mode detection transferable: extracting activity and travel episodes from GPS data using the multinomial logit model and Python , 2017 .

[7]  Linlin Wu,et al.  Travel Mode Detection Based on GPS Raw Data Collected by Smartphones: A Systematic Review of the Existing Methodologies , 2016, Inf..

[8]  Yuki Endo,et al.  Deep Feature Extraction from Trajectories for Transportation Mode Estimation , 2016, PAKDD.

[9]  Kevin Heaslip,et al.  Inferring transportation modes from GPS trajectories using a convolutional neural network , 2018, ArXiv.

[10]  S. Travis Waller,et al.  Developing a disaggregate travel demand system of models using data mining techniques , 2017 .

[11]  Miguel A. Labrador,et al.  Automating mode detection for travel behaviour analysis by using global positioning systemsenabled mobile phones and neural networks , 2010 .

[12]  Shanjiang Zhu,et al.  Potentials of using social media to infer the longitudinal travel behavior: A sequential model-based clustering method , 2017 .

[13]  Jean Louise Wolf,et al.  Using GPS data loggers to replace travel diaries in the collection of travel data , 2000 .

[14]  Satish V. Ukkusuri,et al.  Location Contexts of User Check-Ins to Model Urban Geo Life-Style Patterns , 2015, PloS one.

[15]  Heping Zhang,et al.  Recursive Partitioning and Applications , 1999 .

[16]  Peter Vovsha,et al.  Evaluation of Two Methods for Identifying Trip Purpose in GPS-Based Household Travel Surveys , 2014 .

[17]  Kay W. Axhausen,et al.  Trip Purpose Identification from GPS Tracks , 2014 .

[18]  Satish V. Ukkusuri,et al.  Urban activity pattern classification using topic models from online geo-location data , 2014 .

[19]  Yu Liu,et al.  Pervasive location acquisition technologies: Opportunities and challenges for geospatial studies , 2012, Comput. Environ. Urban Syst..

[20]  T. Rashidi,et al.  Exploring the capacity of social media data for modelling travel behaviour: Opportunities and challenges , 2017 .

[21]  Guangnian Xiao,et al.  Travel Mode Detection Based on Neural Networks and Particle Swarm Optimization , 2015, Inf..

[22]  William G. Griswold,et al.  Mobility Detection Using Everyday GSM Traces , 2006, UbiComp.

[23]  Konstadinos G. Goulias,et al.  Activity space estimation with longitudinal observations of social media data , 2016, Transportation.

[24]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[25]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[26]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[27]  Zachary Patterson,et al.  DataMobile: Smartphone Travel Survey Experiment , 2016 .

[28]  Eiji Hato,et al.  Travel Mode Detection with Varying Smartphone Data Collection Frequencies , 2016, Sensors.

[29]  Toshiyuki Yamamoto,et al.  Deriving Personal Trip Data from GPS Data: A Literature Review on the Existing Methodologies , 2014 .

[30]  Xuegang Ban,et al.  Vehicle classification using GPS data , 2013 .

[31]  Norbert Brändle,et al.  Supporting large-scale travel surveys with smartphones – A practical approach , 2014 .

[32]  Hao Wang,et al.  Detecting Transportation Modes Using Deep Neural Network , 2017, IEICE Trans. Inf. Syst..

[33]  Philip S. Yu,et al.  Transportation mode detection using mobile phones and GIS information , 2011, GIS.

[34]  Moshe Ben-Akiva,et al.  PII: S0965-8564(99)00043-9 , 2000 .

[35]  Yingling Fan,et al.  Real-time trip purpose prediction using online location-based search and discovery services , 2017 .

[36]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[37]  Jia Yuan Yu,et al.  Semi-Supervised travel mode detection from smartphone data , 2017, 2017 International Smart Cities Conference (ISC2).

[38]  J. Wolf,et al.  Impact of Underreporting on Mileage and Travel Time Estimates: Results from Global Positioning System-Enhanced Household Travel Survey , 2003 .

[39]  James Haworth,et al.  Who you are is how you travel: A framework for transportation mode detection using individual and environmental characteristics , 2017 .

[40]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[41]  Mehdi Ghatee,et al.  An inference engine for smartphones to preprocess data and detect stationary and transportation modes , 2016 .

[42]  Guangnian Xiao,et al.  Detecting trip purposes from smartphone-based travel surveys with artificial neural networks and particle swarm optimization , 2016 .

[43]  Guangnian Xiao,et al.  A Data-Driven Method for Trip Ends Identification Using Large-Scale Smartphone-Based GPS Tracking Data , 2017, IEEE Transactions on Intelligent Transportation Systems.

[44]  Peter R. Stopher,et al.  The Travel Survey Toolkit: Where to From Here? , 2009 .

[45]  Moshe Ben-Akiva,et al.  Stop Detection in Smartphone-based Travel Surveys , 2015 .

[46]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[47]  Xing Xie,et al.  Collaborative activity recognition via check-in history , 2011, LBSN '11.

[48]  Johann Schrammel,et al.  Comparison of Travel Diaries Generated from Smartphone Data and Dedicated GPS Devices , 2015 .

[49]  Hjp Harry Timmermans,et al.  Transportation mode recognition using GPS and accelerometer data , 2013 .

[50]  Eiji Hato,et al.  Use of acceleration data for transportation mode prediction , 2014, Transportation.

[51]  Peter R. Stopher,et al.  Review of GPS Travel Survey and GPS Data-Processing Methods , 2014 .

[52]  Kees Maat,et al.  Deriving and validating trip purposes and travel modes for multi-day GPS-based travel surveys: A large-scale application in the Netherlands , 2009 .

[53]  Moshe Ben-Akiva,et al.  Future Mobility Survey , 2013 .

[54]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[55]  Kellie J. Archer,et al.  Empirical characterization of random forest variable importance measures , 2008, Comput. Stat. Data Anal..

[56]  Chelsea C. White,et al.  Multiobjective A* , 1991, JACM.

[57]  Steven Skiena,et al.  Implementing discrete mathematics - combinatorics and graph theory with Mathematica , 1990 .

[58]  Kay W. Axhausen,et al.  Processing Raw Data from Global Positioning Systems without Additional Information , 2009 .