Travel mode classification of intercity trips using cellular network data

Abstract Many applications in transport planning require an understanding of travel patterns separated by travel mode. To use cellular network data as observations of human mobility in these applications, classification by travel mode is needed. Existing classification methods for GPS-trajectories are often inefficient for cellular network data, which has lower resolution in space and time than GPS data. In this study, we compare three geometry-based mode classification methods and three supervised methods to classify trips extracted from cellular network data in intercity origin-destination pairs as either road or train. To understand the difficulty of the problem, we use a labeled dataset of 255 trips in two OD-pairs to train the supervised classification methods and to evaluate the classification performance. For an OD-pair where the road and train routes are not separated by more than four kilometers, the geometry-based methods classify 4.5% - 7.1% of the trips wrong, while two of the supervised methods can classify all trips correctly. Using a large-scale dataset of 29037 trips, we find that separation between classes is less evident than in the labeled dataset and show that the choice of classification methods impacts the aggregated modal split estimate.

[1]  David Gundlegård Transport Analytics Based on Cellular Network Signalling Data , 2018 .

[2]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[3]  Alex Rutherford,et al.  On the privacy-conscientious use of mobile phone data , 2018, Scientific Data.

[4]  Robert Weibel,et al.  Transport mode detection based on mobile phone network data: A systematic review , 2019, Transportation Research Part C: Emerging Technologies.

[5]  Alexander Erath,et al.  Transport modelling in the age of big data , 2017 .

[6]  Zbigniew Smoreda,et al.  Inferring social influence in transport mode choice using mobile phone data , 2017, EPJ Data Science.

[7]  Emmanuel Morin,et al.  A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora , 2013, BUCC@ACL.

[8]  Marta C. González,et al.  The path most traveled: Travel demand estimation using big data resources , 2015, Transportation Research Part C: Emerging Technologies.

[9]  Botond Rajna,et al.  Travel demand estimation and network assignment based on cellular network data , 2016, Comput. Commun..

[10]  Pu Wang,et al.  Transportation Mode Split with Mobile Phone Data , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[11]  Y. Shafahi,et al.  Travel Mode Detection Exploiting Cellular Network Data , 2016 .

[12]  Jakob Puchinger,et al.  Inferring dynamic origin-destination flows by transport mode using mobile phone data , 2019, Transportation Research Part C: Emerging Technologies.

[13]  Nils Breyer,et al.  Comparative Analysis of Travel Patterns from Cellular Network Data and an Urban Travel Demand Model , 2020 .