Transit OD matrix estimation using smartcard data: Recent developments and future research challenges

Abstract In public transport, smartcards are primarily used for automatic fare collection purpose, which in turn generate massive data. During the last two decades, a tremendous amount of research has been done to employ this big data for various transport applications from transit planning to real-time operation and control. One of the smart card data applications is the estimation of the public transit origin–destination matrix (tOD). The primary focus of this article is to critically analyse the current literature on essential steps involved in the tOD estimation process. The steps include processes of data cleansing, estimation of unknowns, transfer detection, validation of developed algorithms, and ultimately estimation of zone level transit OD (ztOD). Estimation of unknowns includes boarding and alighting information estimation of passengers. Transfer detection algorithms distinguish between a transfer or an activity between two consecutive boarding and alighting. The findings reveal many unanswered critical research questions which need to be addressed for ztOD estimation using smartcard data. The research questions are primarily related to the conversion of stop level OD (stOD) to ztOD, transfer detection, and a few miscellaneous problems.

[1]  Oded Cats,et al.  Passenger Travel Time Reliability for Multimodal Public Transport Journeys , 2019 .

[2]  Etienne Côme,et al.  Short & long term forecasting of multimodal transport passenger flows with machine learning methods , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[3]  Jinhua Zhao,et al.  Estimating a Rail Passenger Trip Origin‐Destination Matrix Using Automatic Data Collection Systems , 2007, Comput. Aided Civ. Infrastructure Eng..

[4]  Paul Schonfeld,et al.  Measures of Travel Reliability on an Urban Rail Transit Network , 2020, Journal of Transportation Engineering, Part A: Systems.

[5]  Xin Wu,et al.  Hierarchical travel demand estimation using multiple data sources: A forward and backward propagation algorithmic framework on a layered computational graph , 2018, Transportation Research Part C: Emerging Technologies.

[6]  M. Hofmann,et al.  Transfer journey identification and analyses from electronic fare collection data , 2005, Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005..

[7]  Mahmoud Mesbah,et al.  Validating and improving public transport origin–destination estimation algorithm using smart card fare data ☆ , 2016 .

[8]  Ashish Bhaskar,et al.  A novel methodology to assimilate sub-path flows in bi-level OD matrix estimation process , 2020 .

[9]  Nigel H. M. Wilson,et al.  Analyzing Multimodal Public Transport Journeys in London with Smart Card Fare Payment Data , 2009 .

[10]  LiuLiang,et al.  Estimating Origin-Destination Flows Using Mobile Phone Location Data , 2011 .

[11]  Antonio Gschwender,et al.  Modeling bus bunching using massive location and fare collection data , 2018, J. Intell. Transp. Syst..

[12]  Baoming Han,et al.  A Review of Big Data Applications in Urban Transit Systems , 2021, IEEE Transactions on Intelligent Transportation Systems.

[13]  Sebastián Tamblay,et al.  A zonal inference model based on observed smart-card transactions for Santiago de Chile , 2016 .

[14]  Ashish Bhaskar,et al.  A Framework for the Comparative Analysis of Multi-Modal Travel Demand: Case Study on Brisbane Network , 2021 .

[15]  Keemin Sohn,et al.  Deep-learning architecture to forecast destinations of bus passengers from entry-only smart-card data , 2017 .

[16]  Ahmad Tavassoli,et al.  Application of smart card data in validating a large-scale multi-modal transit assignment model , 2017, Public Transport.

[17]  David A. Hensher,et al.  Handbook of Transport Modelling , 2000 .

[18]  Daoqin Tong,et al.  Stop Aggregation Model , 2012 .

[19]  Marcela Munizaga,et al.  Validating travel behavior estimated from smartcard data , 2013 .

[20]  Zhenliang Ma,et al.  Monitoring transit-served areas with smartcard data: A Brisbane case study , 2019, Journal of Transport Geography.

[21]  Mahmoud Mesbah,et al.  Improving alighting stop inference accuracy in the trip chaining method using neural networks , 2020, Public Transp..

[22]  Moshe Ben-Akiva,et al.  Evaluating the impact of interventions on network capacity , 2010 .

[23]  Zhenliang Ma,et al.  Activity detection and transfer identification for public transit fare card data , 2015 .

[24]  Satish V. Ukkusuri,et al.  Alighting stop determination using two-step algorithms in bus transit systems , 2019, Transportmetrica A: Transport Science.

[25]  Nigel H. M. Wilson,et al.  Estimation of population origin–interchange–destination flows on multimodal transit networks , 2018 .

[26]  J. White,et al.  Extracting origin destination information from mobile phone data , 2002 .

[27]  Howard Slavin,et al.  Use of Entry-Only Automatic Fare Collection Data to Estimate Linked Transit Trips in New York City , 2009 .

[28]  Harilaos N. Koutsopoulos,et al.  Automated Inference of Linked Transit Journeys in London Using Fare-Transaction and Vehicle Location Data , 2013 .

[29]  Bruno Agard,et al.  Measuring transit use variability with smart-card data , 2007 .

[30]  Fan Zhang,et al.  Spatiotemporal Segmentation of Metro Trips Using Smart Card Data , 2016, IEEE Transactions on Vehicular Technology.

[31]  Feng Chen,et al.  Transit smart card data mining for passenger origin information extraction , 2012, Journal of Zhejiang University SCIENCE C.

[32]  C. Mulley,et al.  Public transport pre-pay tickets: Understanding passenger choice for different products , 2012 .

[33]  Peter White,et al.  The Potential of Public Transport Smart Card Data , 2005 .

[34]  Mark R. McCord,et al.  Transit passenger origin–destination flow estimation: Efficiently combining onboard survey and large automatic passenger count datasets , 2015 .

[35]  Le Minh Kieu,et al.  A modified Density-Based Scanning Algorithm with Noise for spatial travel pattern analysis from Smart Card AFC data , 2015 .

[36]  Lingli Wang,et al.  Optimization of bus stop placement for routes on uneven topography , 2015 .

[37]  Eric J. Miller,et al.  Lessons from a Large-Scale Experiment on the Use of Smartphone Apps to Collect Travel Diary Data: The “City Logger” for the Greater Golden Horseshoe Area , 2020 .

[38]  Adam Rahbee,et al.  Origin and Destination Estimation in New York City with Automated Fare System Data , 2002 .

[39]  Carlo Ratti,et al.  Detecting weak public transport connections from cellphone and public transport data , 2014, BigDataScience '14.

[40]  Mahmoud Mesbah,et al.  Applications of transit smart cards beyond a fare collection tool: a literature review , 2018 .

[41]  Jiangtao Liu,et al.  Observability quantification of public transportation systems with heterogeneous data sources: An information-space projection approach based on discretized space-time network flow models , 2019, Transportation Research Part B: Methodological.

[42]  Costas J. Spanos,et al.  Measuring fine-grained metro interchange time via smartphones , 2017 .

[43]  Qing He,et al.  A robust method for estimating transit passenger trajectories using automated data , 2018 .

[44]  Di Huang,et al.  A Method for Bus OD Matrix Estimation Using Multisource Data , 2020 .

[45]  T. Rashidi,et al.  Exploring the capacity of social media data for modelling travel behaviour: Opportunities and challenges , 2017 .

[46]  Ding Luo,et al.  Constructing Transit Origin–Destination Matrices with Spatial Clustering , 2017 .

[47]  Ziyou Gao,et al.  Short-term prediction of passenger volume for urban rail systems: A deep learning approach based on smart-card data , 2021 .

[48]  O. Cats,et al.  Identification and classification of public transport activity centres in Stockholm using passenger flows data , 2015 .

[50]  Oded Cats,et al.  Crowding valuation in urban tram and bus transportation based on smart card data , 2018, Transportmetrica A: Transport Science.

[51]  Ziyou Gao,et al.  Estimating the influence of common disruptions on urban rail transit networks , 2016 .

[52]  Martin Trépanier,et al.  Individual Trip Destination Estimation in a Transit Smart Card Automated Fare Collection System , 2007, J. Intell. Transp. Syst..

[53]  Simon Washington,et al.  Do differences in built environments explain age differences in transport walking across neighbourhoods , 2018 .

[54]  Catherine Morency,et al.  Smart card data use in public transit: A literature review , 2011 .

[55]  Oliver Schauer,et al.  Intelligent transportation system for traffic and road infrastructure-related data , 2016 .

[56]  Ahmad Tavassoli,et al.  Evaluation of effects from sample-size origin-destination estimation using smart card fare data , 2017 .

[57]  Joo-Young Kim,et al.  Travel behavior analysis using smart card data , 2016 .

[58]  Kay W. Axhausen,et al.  Using smartcard data for agent-based transport simulation , 2016 .

[59]  Zhu Yonggang,et al.  Study on the Method of Constructing Bus Stops OD Matrix Based on IC Card Data , 2007, 2007 International Conference on Wireless Communications, Networking and Mobile Computing.

[60]  Hani S. Mahmassani,et al.  Dynamic Origin-Destination Demand Estimation with Multiday Link Traffic Counts for Planning Applications , 2003 .

[61]  Marcela Munizaga,et al.  Estimating the residence zone of frequent public transport users to make travel pattern and time use analysis , 2017 .

[62]  Krzysztof Janowicz,et al.  Can Twitter data be used to validate travel demand models , 2015 .

[63]  Stephen Greaves,et al.  Household travel surveys: Where are we going? , 2007 .

[64]  Robert Weibel,et al.  Transport mode detection based on mobile phone network data: A systematic review , 2019, Transportation Research Part C: Emerging Technologies.

[65]  Ashish Bhaskar,et al.  A novel origin destination based transit supply index: Exploiting the opportunities with big transit data , 2021 .

[66]  Xiaolei Ma,et al.  Mining smart card data for transit riders’ travel patterns , 2013 .

[67]  Yuxiong Ji,et al.  Statistical inference on transit route‐level origin–destination flows using automatic passenger counter data , 2015 .

[68]  Serge P. Hoogendoorn,et al.  A robust transfer inference algorithm for public transport journeys during disruptions , 2017 .

[69]  Moshe Ben-Akiva,et al.  ALTERNATIVE METHODS TO ESTIMATE ROUTE-LEVEL TRIP TABLES AND EXPAND ON-BOARD SURVEYS , 1985 .

[70]  Ka Kee Alfred Chu,et al.  Enriching Archived Smart Card Transaction Data for Transit Demand Modeling , 2008 .

[71]  Ernesto Cipriani,et al.  Combined signal setting design and traffic assignment problem , 2004, Eur. J. Oper. Res..

[73]  Lijun Sun,et al.  Probabilistic model for destination inference and travel pattern mining from smart card data , 2020, Transportation.

[74]  Patrick Bonnel,et al.  How comparable are origin-destination matrices estimated from automatic fare collection, origin-destination surveys and household travel survey? An empirical investigation in Lyon , 2020 .

[75]  Susan Grant-Muller,et al.  New and emerging data forms in transportation planning and policy: Opportunities and challenges for “Track and Trace” data , 2020, Transportation Research Part C: Emerging Technologies.

[76]  Gabriel E. Sánchez-Martínez Inference of Public Transportation Trip Destinations by Using Fare Transaction and Vehicle Location Data , 2017 .

[77]  Wei Wang,et al.  Recognizing metro-bus transfers from smart card data , 2019 .

[78]  Wei Wang,et al.  Bus Passenger Origin-Destination Estimation and Related Analyses , 2011 .

[79]  Keemin Sohn,et al.  Activity imputation for trip-chains elicited from smart-card data using a continuous hidden Markov model , 2016 .

[80]  W. Deng,et al.  What influences Metro station ridership in China? Insights from Nanjing , 2013 .

[81]  Mark Hickman,et al.  Transit Stop-Level Origin–Destination Estimation through Use of Transit Schedule and Automated Data Collection System , 2011 .

[82]  Qingquan Li,et al.  Spatial variations in urban public ridership derived from GPS trajectories and smart card data , 2018 .

[83]  Oded Cats,et al.  Investigating Potential Transit Ridership by Fusing Smartcard and Global System for Mobile Communications Data , 2017 .

[84]  Marcela Munizaga,et al.  Estimation of a disaggregate multimodal public transport Origin-Destination matrix from passive smartcard data from Santiago, Chile , 2012 .

[85]  Janine M Farzin Constructing an Automated Bus Origin–Destination Matrix Using Farecard and Global Positioning System Data in São Paulo, Brazil , 2008 .

[86]  Mark Hickman,et al.  Trip purpose inference using automated fare collection data , 2014, Public Transp..

[87]  Yao-Jan Wu,et al.  Origin-destination pattern estimation based on trajectory reconstruction using automatic license plate recognition data , 2018, Transportation Research Part C: Emerging Technologies.

[88]  Dazhi Sun,et al.  Smart Card Data Mining of Public Transport Destination: A Literature Review , 2018, Inf..

[89]  Oded Cats,et al.  Predicting disruptions and their passenger delay impacts for public transport stops , 2020, Transportation.

[90]  Mark Hickman,et al.  Validating and calibrating a destination estimation algorithm for public transport smart card fare collection systems , 2015 .

[91]  W. Y. Szeto,et al.  A novel discrete network design problem formulation and its global optimization solution algorithm , 2015 .

[92]  Alex Cui Bus passenger Origin-Destination Matrix estimation using Automated Data Collection systems , 2006 .

[93]  Mauricio Zúñiga,et al.  Towards a Reliable Origin-Destination Matrix from Massive Amounts of Smart Card and GPS Data: Application to Santiago , 2013 .

[94]  Mark R. McCord,et al.  Effect of Onboard Survey Sample Size on Estimation of Transit Bus Route Passenger Origin–Destination Flow Matrix Using Automatic Passenger Counter Data , 2011 .

[95]  Adrian B. Ellison,et al.  Electronic ticketing systems as a mechanism for travel behaviour change? Evidence from Sydney’s Opal card , 2017 .

[96]  Sebastián Tamblay,et al.  Extended Methodology for the Estimation of a Zonal Origin-Destination Matrix: A Planning Software Application Based on Smartcard Trip Data , 2018 .

[97]  Wang Fan,et al.  Dynamic OD estimation using Automatic Vehicle Location information , 2011, 2011 6th IEEE Joint International Information Technology and Artificial Intelligence Conference.

[98]  Zhen Chen,et al.  Extracting bus transit boarding stop information using smart card transaction data , 2018 .

[99]  Yunpeng Wang,et al.  Understanding commuting patterns using transit smart card data , 2017 .

[100]  Yasuo Asakura,et al.  Behavioural data mining of transit smart card data: A data fusion approach , 2014 .

[101]  João Falcão e Cunha,et al.  Passenger Journey Destination Estimation From Automated Fare Collection System Data Using Spatial Validation , 2016, IEEE Transactions on Intelligent Transportation Systems.

[102]  Chen Jun,et al.  Estimating Smart Card Commuters Origin-Destination Distribution Based on APTS Data , 2013 .

[103]  Rob van Nes,et al.  Service Reliability in a Network Context , 2014 .

[104]  Mahmoud Mesbah,et al.  Use of Smart Card Fare Data to Estimate Public Transport Origin–Destination Matrix , 2015 .

[105]  Ronghui Liu,et al.  Incorporating weather conditions and travel history in estimating the alighting bus stops from smart card data , 2020, Sustainable Cities and Society.

[106]  Mark R. McCord,et al.  Grouping of Bus Stops for Aggregation of Route-Level Passenger Origin–Destination Flow Matrices , 2012 .

[107]  Christian Jutten,et al.  Multimodal Data Fusion: An Overview of Methods, Challenges, and Prospects , 2015, Proceedings of the IEEE.

[108]  Le Minh Kieu,et al.  Passenger Segmentation Using Smart Card Data , 2015, IEEE Transactions on Intelligent Transportation Systems.

[109]  Jerome N. Rachele,et al.  Gender and age differences in walking for transport and recreation: Are the relationships the same in all neighborhoods? , 2016, Preventive medicine reports.