Domain-specific data mining for residents' transit pattern retrieval from incomplete information

Abstract The rapid development of Cyber-Physical System (CPS) is gradually cultivating a smart world based on Big Data and computational infrastructures. Such reformation has initiated revolutions in various industries and is gradually reshaping our daily routines. Among these trends, the data-driven intelligentization in Urban Public Transportation Systems can bring the most significant impact to our society, because they provide pervasive reachability. Optimizing and constructing such systems, which requires deep insights into passenger behavioral patterns, is a highly domain-specific data mining task. Although computational infrastructures have provided sufficient processing capacity, the practical utilization of such data is facing several challenges: information alignment in heterogeneous data sources and information enrichment for domain-specific applications. In this article, we first propose a cross-domain method to increase the usability of data by removing the inconsistency in vehicles' positioning and passengers' transaction data. We then use rule-based methods to reconstruct latent mobility information, thereby enabling small grained trajectory based applications. Finally, we provide use cases using the reconstructed information to derive insights to the public transportation system of our target city. Our work can serve as a domain-specific information enrichment and data mining framework for CPS in smart cities.

[1]  Michel Verleysen,et al.  Clustering Smart Card Data for Urban Mobility Analysis , 2017, IEEE Transactions on Intelligent Transportation Systems.

[2]  Nicholas Jing Yuan,et al.  Reconstructing individual mobility from smart card transactions: a collaborative space alignment approach , 2014, Knowledge and Information Systems.

[3]  Wei Chen,et al.  ECharts: A declarative framework for rapid construction of web-based visualization , 2018, Vis. Informatics.

[4]  Zhihan Lv,et al.  Next-Generation Big Data Analytics: State of the Art, Challenges, and Future Research Topics , 2017, IEEE Transactions on Industrial Informatics.

[5]  Meikang Qiu,et al.  A Scalable and Quick-Response Software Defined Vehicular Network Assisted by Mobile Edge Computing , 2017, IEEE Communications Magazine.

[6]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[7]  Matthew E. Kahn,et al.  Why do the poor live in cities? The role of public transportation ✩ , 2008 .

[8]  Athanasios V. Vasilakos,et al.  A Manufacturing Big Data Solution for Active Preventive Maintenance , 2017, IEEE Transactions on Industrial Informatics.

[9]  Xiaolei Dong,et al.  Security and Privacy for Cloud-Based IoT: Challenges , 2017, IEEE Communications Magazine.

[10]  Enzo Baccarelli,et al.  Energy-Efficient Adaptive Resource Management for Real-Time Vehicular Cloud Services , 2019, IEEE Transactions on Cloud Computing.

[11]  Christian Brecher,et al.  Cyber-Physical Systems: Foundations, Principles and Applications , 2016 .

[12]  Athanasios V. Vasilakos,et al.  Exploring Data Validity in Transportation Systems for Smart Cities , 2017, IEEE Communications Magazine.

[13]  Jian Huang,et al.  Travel patterns analysis of urban residents using Automated Fare Collection System , 2012, 2012 12th International Conference on ITS Telecommunications.

[14]  Houbing Song,et al.  Levering Mobile Cloud Computing for Mobile Big Data Analytics , 2018, Mobile Big Data.

[15]  Houbing Song,et al.  Secure and Trustworthy Transportation Cyber-Physical Systems , 2017, SpringerBriefs in Computer Science.

[16]  Enzo Baccarelli,et al.  Energy-Saving QoS Resource Management of Virtualized Networked Data Centers for Big Data Stream Computing , 2015 .

[17]  Nigel H. M. Wilson,et al.  Analyzing Multimodal Public Transport Journeys in London with Smart Card Fare Payment Data , 2009 .

[18]  Houbing Song,et al.  Integrating ground surveillance with aerial surveillance for enhanced amateur drone detection , 2018, Defense + Security.

[19]  Catherine Morency,et al.  Smart card data use in public transit: A literature review , 2011 .

[20]  Feng Chen,et al.  Transit smart card data mining for passenger origin information extraction , 2012, Journal of Zhejiang University SCIENCE C.

[21]  Ka Kee Alfred Chu,et al.  Enriching Archived Smart Card Transaction Data for Transit Demand Modeling , 2008 .

[22]  Etienne Côme,et al.  Analyzing year-to-year changes in public transport passenger behaviour using smart card data , 2017 .

[23]  Steve Vinoski,et al.  Node.js: Using JavaScript to Build High-Performance Network Programs , 2010, IEEE Internet Comput..

[24]  Houbing Song,et al.  Mining urban passengers' travel patterns from incomplete data with use cases , 2018, Comput. Networks.

[25]  Wei Wang,et al.  Bus Passenger Origin-Destination Estimation and Related Analyses , 2011 .

[26]  K. B. Salling,et al.  Accounting for the inaccuracies in demand forecasts and construction cost estimations in transport project evaluation , 2015 .

[27]  Xiaolei Ma,et al.  Public Transportation Big Data Mining and Analysis , 2019, Data-Driven Solutions to Transportation Problems.

[28]  Xiao Liu,et al.  Big Data Orchestration as a Service Network , 2017, IEEE Communications Magazine.

[29]  Anita Schöbel,et al.  The complexity of integrating passenger routing decisions in public transportation models , 2015, Networks.

[30]  Mark R. McCord,et al.  Use of Mobile Device Wireless Signals to Determine Transit Route-Level Passenger Origin–Destination Flows: Methodology and Empirical Evaluation , 2016 .

[31]  Luigi dell’Olio,et al.  Modelling user perception of bus transit quality , 2010 .

[32]  Ka Kee Alfred Chu,et al.  Augmenting Transit Trip Characterization and Travel Behavior Comprehension , 2010 .

[33]  Knut Haase,et al.  Urban public transit network optimization with flexible demand , 2015, OR Spectr..

[34]  Yasuo Asakura,et al.  Behavioural data mining of transit smart card data: A data fusion approach , 2014 .

[35]  Larry Pham,et al.  A Profile of Public Transportation Passenger Demographics and Travel Characteristics Reported in On-Board Surveys , 2007 .

[36]  Héctor Martínez,et al.  Frequency optimization in public transportation systems: Formulation and metaheuristic approach , 2014, Eur. J. Oper. Res..

[37]  Ching-Fu Chen,et al.  Behavioral intentions of public transit passengers--The roles of service quality, perceived value, satisfaction and involvement , 2011 .

[38]  Jianqiang Li,et al.  Computation Partitioning for Mobile Cloud Computing in a Big Data Environment , 2017, IEEE Transactions on Industrial Informatics.

[39]  Sabina Jeschke,et al.  Smart Cities: Foundations, Principles, and Applications , 2017 .

[40]  Xiaolei Ma,et al.  Development of a Data-Driven Platform for Transit Performance Measures Using Smart Card and GPS Data , 2014 .

[41]  Christopher P. Monterola,et al.  Inferring Passenger Type from Commuter Eigentravel Matrices , 2015, ArXiv.

[42]  Hongli Zhang,et al.  Mobile cloud sensing, big data, and 5G networks make an intelligent and smart world , 2015, IEEE Network.

[43]  Helen M. Meng,et al.  Utilizing Real-Time Travel Information, Mobile Applications and Wearable Devices for Smart Public Transportation , 2016, 2016 7th International Conference on Cloud Computing and Big Data (CCBD).

[44]  Houbing Song,et al.  Internet of Things and Big Data Analytics for Smart and Connected Communities , 2016, IEEE Access.

[45]  Kai-Chieh Hu,et al.  Application of perceived value model to identify factors affecting passengers' repurchase intentions on city bus: A case of the Taipei metropolitan area , 2003 .