Irregular Travel Groups Detection Based on Cascade Clustering in Urban Subway

Travel smart cards record passengers’ travel histories, which makes it possible to study personal traveling behaviors and passengers’ mobility patterns. The existing researches on smart card data pay less attention to those who beg, steal or busk during traveling, and they are called irregular passengers in this paper. Moreover, the group consisted of irregular passengers is called irregular travel group. In this paper, we propose an approach to recognize irregular travel groups based on cascade clustering. Firstly, passengers’ travel sequences of active state in hours are extracted as representations of travel patterns according to the records of smart cards in a continuous time period. The sequences are clustered with the K-means algorithm to detect irregular passengers. Then, travel similarities between the derived irregular passengers are measured and irregular travel groups are recognized based on DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm. The proposed method is applied to real Beijing subway smart card data, and the results are validated with the data derived from SINA Micro-blogs. Experimental results show that the detected irregular passengers’ movement areas are consistent with the ground truth.

[1]  Bruno Agard,et al.  Analysing the Variability of Transit Users Behaviour with Smart Card Data , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[2]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[3]  Le Minh Kieu,et al.  Passenger Segmentation Using Smart Card Data , 2015, IEEE Transactions on Intelligent Transportation Systems.

[4]  Sang Gu Lee,et al.  Travel Pattern Analysis Using Smart Card Data of Regular Users , 2011 .

[5]  Michel Verleysen,et al.  Clustering Smart Card Data for Urban Mobility Analysis , 2017, IEEE Transactions on Intelligent Transportation Systems.

[6]  Meisy A. Ortega-Tong Classification of London's public transport users using smart card data , 2013 .

[7]  Haris N. Koutsopoulos,et al.  Inferring patterns in the multi-week activity sequences of public transport users , 2016 .

[8]  Zhenyu Chen,et al.  Predicting Passengers in Public Transportation Using Smart Card Data , 2015, ADC.

[9]  Jin Ki Eom,et al.  Analysis of Travel Patterns of the Elderly Using Transit Smart Card Data , 2011 .

[10]  Xing Xie,et al.  Mining Shopping Patterns for Divergent Urban Regions by Incorporating Mobility Data , 2016, CIKM.

[11]  Mufti Mahmud,et al.  Advances in Crowd Analysis for Urban Applications Through Urban Event Detection , 2018, IEEE Transactions on Intelligent Transportation Systems.

[12]  Hui Xiong,et al.  Catch Me If You Can: Detecting Pickpocket Suspects from Large-Scale Transit Records , 2016, KDD.

[13]  Torsten Hägerstraand WHAT ABOUT PEOPLE IN REGIONAL SCIENCE , 1970 .

[14]  Mark Hickman,et al.  Are Transit Trips Symmetrical in Time and Space? , 2013 .

[15]  E. Côme,et al.  Understanding Passenger Patterns in Public Transit Through Smart Card and Socioeconomic Data: A case study in Rennes, France , 2014 .

[16]  Xiaolei Ma,et al.  Mining smart card data for transit riders’ travel patterns , 2013 .

[17]  G. Newman,et al.  Identity Theft Literature Review , 2005 .

[18]  Le Minh Kieu,et al.  A modified Density-Based Scanning Algorithm with Noise for spatial travel pattern analysis from Smart Card AFC data , 2015 .

[19]  Zhiyong Cui,et al.  Characterizing evolution of extreme public transit behavior using smart card data , 2015, 2015 IEEE First International Smart Cities Conference (ISC2).

[20]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[21]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[22]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[23]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[24]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[25]  Huayu Wu,et al.  Identifying tourists from public transport commuters , 2014, KDD.

[26]  Shazia Wasim Sadiq,et al.  Efficient Retrieval of Top-K Most Similar Users from Travel Smart Card Data , 2014, 2014 IEEE 15th International Conference on Mobile Data Management.