A visual segmentation method for temporal smart card data

ABSTRACT In many cities, worldwide public transit companies use smart card system to manage fare collection. Analysis of this acquisitive information provides a comprehensive insight of user's influence in the interactive public transit network. In this regard, analysis of temporal data, describing the time of entering to the public transit network is considered as the most substantial component of the data gathered from the smart cards. Classical distance-based techniques are not always suitable to analyze this time series data. A novel projection with intuitive visual map from higher dimension into a three-dimensional clock-like space is suggested to reveal the underlying temporal pattern of public transit users. This projection retains the temporal distance between any arbitrary pair of time-stamped data with meaningful visualization. Consequently, this information is fed into a hierarchical clustering algorithm as a method of data segmentation to discover the pattern of users.

[1]  E. Côme,et al.  Understanding Passenger Patterns in Public Transit Through Smart Card and Socioeconomic Data: A case study in Rennes, France , 2014 .

[2]  P. Sneath,et al.  Some thoughts on bacterial classification. , 1957, Journal of general microbiology.

[3]  Pablo Montero,et al.  TSclust: An R Package for Time Series Clustering , 2014 .

[4]  Shashi Shekhar,et al.  Spatiotemporal Data Mining: A Computational Perspective , 2015, ISPRS Int. J. Geo Inf..

[5]  Mahmoud Mesbah,et al.  Validating and improving public transport origin–destination estimation algorithm using smart card fare data ☆ , 2016 .

[6]  P. Deb Finite Mixture Models , 2008 .

[7]  O. Järv,et al.  Understanding monthly variability in human activity spaces: A twelve-month study using mobile phone call detail records , 2014 .

[8]  Tai-Yu Ma,et al.  Mode choice with latent preference heterogeneity: a case study for employees of the EU institutions in Luxembourg , 2015 .

[9]  S. Yahya,et al.  Strategic Planning of an Integrated Smart Card Fare Collection System – Challenges and Solutions , 2008, 2008 11th IEEE International Conference on Computational Science and Engineering - Workshops.

[10]  Le Minh Kieu,et al.  Transit passenger segmentation using travel regularity mined from Smart Card transactions data , 2014 .

[11]  Marc Barthelemy,et al.  The multilayer temporal network of public transport in Great Britain , 2015, Scientific Data.

[12]  Xiaolei Ma,et al.  Mining smart card data for transit riders’ travel patterns , 2013 .

[13]  Maria Bordagaray,et al.  Modelling user perception of bus transit quality considering user and service heterogeneity , 2014 .

[14]  F. G. Benitez,et al.  Determining a public transport satisfaction index from user surveys , 2013 .

[15]  Christian Schneider,et al.  Spatiotemporal Patterns of Urban Human Mobility , 2012, Journal of Statistical Physics.

[16]  Catherine Morency,et al.  Bridging the gap between complex data and decision-makers: an example of an innovative interactive tool , 2010 .

[17]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[18]  G. Weisbrod,et al.  Economic Impact of Public Transportation Investment , 2009 .

[19]  Antony Stathopoulos,et al.  A utility-maximization model for retrieving users’ willingness to travel for participating in activities from big-data , 2015 .

[20]  Licia Capra,et al.  How smart is your smartcard?: measuring travel behaviours, perceptions, and incentives , 2011, UbiComp '11.

[21]  Meisy A. Ortega-Tong Classification of London's public transport users using smart card data , 2013 .

[22]  Yasuo Asakura,et al.  Behavioural data mining of transit smart card data: A data fusion approach , 2014 .

[23]  Joo-Young Kim,et al.  Travel behavior analysis using smart card data , 2016 .

[24]  Juan de Oña,et al.  Analysis of transit quality of service through segmentation and classification tree techniques , 2015 .

[25]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[26]  Ashish Bhaskar,et al.  Real-time traffic state estimation in urban corridors from heterogeneous data , 2016 .

[27]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[28]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[29]  Alexandre M. Bayen,et al.  Evaluation of traffic data obtained via GPS-enabled mobile phones: The Mobile Century field experiment , 2009 .

[30]  D. Stephens,et al.  A Quantitative Study of Gene Regulation Involved in the Immune Response of Anopheline Mosquitoes , 2006 .

[31]  Katherine A. Heller,et al.  Bayesian hierarchical clustering , 2005, ICML.

[32]  Catherine Morency,et al.  Smart card data use in public transit: A literature review , 2011 .

[33]  Martin Trépanier,et al.  Individual Trip Destination Estimation in a Transit Smart Card Automated Fare Collection System , 2007, J. Intell. Transp. Syst..

[34]  Malika Charrad,et al.  NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set , 2014 .

[35]  Bruno Agard,et al.  Analysing the Variability of Transit Users Behaviour with Smart Card Data , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[36]  Bruno Agard Mining Smart Card Data from an Urban Transit Network , 2009, Encyclopedia of Data Warehousing and Mining.