Graph mining for the detection of overcrowding and waste of resources in public transport

The imbalance between the quantity of supply and demand in public transport systems causes a series of disruptions in large metropolises. While extremely crowded vehicles are uncomfortable for passengers, virtually empty vehicles generate economic losses for system managers, and this usually comes back to passengers in the form of fare increases. In this article a new data processing methodology will be presented for the evaluation of collective transportation systems. It proposes the construction and mining of graphs that represent complex networks of supply and demand of the system to find such imbalances. In a case study with the bus system of a large Brazilian metropolis, it was shown that the methodology in question is capable of identifying global imbalances in the system based on an evaluation of the weight distributions of the edges of the supply and demand networks. It has also been shown that even in a scenario where information about the demand is incomplete, using community detection techniques it is possible to identify the stretches of the network that are potentially causing these imbalances on a global scale.

[1]  Vladia Pinheiro,et al.  Micro-interventions in urban transportation from pattern discovery on the flow of passengers and on the bus network , 2016, 2016 IEEE International Smart Cities Conference (ISC2).

[2]  Marta C. González,et al.  The path most traveled: Travel demand estimation using big data resources , 2015, Transportation Research Part C: Emerging Technologies.

[3]  Graham Currie,et al.  Effective ways to grow urban bus markets: a synthesis of evidence , 2007 .

[4]  Dennis M. Wilkinson,et al.  A method for finding communities of related genes , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Hani S. Mahmassani,et al.  A complex network perspective for characterizing urban travel demand patterns: graph theoretical analysis of large-scale origin–destination demand networks , 2017 .

[6]  Jeffrey S. Racine,et al.  Nonparametric estimation of regression functions with both categorical and continuous data , 2004 .

[7]  Vasco Furtado,et al.  Impact of human mobility on police allocation , 2017, 2017 IEEE International Conference on Intelligence and Security Informatics (ISI).

[8]  Huang Hai-jun,et al.  Study on the Complexity of Traffic Networks and Related Problems , 2005 .

[9]  Louahdi Khoudour,et al.  A People Counting System Based on Dense and Close Stereovision , 2008, ICISP.

[10]  L. Bettencourt,et al.  Urban Scaling and Its Deviations: Revealing the Structure of Wealth, Innovation and Crime across Cities , 2010, PloS one.

[11]  E. Nadaraya On Estimating Regression , 1964 .

[12]  Kenneth T. Rosen,et al.  The Size Distribution of Cities: An Examination of the Pareto Law and Primacy , 1980 .

[13]  Max Kleiber,et al.  The Fire of Life: An Introduction to Animal Energetics , 1975 .

[14]  G. S. Watson,et al.  Smooth regression analysis , 1964 .

[15]  P. Schonfeld,et al.  Multiple period optimization of bus transit systems , 1991 .

[16]  Daniel McFadden,et al.  Urban travel demand: A behavioral analysis : a Charles River Associates research study , 1975 .

[17]  Christian Schneider,et al.  Spatiotemporal Patterns of Urban Human Mobility , 2012, Journal of Statistical Physics.

[18]  J. Hołyst,et al.  Statistical analysis of 22 public transport networks in Poland. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Céline Robardet,et al.  Networks as signals, with an application to a bike sharing system , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[20]  H E Gerland,et al.  AUTOMATIC PASSENGER COUNTING (APC): INFRA-RED MOTION ANALYZER FOR ACCURATE COUNTS IN STATIONS AND RAIL, LIGHT-RAIL AND BUS OPERATIONS , 1999 .

[22]  Fahui Wang,et al.  Exploring the network structure and nodal centrality of China , 2011 .

[23]  Vasco Furtado,et al.  Towards Understanding the Impact of Crime on the Choice of Route by a Bus Passenger , 2017, EPIA.

[24]  Jiwon Kim,et al.  Identification of communities in urban mobility networks using multi-layer graphs of network traffic , 2017 .

[25]  D. McFadden,et al.  URBAN TRAVEL DEMAND - A BEHAVIORAL ANALYSIS , 1977 .

[26]  José S. Andrade,et al.  Statistical Signs of Social Influence on Suicides , 2014, Scientific Reports.

[27]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[28]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[29]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[30]  A. Barabasi,et al.  Scale-free characteristics of random networks: the topology of the world-wide web , 2000 .

[31]  J. S. Andrade,et al.  Large cities are less green , 2014, Scientific Reports.

[32]  Javam C. Machado,et al.  The light pollution as a surrogate for urban population of the US cities , 2017, 1706.05139.

[33]  Ren Hua-ling Origin-Destination Demands Estimation in Congested Dynamic Transit Networks , 2007, 2007 International Conference on Management Science and Engineering.

[34]  Jeffrey S. Racine,et al.  CROSS-VALIDATED LOCAL LINEAR NONPARAMETRIC REGRESSION , 2004 .

[35]  Marcela Munizaga,et al.  Estimation of a disaggregate multimodal public transport Origin-Destination matrix from passive smartcard data from Santiago, Chile , 2012 .

[36]  Daniel K Boyle PASSENGER COUNTING TECHNOLOGIES AND PROCEDURES , 1998 .

[37]  Pere Colet,et al.  Tweets on the Road , 2014, PloS one.

[38]  Fabio Gordillo,et al.  The value of automated fare collection data for transit planning : an example of rail transit OD matrix estimation , 2006 .

[39]  Mohsen Ramezani,et al.  A complex network analysis of macroscopic structure of taxi trips , 2017 .

[40]  Seref Sagiroglu,et al.  Big data: A review , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[41]  Laverne W. Stanton,et al.  Applied Regression Analysis: A Research Tool , 1990 .

[42]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[43]  José S. Andrade,et al.  Human mobility in large cities as a proxy for crime , 2016, PloS one.

[44]  Andrea Rinaldo,et al.  Supply–demand balance and metabolic scaling , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[45]  H. Stanley,et al.  Modelling urban growth patterns , 1995, Nature.

[46]  Elizabeth Furtado,et al.  A data-driven approach to help understanding the preferences of public transport users , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[47]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[48]  H. Pietrantonio URBAN TRAVEL DEMAND MODELING: FROM INDIVIDUAL CHOICES TO GENERAL EQUILIBRIUM , 1997 .