Comparing cities’ cycling patterns using online shared bicycle maps

AbstractBicycle sharing systems are increasingly being deployed in urban areas around the world, alongside online maps that disclose the state (i.e., location, number of bicycles/number of free parking slots) of stations in each city. Recent work has demonstrated how regularly monitoring these online maps allows for a granular analysis of a city’s cycling trends; further, the literature indicates that different cities have unique spatio-temporal patterns, reducing the generalisability of any insights or models derived from a single system. In this work, we analyse 4.5 months of online bike-sharing map data from ten cities which, combined, have 996 stations. While an aggregate comparison supports the view of cities having unique usage patterns, results of applying unsupervised learning to the temporal data shows that, instead, only the larger systems display heterogeneous behaviour, indicating that many of these systems share intrinsic similarities. We further show how these similarities are reflected in the predictability of stations’ occupancy data via a cross-city comparison of the error that a variety of approaches achieve when forecasting the number of bicycles that a station will have in the near future. We close by discussing the impact of uncovering these similarities on how future bicycle sharing systems can be designed, built, and managed.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Keying Ye,et al.  Determining the Number of Clusters Using the Weighted Gap Statistic , 2007, Biometrics.

[3]  I-Lin Wang,et al.  Models for Effective Deployment and Redistribution of Bicycles Within Public Bicycle-Sharing Systems , 2013, Oper. Res..

[4]  Sven F. Crone,et al.  Advances in forecasting with neural networks? Empirical evidence from the NN3 competition on time series prediction , 2011 .

[5]  Daniele Quercia,et al.  Ads and the city: considering geographic distance goes a long way , 2012, RecSys '12.

[6]  Emanuele Strano,et al.  The Structure of Spatial Networks and Communities in Bicycle Sharing Systems , 2013, PloS one.

[7]  Gérard Govaert,et al.  Clustering the Vélib' dynamic Origin/Destination flows using a family of Poisson mixture models , 2014, Neurocomputing.

[8]  Amir F. Atiya,et al.  An Empirical Comparison of Machine Learning Models for Time Series Forecasting , 2010 .

[9]  C. Robusto The Cosine-Haversine Formula , 1957 .

[10]  Gérard Govaert,et al.  Clustering the Vélib' origin-destinations flows by means of Poisson mixture models , 2013, ESANN.

[11]  Michael Batty,et al.  Mining bicycle sharing data for generating insights into sustainable transport systems , 2014 .

[12]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[13]  Patrice Abry,et al.  Studying Lyon's Vélo'V: A Statistical Cyclic Model , 2009 .

[14]  Adam P. Cohen,et al.  Understanding the diffusion of public bikesharing systems: evidence from Europe and North America , 2013 .

[15]  Licia Capra,et al.  Measuring the impact of opening the London shared bicycle scheme to casual users , 2012 .

[16]  Frédéric Meunier,et al.  Balancing the stations of a self service "bike hire" system , 2011, RAIRO Oper. Res..

[17]  Nuria Oliver,et al.  Sensing and predicting the pulse of the city through shared bicycling , 2009, IJCAI 2009.

[18]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[19]  Jeremy T. Bradley,et al.  Journey Data Based Arrival Forecasting for Bicycle Hire Schemes , 2013, ASMTA.

[20]  Virgílio A. F. Almeida,et al.  Tips, dones and todos: uncovering user profiles in foursquare , 2012, WSDM '12.

[21]  Christian S. Jensen,et al.  Hyper-local, directions-based ranking of places , 2011, Proc. VLDB Endow..

[22]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[23]  Andrew Hogue,et al.  Learning to rank for spatiotemporal search , 2013, WSDM.

[24]  Josep Blat,et al.  Digital Footprinting: Uncovering Tourists with User-Generated Content , 2008, IEEE Pervasive Computing.

[25]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[26]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.