Disaggregation of aggregate GPS-based cycling data – How to enrich commercial cycling data sets for detailed cycling behaviour analysis

Abstract In order to investigate cycling behaviour, planners and researchers are increasingly using disaggregate data such as GPS data. However, disaggregate data on cycling behaviour is not available for most cities. At first sight, the data collected by (sport) app providers like Strava could help fill the data gap, as they are available for most cities around the globe. Due to data privacy reasons however, this data is usually aggregated before it is sold commercially by data providers. To use the data for detailed analysis, this article presents a multi-step disaggregation approach to synthesise single routes from aggregate data sets. The approach requires aggregate origin-destination data of cycling demand as the primary input. A double-constrained routing algorithm is subsequently used to derive single bicycle routes from this data. This disaggregate route data can then be enriched with further attribute data and can thereafter be used to estimate bicycle route choice models. This article presents the approach developed as well as a proof of concept using a case study. It further illustrates how the results can estimate a route choice model for a case study area in Germany. The overall results show that the presented approach could easily be used to disaggregate available aggregate cycling data to investigate cycling behaviour.

[1]  Serge P. Hoogendoorn,et al.  How Do People Cycle in Amsterdam? Estimating Cyclists’ Route Choice determinants Using GPS Data from an Urban Area , 2017 .

[2]  Luca Bertolini,et al.  Characterisation of and reflections on the synergy of bicycles and public transport , 2016 .

[3]  Lee D. Han,et al.  Modeling Route Choice of Utilitarian Bikeshare Users with GPS Data , 2016 .

[4]  Kay W. Axhausen,et al.  Route choice of cyclists in Zurich , 2010 .

[5]  Clas Rydergren,et al.  ROUTE CHOICE MODELLING FOR BICYCLE TRIPS , 2014 .

[6]  Moshe Ben-Akiva,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1985 .

[7]  Meghan Winters,et al.  Using crowdsourced data to monitor change in spatial patterns of bicycle ridership , 2018, Journal of Transport & Health.

[8]  David Philip McArthur,et al.  Visualising where commuting cyclists travel using crowdsourced data , 2019, Journal of Transport Geography.

[9]  Kristiann C Heesch,et al.  The usefulness of GPS bicycle tracking data for evaluating the impact of infrastructure change on cycling behaviour. , 2016, Health promotion journal of Australia : official journal of Australian Association of Health Promotion Professionals.

[10]  Billy Charlton,et al.  A GPS-based bicycle route choice model for San Francisco, California , 2011 .

[11]  Mia Birk,et al.  Bridging the Gaps: How Quality and Quantity of a Connected Bikeway Network Correlates with Increasing Bicycle Use , 2006 .

[12]  Walter Musakwa,et al.  Mapping cycling patterns and trends using Strava Metro data in the city of Johannesburg, South Africa , 2016, Data in brief.

[13]  Robert Steele,et al.  How personal fitness data can be re-used by smart cities , 2011, 2011 Seventh International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[14]  Angela Francke,et al.  Big Data in Bicycle Traffic , 2018 .

[15]  Qing Shen,et al.  A GPS data-based analysis of built environment influences on bicyclist route preferences , 2018 .

[16]  Jennifer Dill,et al.  Where do cyclists ride? A route choice model developed with revealed preference GPS data , 2012 .

[17]  Chandra R. Bhat,et al.  An analysis of bicycle route choice preferences in Texas, US , 2009 .

[18]  Yiik Diew Wong,et al.  Influence of infrastructural compatibility factors on walking and cycling route choices , 2013 .

[19]  Tien Mai,et al.  Bike route choice modeling using GPS data without choice sets of paths , 2017 .

[20]  Serge P. Hoogendoorn,et al.  How Do People Cycle in Amsterdam, Netherlands?: Estimating Cyclists’ Route Choice Determinants with GPS Data from an Urban Area , 2017 .

[21]  Jon D Fricker,et al.  Bicyclist commuters’ choice of on-street versus off-street route segments , 2013 .

[22]  Trisalyn A. Nelson,et al.  Mapping ridership using crowdsourced cycling data , 2016 .

[23]  Ray Pritchard,et al.  Revealed Preference Methods for Studying Bicycle Route Choice—A Systematic Review , 2018, International journal of environmental research and public health.

[24]  A. Mobasheri,et al.  Utilizing Crowdsourced Data for Studies of Cycling and Air Pollution Exposure: A Case Study Using Strava Data , 2017, International journal of environmental research and public health.

[25]  Rod Katz,et al.  MODELING BICYCLE DEMAND AS A MAINSTREAM TRANSPORTATION PLANNING FUNCTION , 1995 .

[26]  Martin P. Seybold Robust Map Matching for Heterogeneous Data via Dominance Decompositions , 2017, SDM.

[27]  Dick Ettema,et al.  Big Data and Cycling , 2016 .

[28]  D. Watling,et al.  Big data and understanding change in the context of planning transport systems , 2019, Journal of Transport Geography.

[29]  S Bekhor,et al.  Modelling bicycle route choice using data from a GPS-assisted household survey , 2018 .

[30]  Christopher Pettit,et al.  Comparing spatial patterns of crowdsourced and conventional bicycling datasets , 2018 .