Estimating city-level travel patterns using street imagery: A case study of using Google Street View in Britain

Background Street imagery is a promising and growing big data source providing current and historical images in more than 100 countries. Studies have reported using this data to audit road infrastructure and other built environment features. Here we explore a novel application, using Google Street View (GSV) to predict travel patterns at the city level. Methods We sampled 34 cities in Great Britain. In each city, we accessed 2000 GSV images from 1000 random locations. We selected archived images from time periods overlapping with the 2011 Census and the 2011–2013 Active People Survey (APS). We manually annotated the images into seven categories of road users. We developed regression models with the counts of images of road users as predictors. The outcomes included Census-reported commute shares of four modes (combined walking plus public transport, cycling, motorcycle, and car), as well as APS-reported past-month participation in walking and cycling. Results We found high correlations between GSV counts of cyclists (‘GSV-cyclists’) and cycle commute mode share (r = 0.92)/past-month cycling (r = 0.90). Likewise, GSV-pedestrians was moderately correlated with past-month walking for transport (r = 0.46), GSV-motorcycles was moderately correlated with commute share of motorcycles (r = 0.44), and GSV-buses was highly correlated with commute share of walking plus public transport (r = 0.81). GSV-car was not correlated with car commute mode share (r = –0.12). However, in multivariable regression models, all outcomes were predicted well, except past-month walking. The prediction performance was measured using cross-validation analyses. GSV-buses and GSV-cyclists are the strongest predictors for most outcomes. Conclusions GSV images are a promising new big data source to predict urban mobility patterns. Predictive power was the greatest for those modes that varied the most (cycle and bus). With its ability to identify mode of travel and capture street activity often excluded in routinely carried out surveys, GSV has the potential to be complementary to new and traditional data. With half the world’s population covered by street imagery, and with up to 10 years historical data available in GSV, further testing across multiple settings is warranted both for cross-sectional and longitudinal assessments.

[1]  Yee Leung,et al.  Applying mobile phone data to travel behaviour research: A literature review , 2017 .

[2]  Rahul Goel,et al.  Contextualising Safety in Numbers: a longitudinal investigation into change in cycling safety in Britain, 1991–2001 and 2001–2011 , 2017, Injury Prevention.

[3]  Carlo Ratti,et al.  Understanding individual mobility patterns from urban sensing data: A mobile phone trace example , 2013 .

[4]  Deborah Estrin,et al.  Using mobile phones to determine transportation modes , 2010, TOSN.

[5]  J. Pucher,et al.  Bicycling renaissance in North America? An update and re-appraisal of cycling trends and policies , 2011 .

[6]  David Banister,et al.  The sustainable mobility paradigm , 2008 .

[7]  S. Ferrari,et al.  On beta regression residuals , 2008 .

[8]  Carlos Monteiro,et al.  Health impact modelling of different travel patterns on physical activity, air pollution and road injuries for São Paulo, Brazil , 2017, Environment international.

[9]  Norbert Brändle,et al.  Supporting large-scale travel surveys with smartphones – A practical approach , 2014 .

[10]  Carrie M. Geremia,et al.  Using an audit tool (MAPS Global) to assess the characteristics of the physical environment related to walking for transport in youth: reliability of Belgian data , 2016, International Journal of Health Geographics.

[11]  Ming Wen,et al.  Neighbourhood looking glass: 360º automated characterisation of the built environment for neighbourhood effects research , 2018, Journal of Epidemiology & Community Health.

[12]  Jennifer Ailshire,et al.  Using Google Earth to conduct a neighborhood audit: reliability of a virtual audit instrument. , 2010, Health & place.

[13]  S. Ferrari,et al.  Beta Regression for Modelling Rates and Proportions , 2004 .

[14]  A. Zeileis,et al.  Beta Regression in R , 2010 .

[15]  P. Tucker,et al.  The effect of season and weather on physical activity: a systematic review. , 2007, Public health.

[16]  H. Badland,et al.  Can Virtual Streetscape Audits Reliably Replace Physical Streetscape Audits? , 2010, Journal of Urban Health.

[17]  Andrew Curtis,et al.  Using google street view for systematic observation of the built environment: analysis of spatio-temporal instability of imagery dates , 2013, International Journal of Health Geographics.

[18]  Jeffrey S. Wilson,et al.  Using Google Street View to Audit the Built Environment: Inter-rater Reliability Results , 2013, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[19]  Trisalyn A. Nelson,et al.  Mapping ridership using crowdsourced cycling data , 2016 .

[20]  A. Goodman Walking, Cycling and Driving to Work in the English and Welsh 2011 Census: Trends, Socio-Economic Patterning and Relevance to Travel Behaviour in General , 2013, PloS one.

[21]  Zhenfeng Shao,et al.  ‘Big data’ for pedestrian volume: Exploring the use of Google Street View images for pedestrian counts , 2015 .

[22]  Ying Long,et al.  How green are the streets? An analysis for central areas of Chinese cities using Tencent Street View , 2017, PloS one.

[23]  Deepti Adlakha,et al.  Emerging technologies: webcams and crowd-sourcing to identify active transportation. , 2013, American journal of preventive medicine.

[24]  Jackelyn Hwang Invited Commentary: Observing Neighborhood Physical Disorder in an Age of Technological Innovation. , 2017, American journal of epidemiology.

[25]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[26]  Anna Goodman,et al.  Does More Cycling Mean More Diversity in Cycling? , 2016 .

[27]  AnguelovDragomir,et al.  Google Street View , 2010 .

[28]  David M. Allen,et al.  The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction , 1974 .

[29]  J. Leskovec,et al.  Large-scale physical activity data reveal worldwide activity inequality , 2017, Nature.

[30]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[31]  Douglas K Miller,et al.  Assessing the built environment using omnidirectional imagery. , 2012, American journal of preventive medicine.

[32]  Julien O. Teitler,et al.  Using Google Street View to audit neighborhood environments. , 2011, American journal of preventive medicine.

[33]  Carlo Ratti,et al.  Exploring Universal Patterns in Human Home-Work Commuting from Mobile Phone Data , 2013, PloS one.

[34]  Francisco Cribari‐Neto Beta Regression in , 2010 .