Explainable, automated urban interventions to improve pedestrian and vehicle safety

Abstract At the moment, urban mobility research and governmental initiatives focus mostly on motor-related issues, e.g. congestion and pollution. And yet, we cannot disregard the most vulnerable elements in the urban landscape: pedestrians, who are exposed to higher risks than other road users. Indeed, safe, accessible, and sustainable transport systems in cities are a core target of the UN’s 2030 Agenda. Thus, there is an opportunity to apply advanced computational tools to the problem of traffic safety, especially with regard to pedestrians, who have often been overlooked in the past. This paper combines public data sources, large-scale street imagery, and computer vision techniques to approach pedestrian and vehicle safety with an automated, relatively simple, and universally applicable data-processing scheme. The steps in this pipeline include the adaptation and training of a Residual Convolutional Neural Network to determine a hazard index for each given urban scene, as well as an interpretability analysis based on image segmentation and class activation mapping on those same images. Combined, the outcome of this computational approach is a fine-grained map of hazard levels across a city, and a heuristic to identify interventions that might simultaneously improve pedestrian and vehicle safety. The proposed framework should be taken as a complement to the work of urban planners and public authorities.
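
As a rough illustration of the interpretability step mentioned in the abstract, the sketch below computes a class activation map as a weighted sum of feature maps and then measures how much of the positive activation falls on each segmentation label. This is not the authors' implementation: the function names, the toy feature maps, the weights, and the labels are all hypothetical, and combining the map with the segmentation as a per-label share is an assumption made here for illustration only.

```python
from collections import defaultdict


def class_activation_map(feature_maps, class_weights):
    """Weighted sum of feature maps: cam[y][x] = sum_k w[k] * f[k][y][x]."""
    height = len(feature_maps[0])
    width = len(feature_maps[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for weight, fmap in zip(class_weights, feature_maps):
        for y in range(height):
            for x in range(width):
                cam[y][x] += weight * fmap[y][x]
    return cam


def activation_share_by_label(cam, segmentation):
    """Fraction of the positive activation mass that falls on each label."""
    mass = defaultdict(float)
    for cam_row, seg_row in zip(cam, segmentation):
        for value, label in zip(cam_row, seg_row):
            # Negative activations are clipped to zero before accumulating.
            mass[label] += max(value, 0.0)
    total = sum(mass.values())
    if total == 0:
        return {label: 0.0 for label in mass}
    return {label: m / total for label, m in mass.items()}


# Toy 2x2 example (hypothetical values): two feature maps from a final
# convolutional layer, the weights linking them to one hazard class, and a
# per-pixel segmentation of the same scene.
feature_maps = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 2.0], [0.0, 1.0]],
]
class_weights = [0.5, 1.0]
segmentation = [["road", "car"], ["sidewalk", "car"]]

cam = class_activation_map(feature_maps, class_weights)
shares = activation_share_by_label(cam, segmentation)
print(cam)
print(shares)
```

In a setting like the one described, labels that attract a large share of the activation for high-hazard scenes could point to candidate elements for intervention, though any such reading would need to be checked by urban planners.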
