Treepedia 2.0: Applying Deep Learning for Large-Scale Quantification of Urban Tree Cover

Recent advances in deep learning have made it possible to quantify urban metrics at fine resolution and over large extents using street-level images. Here, we focus on measuring urban tree cover using Google Street View (GSV) images. First, we provide a small-scale labelled validation dataset and propose standard metrics to compare the performance of automated estimations of street tree cover using GSV. We apply state-of-the-art deep learning models and compare their performance to a previously established benchmark based on an unsupervised method. Our training procedure for deep learning models is novel: we use the abundance of openly available and similarly labelled street-level image datasets to pre-train our model, and then perform additional training on a small training dataset consisting of GSV images. We find that deep learning models significantly outperform the unsupervised benchmark method. Relative to the unsupervised method, our semantic segmentation model increased mean intersection-over-union (IoU) from 44.10% to 60.42%, and our end-to-end model decreased mean absolute error (MAE) from 10.04% to 4.67%. We also employ a recently developed method called gradient-weighted class activation mapping (Grad-CAM) to interpret the features learned by the end-to-end model. This technique confirms that the end-to-end model has accurately learned to identify tree cover areas as key features for predicting percentage tree cover. Our paper provides an example of applying advanced deep learning techniques to a large-scale, geo-tagged, image-based dataset to efficiently estimate important urban metrics. The results demonstrate that deep learning models are highly accurate, can be interpretable, and can also be efficient in terms of data-labelling effort and computational resources.
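
As an illustration of the two evaluation metrics named above, the following minimal Python sketch computes IoU between binary vegetation masks and the mean absolute error between predicted and labelled percentage tree cover. It rests on several assumptions that are not confirmed by the abstract: masks are binary (1 = vegetation pixel), mean IoU is taken as the average of per-image IoU values, percentage tree cover is the share of vegetation pixels in an image, and an image with no vegetation in either mask is scored as IoU 1.0. The function names (`iou`, `mean_iou`, `percent_cover`, `mean_absolute_error`) are hypothetical and chosen for this example only.

```python
def _flatten(mask):
    """Flatten a 2D mask (list of rows) into a list of booleans."""
    return [bool(value) for row in mask for value in row]


def iou(pred_mask, true_mask):
    """Intersection-over-union of two binary masks of equal shape."""
    pred, true = _flatten(pred_mask), _flatten(true_mask)
    intersection = sum(p and t for p, t in zip(pred, true))
    union = sum(p or t for p, t in zip(pred, true))
    # Assumption: two empty masks count as perfect agreement.
    return 1.0 if union == 0 else intersection / union


def mean_iou(pred_masks, true_masks):
    """Average of per-image IoU (an assumed averaging convention)."""
    scores = [iou(p, t) for p, t in zip(pred_masks, true_masks)]
    return sum(scores) / len(scores)


def percent_cover(mask):
    """Percentage of pixels labelled as vegetation in one mask."""
    flat = _flatten(mask)
    return 100.0 * sum(flat) / len(flat)


def mean_absolute_error(pred_cover, true_cover):
    """Mean absolute difference between predicted and labelled cover (%)."""
    errors = [abs(p - t) for p, t in zip(pred_cover, true_cover)]
    return sum(errors) / len(errors)


# Toy 2x4 image: the prediction misses one labelled vegetation pixel.
true_mask = [[1, 1, 0, 0],
             [1, 1, 0, 0]]
pred_mask = [[1, 0, 0, 0],
             [1, 1, 0, 0]]

example_iou = iou(pred_mask, true_mask)
example_mae = mean_absolute_error(
    [percent_cover(pred_mask)], [percent_cover(true_mask)]
)
```

In this toy case the intersection is 3 pixels and the union is 4, so the IoU is 0.75; the predicted cover is 37.5% against a labelled 50%, giving an absolute error of 12.5 percentage points. A segmentation model would typically be scored with both metrics (its masks yield a cover estimate), whereas an end-to-end model that predicts cover directly can only be scored with the error on percentage cover.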