Paper information - Treepedia 2.0: Applying Deep Learning for Large-Scale Quantification of Urban Tree Cover

Recent advances in deep learning have made it possible to quantify urban metrics at fine resolution and over large extents using street-level images. Here, we focus on measuring urban tree cover using Google Street View (GSV) images. First, we provide a small-scale labelled validation dataset and propose standard metrics for comparing automated estimates of street tree cover from GSV images. We then apply state-of-the-art deep learning models and compare their performance to a previously established benchmark based on an unsupervised method. Our training procedure for the deep learning models is novel: we use the abundance of openly available, similarly labelled street-level image datasets to pre-train our model, then perform additional training on a small training dataset of GSV images. We find that the deep learning models significantly outperform the unsupervised benchmark method. Our semantic segmentation model increased mean intersection-over-union (IoU) from 44.10% to 60.42% relative to the unsupervised method, and our end-to-end model decreased mean absolute error (MAE) from 10.04% to 4.67%. We also use a recently developed method, gradient-weighted class activation mapping (Grad-CAM), to interpret the features learned by the end-to-end model. This technique confirms that the end-to-end model has learned to identify areas of tree cover as the key features for predicting percentage tree cover. Our paper provides an example of applying advanced deep learning techniques to a large-scale, geo-tagged, image-based dataset to efficiently estimate important urban metrics. The results demonstrate that deep learning models are highly accurate, can be interpretable, and can also be efficient in terms of data-labelling effort and computational resources.
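The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes that each image is reduced to a binary mask (1 = vegetation pixel, 0 = anything else), that mean IoU is averaged per image, and that percentage tree cover is the share of vegetation pixels in the image. These assumptions and all function names are illustrative and are not taken from the paper.

```python
from typing import Sequence

# Assumption: a mask is a 2D grid of 0/1 values, 1 = vegetation pixel.
Mask = Sequence[Sequence[int]]


def iou(pred: Mask, truth: Mask) -> float:
    """Intersection-over-union of the vegetation class for one image."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Convention chosen here: two empty masks agree perfectly.
    return inter / union if union else 1.0


def mean_iou(preds: Sequence[Mask], truths: Sequence[Mask]) -> float:
    """Average of the per-image IoU values."""
    return sum(iou(p, t) for p, t in zip(preds, truths)) / len(preds)


def cover_percent(mask: Mask) -> float:
    """Percentage of pixels in the mask labelled as vegetation."""
    total = sum(len(row) for row in mask)
    return 100.0 * sum(sum(row) for row in mask) / total


def mean_absolute_error(pred_pcts: Sequence[float],
                        true_pcts: Sequence[float]) -> float:
    """Mean absolute difference between predicted and true cover, in percentage points."""
    return sum(abs(p - t) for p, t in zip(pred_pcts, true_pcts)) / len(pred_pcts)


# Hypothetical 2x2 example: the prediction over-estimates cover by one pixel.
pred = [[1, 1], [0, 0]]
truth = [[1, 0], [0, 0]]
print(iou(pred, truth))                      # 0.5
print(mean_absolute_error([cover_percent(pred)],
                          [cover_percent(truth)]))  # 25.0
```

Under these assumptions, IoU scores a segmentation model pixel by pixel, while MAE scores only the final cover percentage, so it also applies to an end-to-end model that predicts the percentage directly without producing a mask.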
