Progressive Cascaded Convolutional Neural Networks for Single Tree Detection with Google Earth Imagery

High-resolution remote sensing images can help forestry administrative departments carry out high-precision forest resource surveys, wood yield estimation and forest mapping, and can also provide decision-making support for urban greening projects. Many methods have been proposed for detecting single trees in remote sensing images. However, existing single tree detection methods produce many errors of commission and omission in complex scenes, when the pixel values of background and trees are similar, and when illumination shadows cause unclear canopy contours and abnormal crown shapes. To address these problems, this paper presents progressive cascaded convolutional neural networks for single tree detection with Google Earth imagery, which adopt three progressive classification branches to train on and detect tree samples of different classification difficulty. In this method, the feature extraction modules of three CNNs are progressively cascaded, and the network layer in each branch determines whether to filter the samples and feed them back to the feature extraction module, which improves the precision of single tree detection. In addition, a two-phase training mechanism is used to improve the efficiency of model training. To verify the validity and practicability of our method, three forest plots located in Hangzhou City, China; Phang Nga Province, Thailand; and Florida, USA were selected as test areas, and the tree detection results of four methods (region growing, template matching, a convolutional neural network and our progressive cascaded convolutional neural network) are presented. The results indicate that our method has the best detection performance: it achieves higher precision and recall and is robust to forest scenes of different complexity levels. Across the three plots, the F1 measure was 81.0%, an improvement of 14.5%, 18.9% and 5.0% over the region-growing, template-matching and convolutional neural network methods, respectively.
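
For readers unfamiliar with the evaluation terms used above, the following is a minimal sketch of how precision, recall and the F1 measure are conventionally computed from detection counts. It assumes the standard definitions, with errors of commission counted as false positives and errors of omission as false negatives; the function name `detection_scores` and the example counts are hypothetical and are not taken from the paper.

```python
def detection_scores(true_positives, false_positives, false_negatives):
    """Return (precision, recall, f1) from single tree detection counts.

    Assumed mapping (not stated in the paper):
      false_positives = errors of commission (non-trees reported as trees)
      false_negatives = errors of omission (reference trees that were missed)
    """
    detected = true_positives + false_positives    # all trees reported by the detector
    reference = true_positives + false_negatives   # all trees in the reference data
    precision = true_positives / detected if detected else 0.0
    recall = true_positives / reference if reference else 0.0
    total = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical counts for one plot, for illustration only.
precision, recall, f1 = detection_scores(
    true_positives=80, false_positives=20, false_negatives=20
)
print(f"precision={precision:.3f} recall={recall:.3f} F1={f1:.3f}")
```

Because F1 is a harmonic mean, a detector scores well only when both commission and omission errors are low, which is why it serves as a single summary figure when comparing methods.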
