How to deal with multi-source data for tree detection based on deep learning

In the field of remote sensing, it is very common to use data from several sensors in order to make classification or segmentation. Most of the standard Remote Sensing analysis use machine learning methods based on image descriptions as HOG or SIFT and a classifier as SVM. In recent years neural networks have emerged as a key tool regarding the detection of objects. Due to the heterogeneity of information (optical, infrared, LiDAR), the combination of multi-source data is still an open issue in the Remote Sensing field. In this paper, we focus on managing data from multiple sources for the task of localization of urban trees in multi-source (optical, infrared, DSM) aerial images and we evaluate the different effects of preprocessing on the input data of a CNN.

[1]  Christophe Garcia,et al.  Convolutional face finder: a neural architecture for fast and robust face detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[3]  Michael Cramer,et al.  The DGPF-Test on Digital Airborne Camera Evaluation - Over- view and Test Design , 2010 .

[4]  Xiao Xiang Zhu,et al.  Data Fusion and Remote Sensing: An ever-growing relationship , 2016, IEEE Geoscience and Remote Sensing Magazine.

[5]  G. Meyer,et al.  Machine Vision Identification of Plants , 2011 .

[6]  Harris Drucker,et al.  Learning algorithms for classification: A comparison on handwritten digit recognition , 1995 .

[7]  Yanchen Bo,et al.  Object-Based Crop Species Classification Based on the Combination of Airborne Hyperspectral Images and LiDAR Data , 2015, Remote. Sens..

[8]  Massimo Bertozzi,et al.  Stereo Vision-based approaches for Pedestrian Detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[9]  Marios Savvides,et al.  Multiple Scale Faster-RCNN Approach to Driver’s Cell-Phone Usage and Hands on Steering Wheel Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[10]  Amit K. Roy-Chowdhury,et al.  CNN based region proposals for efficient object detection , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[11]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[12]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[14]  E. Meyer Recent Trends for Enhancing the Diversity and Quality of Soybean Products , 2012 .

[15]  D. Roberts,et al.  Urban tree species mapping using hyperspectral and lidar data fusion , 2014 .

[16]  Sven Behnke,et al.  Multispectral Pedestrian Detection using Deep Fusion Convolutional Neural Networks , 2016, ESANN.

[17]  A. Bannari,et al.  Analyse de l'apport de deux indices de végétation à la classification dans les milieux hétérogènes , 1998 .

[18]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.