Object Detection in Satellite Images Based on Active Learning Utilizing Visual Explanation

Convolutional neural networks (CNNs) have attracted much attention for object detection in satellite images. However, creating an annotated dataset requires a lot of time and user workload due to which the remote sensing domain has insufficient labeled datasets for training CNNs. We exploit an active learning (AL) framework for training CNNs with a small labeled dataset. AL obtains the training dataset by asking a human user to label the samples. For efficient AL, an intelligent query strategy is essential because the performance of a CNN depends on the collected dataset. Thus, in this study, we propose a query strategy to train CNNs effectively; this is done by choosing effective samples for training both the classifier and feature extractor. The strategy selects samples according to the gap between the classifier's prediction and visual explanation, which is the class discriminative part of an image derived from the extracted feature maps. Experimental result shows that a CNN trained with samples queried by our strategy had a 95% reduction in training samples requirement while maintaining 94% detection performance compared to a CNN trained with a complete dataset. Furthermore, the proposed strategy also reduced the required training samples by 30% compared to the conventional strategy to yield the same performance.

[1]  Jun Xu,et al.  A new committee-based active learning (CBAL) approach to hyperspectral remote sensing data classification , 2014 .

[2]  Jun Li,et al.  Active Learning With Convolutional Neural Networks for Hyperspectral Image Classification Using a New Bayesian Approach , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Yanfei Liu,et al.  SatCNN: satellite image dataset classification using agile convolutional neural networks , 2017 .

[5]  Yoshihiko Mochizuki,et al.  Detection by classification of buildings in multispectral satellite imagery , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[6]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[8]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[9]  Laurent Durieux,et al.  A method for monitoring building construction in urban sprawl areas using object-based analysis of Spot 5 images and existing GIS data , 2008 .

[10]  Geoffrey J. Hay,et al.  Object-based change detection , 2012 .

[11]  Silvio Savarese,et al.  Active Learning for Convolutional Neural Networks: A Core-Set Approach , 2017, ICLR.

[12]  Ruimao Zhang,et al.  Cost-Effective Active Learning for Deep Image Classification , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Mikhail F. Kanevski,et al.  A Survey of Active Learning Algorithms for Supervised Remote Sensing Image Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.