Negative Bootstrapping for Weakly Supervised Target Detection in Remote Sensing Images

When training a classifier in a traditional weakly supervised learning scheme, negative samples are obtained by randomly sampling. However, it may bring deterioration or fluctuation for the performance of the classifier during the iterative training process. Considering a classifier is inclined to misclassify negative examples which resemble positive ones, comprising these misclassified and informative negatives should be important for enhancing the effectiveness and robustness of the classifier. In this paper, we propose to integrate Negative Bootstrapping scheme into weakly supervised learning framework to achieve effective target detection in remote sensing images. Compared with traditional weakly supervised target detection schemes, this method mainly has three advantages. Firstly, our model training framework converges more stable and faster by selecting the most discriminative training samples. Secondly, on each iteration, we utilize the negative samples which are most easily misclassified to refine target detector, obtaining better performance. Thirdly, we employ a pre-trained convolutional neural network (CNN) model named Caffe to extract high-level features from RSIs, which carry more semantic meanings and hence yield effective image representation. Comprehensive evaluations on a high resolution airplane dataset and comparisons with state-of-the-art weakly supervised target detection approaches demonstrate the effectiveness and robustness of the proposed method.

[1]  Laurent Itti,et al.  Saliency and Gist Features for Target Detection in Satellite Images , 2011, IEEE Transactions on Image Processing.

[2]  Tao Xiang,et al.  Bayesian Joint Topic Modelling for Weakly Supervised Object Localisation , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  Junwei Han,et al.  Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA , 2013 .

[4]  Lining Gao,et al.  A Visual Search Inspired Computational Model for Ship Detection in Optical Satellite Images , 2012, IEEE Geoscience and Remote Sensing Letters.

[5]  Marcel Worring,et al.  Bootstrapping Visual Categorization With Relevant Negatives , 2013, IEEE Transactions on Multimedia.

[6]  B. S. Manjunath,et al.  Modeling and Detection of Geospatial Objects Using Texture Motifs , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Lawrence Carin,et al.  Detection of Unexploded Ordnance via Efficient Semisupervised and Active Learning , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Jinchang Ren,et al.  Object-Based 2D-to-3D Video Conversion for Effective Stereoscopic Content Generation in 3D-TV Applications , 2011, IEEE Transactions on Broadcasting.

[9]  Yu Li,et al.  Automatic Target Detection in High-Resolution Remote Sensing Images Using a Contour-Based Spatial Model , 2012, IEEE Geoscience and Remote Sensing Letters.

[10]  Junwei Han,et al.  Efficient, simultaneous detection of multi-class geospatial targets based on visual saliency modeling and discriminative learning of sparse coding , 2014 .

[11]  Xiaofeng Li,et al.  Straight road edge detection from high-resolution remote sensing images based on the ridgelet transform with the revised parallel-beam Radon transform , 2010 .

[12]  Junwei Han,et al.  Multi-class geospatial object detection and geographic image classification based on collection of part detectors , 2014 .

[13]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[14]  Cem Ünsalan,et al.  Urban Area Detection Using Local Feature Points and Spatial Voting , 2010, IEEE Geoscience and Remote Sensing Letters.

[15]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Marcel Worring,et al.  Social negative bootstrapping for visual categorization , 2011, ICMR '11.

[17]  Carlos López-Martínez,et al.  A novel algorithm for ship detection in SAR imagery based on the wavelet transform , 2005, IEEE Geoscience and Remote Sensing Letters.

[18]  Tao Xiang,et al.  Weakly supervised object detector learning with model drift detection , 2011, 2011 International Conference on Computer Vision.

[19]  Tao Xiang,et al.  In Defence of Negative Mining for Annotating Weakly Labelled Data , 2012, ECCV.

[20]  Cem Ünsalan,et al.  Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Hong Huo,et al.  Rotation-Invariant Object Detection of Remotely Sensed Images Based on Texton Forest and Hough Voting , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[22]  Lei Guo,et al.  Scalable multi-class geospatial object detection in high-spatial-resolution remote sensing images , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[23]  Peijun Li,et al.  Urban building damage detection from very high resolution imagery using OCSVM and spatial features , 2010 .

[24]  Junwei Han,et al.  Object detection in remote sensing imagery using a discriminatively trained mixture model , 2013 .

[25]  Lei Guo,et al.  Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[27]  Feng Wu,et al.  Background Prior-Based Salient Object Detection via Deep Reconstruction Residual , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Andrea Garzelli,et al.  Target Detection With Semisupervised Kernel Orthogonal Subspace Projection , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Junwei Han,et al.  Novel Folded-PCA for improved feature extraction and data reduction with hyperspectral imaging and SAR in remote sensing , 2014 .

[30]  Yihua Tan,et al.  Airport Detection From Large IKONOS Images Using Clustered SIFT Keypoints and Region Information , 2011, IEEE Geoscience and Remote Sensing Letters.

[31]  Yu Li,et al.  Automatic Target Detection in High-Resolution Remote Sensing Images Using Spatial Sparse Coding Bag-of-Words Model , 2012, IEEE Geoscience and Remote Sensing Letters.

[32]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[33]  Lei Guo,et al.  Weakly Supervised Learning for Target Detection in Remote Sensing Images , 2015, IEEE Geoscience and Remote Sensing Letters.

[34]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[35]  Jinchang Ren,et al.  Hierarchical Modeling and Adaptive Clustering for Real-Time Summarization of Rush Videos , 2009, IEEE Transactions on Multimedia.