Region-based cascade pooling of convolutional features for HRRS image retrieval

ABSTRACT High-resolution remote sensing (HRRS) images contain abundant and complex visual contents. It is very important to extract powerful features to represent the complex contents of HRRS images in the image retrieval. This letter proposes a region-based cascade pooling (RBCP) method to aggregate convolutional features from both the pre-trained and the fine-tuned convolutional neural networks (CNNs). The RBCP method adopts small pooling regions, and first uses max-pooling on the feature maps of the last pooling layer, then employs average-pooling on the max-pooled feature maps. Furthermore, the feature map size is related to the input size, then two kinds of input sizes (required input size and original input size) are compared and analyzed for the RBCP features. The simulation results show that the RBCP features perform better than the features with traditional pooling methods, since multiple patch features can be extracted to describe the details of HRRS images. The RBCP method combines the advantages of max-pooling and average-pooling to extract discriminative features, thus it provides competitive results compared with state-of-the-art methods.

[1]  V. Lempitsky,et al.  Aggregating Local Deep Features for Image Retrieval , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[3]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Qi Tian,et al.  Good Practice in CNN Feature Transfer , 2016, ArXiv.

[5]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[6]  Paolo Napoletano,et al.  Visual descriptors for content-based retrieval of remote-sensing images , 2016, ArXiv.

[7]  Tong Zhang,et al.  Deep Learning Based Feature Selection for Remote Sensing Scene Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[8]  Xuelong Li,et al.  Local structure learning in high resolution remote sensing image retrieval , 2016, Neurocomputing.

[9]  Medeni Soysal,et al.  Performance Analysis of State-of-the-Art Representation Methods for Geographical Image Retrieval and Categorization , 2014, IEEE Geoscience and Remote Sensing Letters.

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Shawn D. Newsam,et al.  Geographic Image Retrieval Using Local Invariant Features , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Odej Kao,et al.  Retrieval of remotely sensed imagery using spectral information content , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[13]  Andrea Vedaldi,et al.  MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[14]  Jana Kosecka,et al.  Deep Convolutional Features for Image Based Retrieval and Scene Categorization , 2015, ArXiv.

[15]  Shawn D. Newsam,et al.  Learning Low Dimensional Convolutional Neural Networks for High-Resolution Remote Sensing Image Retrieval , 2016, Remote. Sens..

[16]  Tao Fang,et al.  Selective convolutional neural networks and cascade classifiers for remote sensing image classification , 2017 .