Diverse Capsules Network Combining Multiconvolutional Layers for Remote Sensing Image Scene Classification

Remote sensing image scene classification has drawn significant attention for its potential applications in the economy and livelihoods. Unlike the traditional handcrafted features, the convolutional neural networks provide an excellent avenue in obtaining powerful discriminative features. Although tremendous efforts have been made so far in this domain, there are still many open challenges in scene classification due to the scene complexity with higher within-class diversity and between-class similarity. To solve the above-mentioned problems, DcapsulesNet (D-CapsNet) is proposed to learn the richer and more robust features for scene classification. It is an end to end network with four types of layers and incorporates visual attention mechanisms. Its diverse capsules encode different properties of complex image scenes, including deep high-level features, spatial attention based on the fusion of multilayers features, both spatial and channel attention based on high-level features, and their fusion. Experiments on three image scene datasets demonstrate that D-CapsNet outperforms other baselines and state-of-the-art methods with a significant improvement in both classification accuracy and speed.

[1]  Xuelong Li,et al.  A Hybrid Sparsity and Distance-Based Discrimination Detector for Hyperspectral Images , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Koray Kavukcuoglu,et al.  Multiple Object Recognition with Visual Attention , 2014, ICLR.

[3]  Qian Du,et al.  Fusing Local and Global Features for High-Resolution Scene Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[4]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[5]  Yanfei Liu,et al.  Scene Classification Based on a Deep Random-Scale Stretched Convolutional Neural Network , 2018, Remote. Sens..

[6]  Gui-Song Xia,et al.  Dirichlet-Derived Multiple Topic Scene Classification Model for High Spatial Resolution Remote Sensing Imagery , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Yong Xu,et al.  Capsule Routing for Sound Event Detection , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[8]  Qian Du,et al.  Remote Sensing Image Scene Classification Using Multi-Scale Completed Local Binary Patterns and Fisher Vectors , 2016, Remote. Sens..

[9]  Hien Van Nguyen,et al.  Fast CapsNet for Lung Cancer Screening , 2018, MICCAI.

[10]  Gong Cheng,et al.  P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained Visual Categorization , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Zhicheng Zhao,et al.  Saliency-based Deep Multi-level Semantic Feature Fusion for Person Re-identification , 2019, 2019 IEEE Visual Communications and Image Processing (VCIP).

[13]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[14]  Liangpei Zhang,et al.  Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[15]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Xuelong Li,et al.  Attention Based Network for Remote Sensing Scene Classification , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[17]  Jefersson Alex dos Santos,et al.  Towards better exploiting convolutional neural networks for remote sensing scene classification , 2016, Pattern Recognit..

[18]  Junwei Han,et al.  Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA , 2013 .

[19]  Junwei Han,et al.  Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Xiangtao Zheng,et al.  Remote Sensing Scene Classification by Gated Bidirectional Network , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Jefersson Alex dos Santos,et al.  Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[22]  Junwei Han,et al.  Multi-class geospatial object detection and geographic image classification based on collection of part detectors , 2014 .

[23]  Lianru Gao,et al.  Building Extraction from High-Resolution Aerial Imagery Using a Generative Adversarial Network with Spatial and Channel Attention Mechanisms , 2019, Remote. Sens..

[24]  Lei Guo,et al.  When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[25]  Hong Yan,et al.  Multi-deep features fusion for high-resolution remote sensing image scene classification , 2020, Neural Computing and Applications.

[26]  Ping Zhong,et al.  An Unsupervised Convolutional Feature Fusion Network for Deep Representation of Remote Sensing Images , 2018, IEEE Geoscience and Remote Sensing Letters.

[27]  Chao Huang,et al.  Scene Classification via Triplet Networks , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[28]  Lei Guo,et al.  Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images , 2015, IET Comput. Vis..

[29]  Yuanyuan Liu,et al.  Deep Salient Feature Based Anti-Noise Transfer Network for Scene Classification of Remote Sensing Imagery , 2018, Remote. Sens..

[30]  Xueming Qian,et al.  Semantic Annotation of High-Resolution Satellite Images via Weakly Supervised Learning , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[32]  Christian Wolf,et al.  Sequential Deep Learning for Human Action Recognition , 2011, HBU.

[33]  Tat-Seng Chua,et al.  SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.

[35]  Jun Zhou,et al.  Multiscale Visual Attention Networks for Object Detection in VHR Remote Sensing Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[36]  Filiberto Pla,et al.  Capsule Networks for Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Chao Yang,et al.  A Multiscale Deeply Described Correlatons-Based Model for Land-Use Scene Classification , 2017, Remote. Sens..

[38]  Wenzhong Guo,et al.  Land-Use Classification via Extreme Learning Classifier Based on Deep Convolutional Features , 2017, IEEE Geoscience and Remote Sensing Letters.

[39]  Luisa Verdoliva,et al.  Land Use Classification in Remote Sensing Images by Convolutional Neural Networks , 2015, ArXiv.

[40]  Stanton L. Martin,et al.  Applications of hyperspectral image analysis for precision agriculture , 2018, Defense + Security.

[41]  Hongxun Yao,et al.  Deep feature extraction and combination for remote sensing image classification based on pre-trained CNN models , 2017, International Conference on Digital Image Processing.

[42]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[44]  Jefersson Alex dos Santos,et al.  Evaluating the Potential of Texture and Color Descriptors for Remote Sensing Image Retrieval and Classification , 2010, VISAPP.

[45]  Yunlong Yu,et al.  Aerial Scene Classification via Multilevel Fusion Based on Deep Convolutional Neural Networks , 2018, IEEE Geoscience and Remote Sensing Letters.

[46]  Konstantinos N. Plataniotis,et al.  Brain Tumor Type Classification via Capsule Networks , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[47]  Yunlong Yu,et al.  A Two-Stream Deep Fusion Framework for High-Resolution Aerial Scene Classification , 2018, Comput. Intell. Neurosci..

[48]  Mihai Datcu,et al.  Latent Dirichlet Allocation for Spatial Analysis of Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[49]  Gui-Song Xia,et al.  Bag-of-Visual-Words Scene Classifier With Local and Global Features for High Spatial Resolution Remote Sensing Imagery , 2016, IEEE Geoscience and Remote Sensing Letters.

[50]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[51]  Xiwen Yao,et al.  Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images , 2021, IEEE Geoscience and Remote Sensing Letters.

[52]  Ulas Bagci,et al.  Capsules for Object Segmentation , 2018, ArXiv.

[53]  Zhou Yang,et al.  Multi-feature Fusion for High Resolution Aerial Scene Image Classification , 2019 .

[54]  Runsheng Wang,et al.  Texture description based on multiresolution moments of image histograms , 2008 .

[55]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Xiangtao Zheng,et al.  Semantic Descriptions of High-Resolution Remote Sensing Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[57]  Chao Yang,et al.  Concentric Circle Pooling in Deep Convolutional Networks for Remote Sensing Scene Classification , 2018, Remote. Sens..

[58]  Shiming Xiang,et al.  Aggregating Rich Hierarchical Features for Scene Classification in Remote Sensing Imagery , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[59]  Dan Zeng,et al.  Improving Remote Sensing Scene Classification by Integrating Global-Context and Local-Object Features , 2018, Remote. Sens..

[60]  Xiangtao Zheng,et al.  Joint Dictionary Learning for Multispectral Change Detection , 2017, IEEE Transactions on Cybernetics.

[61]  Xuelong Li,et al.  Bidirectional Adaptive Feature Fusion for Remote Sensing Scene Classification , 2017, CCCV.

[62]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[63]  Cong Lin,et al.  Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[64]  Mohamed Abdel-Mottaleb,et al.  Discriminant Correlation Analysis: Real-Time Feature Level Fusion for Multimodal Biometric Recognition , 2016, IEEE Transactions on Information Forensics and Security.

[65]  Xiangtao Zheng,et al.  A target detection method for hyperspectral image based on mixture noise model , 2016, Neurocomputing.

[66]  Shutao Li,et al.  Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[67]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[68]  Shawn D. Newsam,et al.  Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery , 2008, 2008 15th IEEE International Conference on Image Processing.

[69]  Hong Huo,et al.  Global-Local Attention Network for Aerial Scene Classification , 2019, IEEE Access.

[70]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[71]  Hao Sun,et al.  A Feature Aggregation Convolutional Neural Network for Remote Sensing Scene Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[72]  Jin Wang,et al.  Training Convolutional Neural Networks with Multi-Size Images and Triplet Loss for Remote Sensing Scene Classification , 2020, Sensors.