Multi-Instance Neural Network Architecture for Scene Classification in Remote Sensing

Scene classification is important problem in remote sensing since it is prerequisite to other more intelligent analysis operations. Often times, for a given scene only one part of it indicates which class it belongs to, whereas the other parts are either irrelevant or they actually tend to another class. To address this problem, we propose to divide the RS scene into multiple sub-images, consider each one as an instance of the original scene. In that case, we can view the scene as a bag of instances having the same label and we proposed a deep multi-instance learning (MIL) architecture for the classification problem. Our proposed deep MIL architecture extracts CNN features from each instance, then learns how to fuse them into one feature using weighted average layer in the network. The weights used in this layer are automatically learned by the network for each scene. We test the proposed multitask network on three popular scene datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show the promising capabilities of this solution at improving the classification accuracy by giving more weight to the feature extracted from the more informative scene instance.

[1]  Chengyi Wang,et al.  Remote Sensing Scene Classification Based on Convolutional Neural Networks Pre-Trained Using Attention-Guided Sparse Filters , 2018, Remote. Sens..

[2]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[3]  Lei Guo,et al.  Remote Sensing Image Scene Classification Using Bag of Convolutional Features , 2017, IEEE Geoscience and Remote Sensing Letters.

[4]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[5]  Xiangtao Zheng,et al.  Remote Sensing Scene Classification by Unsupervised Representation Learning , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Naif Alajlan,et al.  Domain Adaptation Network for Cross-Scene Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Xiaodong Mu,et al.  Scene classification of remote sensing image based on deep network grading transferring , 2018, Optik.

[8]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[10]  Shutao Li,et al.  Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.