Remote Sensing Image Scene Classification with Noisy Label Distillation

The widespread applications of remote sensing image scene classification-based Convolutional Neural Networks (CNNs) are severely affected by the lack of large-scale datasets with clean annotations. Data crawled from the Internet or other sources allows for the most rapid expansion of existing datasets at a low-cost. However, directly training on such an expanded dataset can lead to network overfitting to noisy labels. Traditional methods typically divide this noisy dataset into multiple parts. Each part fine-tunes the network separately to improve performance further. These approaches are inefficient and sometimes even hurt performance. To address these problems, this study proposes a novel noisy label distillation method (NLD) based on the end-to-end teacher-student framework. First, unlike general knowledge distillation methods, NLD does not require pre-training on clean or noisy data. Second, NLD effectively distills knowledge from labels across a full range of noise levels for better performance. In addition, NLD can benefit from a fully clean dataset as a model distillation method to improve the student classifier’s performance. NLD is evaluated on three remote sensing image datasets, including UC Merced Land-use, NWPU-RESISC45, AID, in which a variety of noise patterns and noise amounts are injected. Experimental results show that NLD outperforms widely used directly fine-tuning methods and remote sensing pseudo-labeling methods.

[1]  Lijun Zhao,et al.  Remote Sensing Image Scene Classification Using CNN-CapsNet , 2019, Remote. Sens..

[2]  Jon Atli Benediktsson,et al.  Spatial Density Peak Clustering for Hyperspectral Image Classification With Noisy Labels , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.

[4]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Yang Wang,et al.  Deep Discriminative Representation Learning with Attention Map for Scene Classification , 2019, Remote. Sens..

[6]  Jefersson Alex dos Santos,et al.  Towards better exploiting convolutional neural networks for remote sensing scene classification , 2016, Pattern Recognit..

[7]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[8]  Lizhe Wang,et al.  A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[9]  Xianming Liu,et al.  Hyperspectral Image Classification in the Presence of Noisy Labels , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Hongxun Yao,et al.  Deep Feature Fusion for VHR Remote Sensing Scene Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Jordi Pont-Tuset,et al.  The Open Images Dataset V4 , 2018, International Journal of Computer Vision.

[12]  Nicolas Courty,et al.  An Entropic Optimal Transport Loss for Learning Deep Neural Networks under Label Noise in Remote Sensing Images , 2018, Comput. Vis. Image Underst..