Noise-Tolerant Deep Neighborhood Embedding for Remotely Sensed Images With Label Noise

Recently, many deep learning-based methods have been developed for solving remote sensing (RS) scene classification or retrieval tasks. Most of the adopted loss functions for training these models require accurate annotations. However, the presence of noise in such annotations (also known as label noise) cannot be avoided in large-scale RS benchmark archives, resulting from geo-location/registration errors, land-cover changes, and diverse knowledge background of annotators. To overcome the influence of noisy labels on the learning process of deep models, we propose a new loss function called noise-tolerant deep neighborhood embedding which can accurately encode the semantic relationships among RS scenes. Specifically, we target at maximizing the leave-one-out $K$-NN score for uncovering the inherent neighborhood structure among the images in feature space. Moreover, we down-weight the contribution of potential noisy images by learning their localized structure and pruning the images with low leave-one-out $K$-NN scores. Based on our newly proposed loss function, classwise features can be more robustly discriminated. Our experiments, conducted on two benchmark RS datasets, validate the effectiveness of the proposed approach on three different RS scene interpretation tasks, including classification, clustering, and retrieval. The codes of this article will be publicly available from https://github.com/jiankang1991.

[1]  Ping Zhong,et al.  Diversity-Promoting Deep Structural Metric Learning for Remote Sensing Scene Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[2]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Rob Fergus,et al.  Learning from Noisy Labels with Deep Neural Networks , 2014, ICLR.

[4]  Antonio J. Plaza,et al.  Deep Unsupervised Embedding for Remotely Sensed Images Based on Spatially Augmented Momentum Contrast , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Ilkay Ulusoy,et al.  Image Classification with Deep Learning in the Presence of Noisy Labels: A Survey , 2019, ArXiv.

[6]  Yueming Lyu,et al.  Curriculum Loss: Robust Learning and Generalization against Label Corruption , 2019, ICLR.

[7]  Liangpei Zhang,et al.  Evaluation of Morphological Texture Features for Mangrove Forest Mapping and Species Discrimination Using Multispectral IKONOS Imagery , 2009, IEEE Geoscience and Remote Sensing Letters.

[8]  W. Marsden I and J , 2012 .

[9]  Lianru Gao,et al.  Graph Convolutional Networks for Hyperspectral Image Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Filiberto Pla,et al.  Multimodal Probabilistic Latent Semantic Analysis for Sentinel-1 and Sentinel-2 Image Fusion , 2018, IEEE Geoscience and Remote Sensing Letters.

[11]  Xudong Kang,et al.  Robust Normalized Softmax Loss for Deep Metric Learning-Based Characterization of Remote Sensing Images With Label Noise , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.

[13]  Zhen Ye,et al.  Deep Metric Learning Based on Scalable Neighborhood Components for Remote Sensing Scene Characterization , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Geoffrey E. Hinton,et al.  Neighbourhood Components Analysis , 2004, NIPS.

[15]  Lu Wang,et al.  Land-use scene classification using multi-scale completed local binary patterns , 2015, Signal, Image and Video Processing.

[16]  Rong Huang,et al.  Robust global registration of point clouds by closed-form solution in the frequency domain , 2021 .

[17]  Naoto Yokoya,et al.  An Augmented Linear Mixing Model to Address Spectral Variability for Hyperspectral Unmixing , 2018, IEEE Transactions on Image Processing.

[18]  Haiyan Gu,et al.  Object-oriented classification of high-resolution remote sensing imagery based on an improved colour structure code and a support vector machine , 2010 .

[19]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[20]  Anil M. Cheriyadat,et al.  Unsupervised Feature Learning for Aerial Scene Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[22]  Mert R. Sabuncu,et al.  Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels , 2018, NeurIPS.

[23]  Pedram Ghamisi,et al.  Texture-aware total variation-based removal of sun glint in hyperspectral images , 2020 .

[24]  Yusheng Xu,et al.  Registration of large-scale terrestrial laser scanner point clouds: A review and benchmark , 2020 .

[25]  Haifeng Li,et al.  RSI-CB: A Large Scale Remote Sensing Image Classification Benchmark via Crowdsource Data , 2017, ArXiv.

[26]  Naoto Yokoya,et al.  Learning-Shared Cross-Modality Representation Using Multispectral-LiDAR and Hyperspectral Data , 2019, IEEE Geoscience and Remote Sensing Letters.

[27]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Naif Alajlan,et al.  Land-Use Classification With Compressive Sensing Multifeature Fusion , 2015, IEEE Geoscience and Remote Sensing Letters.

[29]  Dumitru Erhan,et al.  Training Deep Neural Networks on Noisy Labels with Bootstrapping , 2014, ICLR.

[30]  Alexei A. Efros,et al.  Improving Generalization via Scalable Neighborhood Component Analysis , 2018, ECCV.

[31]  Gong Cheng,et al.  Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[32]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[33]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[34]  Laurent Itti,et al.  Saliency and Gist Features for Target Detection in Satellite Images , 2011, IEEE Transactions on Image Processing.

[35]  Aritra Ghosh,et al.  Robust Loss Functions under Label Noise for Deep Neural Networks , 2017, AAAI.

[36]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[37]  Jae-Gil Lee,et al.  Learning from Noisy Labels with Deep Neural Networks: A Survey , 2020, ArXiv.

[38]  Xiao Xiang Zhu,et al.  DiRS: On Creating Benchmark Datasets for Remote Sensing Image Interpretation , 2020, ArXiv.

[39]  Wen Yang,et al.  STRUCTURAL HIGH-RESOLUTION SATELLITE IMAGE INDEXING , 2010 .

[40]  Rong Huang,et al.  GraNet: Global Relation-aware Attentional Network for ALS Point Cloud Classification , 2020, ArXiv.

[41]  Hao-Yu Wu,et al.  Classification is a Strong Baseline for Deep Metric Learning , 2018, BMVC.

[42]  Jocelyn Chanussot,et al.  ORSIm Detector: A Novel Object Detection Framework in Optical Remote Sensing Imagery Using Spatial-Frequency Channel Features , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Cong Lin,et al.  Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Lei Guo,et al.  When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[45]  Junwei Han,et al.  Object detection in remote sensing imagery using a discriminatively trained mixture model , 2013 .

[46]  Shawn D. Newsam,et al.  Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery , 2008, 2008 15th IEEE International Conference on Image Processing.

[47]  Tong Zhang,et al.  Deep Learning Based Feature Selection for Remote Sensing Scene Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[48]  James Bailey,et al.  Symmetric Cross Entropy for Robust Learning With Noisy Labels , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[49]  Gang Wan,et al.  Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark , 2020, ISPRS Journal of Photogrammetry and Remote Sensing.

[50]  Stefanos Zafeiriou,et al.  ArcFace: Additive Angular Margin Loss for Deep Face Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Li Yan,et al.  Cross-Domain Distance Metric Learning Framework With Limited Target Samples for Scene Classification of Aerial Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Xiao Xiang Zhu,et al.  Building Instance Classification Using Street View Images , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[53]  Kun Fu,et al.  FMSSD: Feature-Merged Single-Shot Detection for Multiscale Objects in Large-Scale Remote Sensing Imagery , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[54]  Xuelong Li,et al.  Scene Classification With Recurrent Attention of VHR Remote Sensing Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[55]  Devis Tuia,et al.  OpenStreetMap: Challenges and Opportunities in Machine Learning and Remote Sensing , 2020, IEEE Geoscience and Remote Sensing Magazine.

[56]  Lei Zhang,et al.  Reweighted Tensor Factorization Method for SAR Narrowband and Wideband Interference Mitigation Using Smoothing Multiview Tensor Model , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[57]  Seong-Whan Lee,et al.  Coarse-to-Fine Deep Metric Learning for Remote Sensing Image Retrieval , 2020, Remote. Sens..

[58]  Filiberto Pla,et al.  Hyperspectral Unmixing Based on Dual-Depth Sparse Probabilistic Latent Semantic Analysis , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[59]  Naoto Yokoya,et al.  Learning from Multimodal and Multitemporal Earth Observation Data for Building Damage Mapping , 2020, ISPRS Journal of Photogrammetry and Remote Sensing.

[60]  Peijun Du,et al.  Mid-Level Feature Representation via Sparse Autoencoder for Remotely Sensed Scene Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[61]  Joan Bruna,et al.  Training Convolutional Networks with Noisy Labels , 2014, ICLR 2014.

[62]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[63]  Naoto Yokoya,et al.  Learning Convolutional Sparse Coding on Complex Domain for Interferometric Phase Restoration , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[64]  Yusheng Xu,et al.  Pairwise coarse registration of point clouds in urban scenes using voxel-based 4-planes congruent sets , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[65]  Lei Guo,et al.  Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[66]  Naoto Yokoya,et al.  More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[67]  Zhiyuan Yan,et al.  SRAF-Net: Shape Robust Anchor-Free Network for Garbage Dumps in Remote Sensing Imagery , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[68]  Qingshan Liu,et al.  Learning Multiscale Deep Features for High-Resolution Satellite Image Scene Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[69]  Naoto Yokoya,et al.  Breaking Limits of Remote Sensing by Deep Learning From Simulated Data for Flood and Debris-Flow Mapping , 2022, IEEE Transactions on Geoscience and Remote Sensing.

[70]  Xiangtao Zheng,et al.  A Deep Scene Representation for Aerial Scene Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[71]  Zhen Ye,et al.  Robust segmentation and localization of structural planes from photogrammetric point clouds in construction sites , 2020 .

[72]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[73]  Jiebo Luo,et al.  DOTA: A Large-Scale Dataset for Object Detection in Aerial Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[74]  Rong Huang,et al.  Deep point embedding for urban classification using ALS point clouds: A new perspective from local to global , 2020 .

[75]  Geoffrey E. Hinton,et al.  Learning to Label Aerial Images from Noisy Data , 2012, ICML.