Hyperspectral Image Classification in the Presence of Noisy Labels

Label information plays an important role in a supervised hyperspectral image classification problem. However, current classification methods all ignore an important and inevitable problem—labels may be corrupted and collecting clean labels for training samples is difficult and often impractical. Therefore, how to learn from the database with noisy labels is a problem of great practical importance. In this paper, we study the influence of label noise on hyperspectral image classification and develop a random label propagation algorithm (RLPA) to cleanse the label noise. The key idea of RLPA is to exploit knowledge (e.g., the superpixel-based spectral–spatial constraints) from the observed hyperspectral images and apply it to the process of label propagation. Specifically, the RLPA first constructs a spectral–spatial probability transform matrix (SSPTM) that simultaneously considers the spectral similarity and superpixel-based spatial information. It then randomly chooses some training samples as “clean” samples and sets the rest as unlabeled samples, and propagates the label information from the “clean” samples to the rest unlabeled samples with the SSPTM. By repeating the random assignment (of “clean” labeled samples and unlabeled samples) and propagation, we can obtain multiple labels for each training sample. Therefore, the final propagated label can be calculated by a majority vote algorithm. Experimental studies show that the RLPA can reduce the level of noisy label and demonstrates the advantages of our proposed method over four major classifiers with a significant margin—the gains in terms of the average overall accuracy, average accuracy, and kappa are impressive, e.g., 9.18%, 9.58%, and 0.1043. The MATLAB source code is available at https://github.com/junjun-jiang/RLPA.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Bor-Chen Kuo,et al.  Feature Mining for Hyperspectral Image Classification , 2013, Proceedings of the IEEE.

[3]  James E. Fowler,et al.  Information Fusion in the Redundant-Wavelet-Transform Domain for Noise-Robust Hyperspectral Classification , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Jon Atli Benediktsson,et al.  Sensitivity of Support Vector Machines to Random Feature Selection in Classification of Hyperspectral Data , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Bin Wang,et al.  A Novel Spatial–Spectral Similarity Measure for Dimensionality Reduction and Classification of Hyperspectral Imagery , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Peijun Du,et al.  Hyperspectral Remote Sensing Image Classification Based on Rotation Forest , 2014, IEEE Geoscience and Remote Sensing Letters.

[9]  David A. Landgrebe,et al.  Signal Theory Methods in Multispectral Remote Sensing , 2003 .

[10]  Xiangtao Zheng,et al.  Dimensionality Reduction by Spatial–Spectral Preservation in Selected Bands , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Shutao Li,et al.  Multiscale Superpixel-Based Sparse Representation for Hyperspectral Image Classification , 2017, Remote. Sens..

[12]  Jon Atli Benediktsson,et al.  Spectral–Spatial Hyperspectral Image Classification With Edge-Preserving Filtering , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Shutao Li,et al.  Extinction Profiles Fusion for Hyperspectral Images Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Nagarajan Natarajan,et al.  Learning with Noisy Labels , 2013, NIPS.

[15]  D. Angluin,et al.  Learning From Noisy Examples , 1988, Machine Learning.

[16]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[17]  Jun Li,et al.  ${{\rm E}^{2}}{\rm LMs}$ : Ensemble Extreme Learning Machines for Hyperspectral Image Classification , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[18]  Chuan Long,et al.  Boosting Noisy Data , 2001, ICML.

[19]  Jun Huang,et al.  Hyperspectral image denoising using the robust low-rank tensor recovery. , 2015, Journal of the Optical Society of America. A, Optics, image science, and vision.

[20]  Junjun Jiang,et al.  Guided Locality Preserving Feature Matching for Remote Sensing Image Registration , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Lizhe Wang,et al.  SuperPCA: A Superpixelwise PCA Approach for Unsupervised Feature Extraction of Hyperspectral Imagery , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[22]  M. Verleysen,et al.  Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Qi Wang,et al.  Salient Band Selection for Hyperspectral Image Classification via Manifold Ranking , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[25]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Yi Yu,et al.  Spatial-Aware Collaborative Representation for Hyperspectral Remote Sensing Image Classification , 2017, IEEE Geoscience and Remote Sensing Letters.

[27]  Shen-En Qian,et al.  Noise reduction of hyperspectral imagery using hybrid spatial-spectral derivative-domain wavelet shrinkage , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Adrian J. Brown Spectral curve fitting for automatic hyperspectral data analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Matthias Seeger,et al.  Learning from Labeled and Unlabeled Data , 2010, Encyclopedia of Machine Learning.

[30]  Rama Chellappa,et al.  Entropy rate superpixel segmentation , 2011, CVPR 2011.

[31]  Claire Marais-Sicre,et al.  Effect of Training Class Label Noise on Classification Performances for Land Cover Mapping with Satellite Image Time Series , 2017, Remote. Sens..

[32]  Jun Li,et al.  Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Jelena Kovacevic,et al.  Supervised Hyperspectral Image Classification With Rejection , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[34]  Trac D. Tran,et al.  Hyperspectral Image Classification Using Dictionary-Based Sparse Representation , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Simon J. Hook,et al.  HYDROTHERMAL FORMATION OF CLAY-CARBONATE ALTERATION ASSEMBLAGES IN THE , 2010, 1402.1150.

[36]  Liangpei Zhang,et al.  Efficient Superpixel-Level Multitask Joint Sparse Representation for Hyperspectral Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Liangpei Zhang,et al.  Hyperspectral Image Denoising Employing a Spectral–Spatial Adaptive Total Variation Model , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[39]  Alan L. Yuille,et al.  Semi-Supervised Sparse Representation Based Classification for Face Recognition With Insufficient Labeled Samples , 2016, IEEE Transactions on Image Processing.

[40]  Junjun Jiang,et al.  Multiscale Union Regions Adaptive Sparse Representation for Hyperspectral Image Classification , 2017, Remote. Sens..

[41]  Andrés R. Masegosa,et al.  Bagging schemes on the presence of class noise in classification , 2012, Expert Syst. Appl..

[42]  Rocco A. Servedio,et al.  Boosting in the presence of noise , 2003, STOC '03.

[43]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[44]  Bernhard Schölkopf,et al.  Estimating a Kernel Fisher Discriminant in the Presence of Label Noise , 2001, ICML.

[45]  S. Dunagan,et al.  The MARTE VNIR imaging spectrometer experiment: design and analysis. , 2008, Astrobiology.

[46]  Jason Weston,et al.  Semisupervised Neural Networks for Efficient Hyperspectral Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[47]  Jun Huang,et al.  Hyperspectral image denoising with superpixel segmentation and low-rank representation , 2017, Inf. Sci..

[48]  Xuelong Li,et al.  Spectral-Spatial Constraint Hyperspectral Image Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[49]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[50]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[51]  Qian Du,et al.  Local Binary Patterns and Extreme Learning Machine for Hyperspectral Imagery Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[53]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[54]  Dacheng Tao,et al.  Classification with Noisy Labels by Importance Reweighting , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.