Sparse Hilbert Schmidt Independence Criterion and Surrogate-Kernel-Based Feature Selection for Hyperspectral Image Classification

Designing an effective criterion for selecting a subset of features is a challenging problem in hyperspectral image classification. In this paper, we develop a feature selection method that selects a subset of class-discriminant features. First, we propose a new class separability measure based on the surrogate kernel and the Hilbert-Schmidt independence criterion (HSIC) in the reproducing kernel Hilbert space. Second, we use this measure as the objective function and model feature selection as a continuous optimization problem within the LASSO framework. Combining the class separability measure with the LASSO model selects the subset of features that increases the class separability information while avoiding a computationally intensive subset search. Experiments on three hyperspectral data sets under different experimental settings show that the proposed method increases classification accuracy and outperforms state-of-the-art methods.
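
To make the dependence measure named above more concrete, the following is a minimal sketch of the standard biased empirical HSIC estimator, trace(KHLH)/(n-1)^2, computed between each spectral band and the class labels. It is an illustration only and not the method of this paper: it does not implement the surrogate kernel or the LASSO optimization step. The Gaussian kernel on band values, the delta kernel on labels, the bandwidth `sigma`, and all function names are assumptions introduced here.

```python
import numpy as np

def gaussian_kernel(x, sigma=1.0):
    """Gaussian (RBF) Gram matrix for samples given as a 1-D or 2-D array."""
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]                      # treat a single band as one column
    sq = np.sum(x ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (x @ x.T)
    d2 = np.maximum(d2, 0.0)                # guard against tiny negative values
    return np.exp(-d2 / (2.0 * sigma ** 2))

def delta_kernel(y):
    """Label kernel (assumed choice): 1 if two samples share a class, else 0."""
    y = np.asarray(y)
    return (y[:, None] == y[None, :]).astype(float)

def hsic(K, L):
    """Biased empirical HSIC: trace(K H L H) / (n - 1)^2."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n     # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def hsic_per_feature(X, y, sigma=1.0):
    """HSIC between each column (band) of X and the labels y."""
    L = delta_kernel(y)
    return np.array([hsic(gaussian_kernel(X[:, j], sigma), L)
                     for j in range(X.shape[1])])
```

A band whose values depend strongly on the class label should receive a larger score than a band of pure noise. Scoring bands one at a time ignores redundancy between bands, which is one reason a sparse joint formulation such as the one described in the abstract is attractive.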
