A Novel Ranking-Based Clustering Approach for Hyperspectral Band Selection

Through imaging the same spatial area by hyperspectral sensors at different spectral wavelengths simultaneously, the acquired hyperspectral imagery often contains hundreds of band images, which provide the possibility to accurately analyze and identify a ground object. However, due to the difficulty of obtaining sufficient labeled training samples in practice, the high number of spectral bands unavoidably leads to the problem of a “dimensionality disaster” (also called the Hughes phenomenon), and dimensionality reduction should be applied. Concerning band (or feature) selection, conventional methods choose the representative bands by ranking the bands with defined metrics (such as non-Gaussianity) or by formulating the band selection problem as a clustering procedure. Because of the different but complementary advantages of the two kinds of methods, it can be beneficial to use both methods together to accomplish the band selection task. Recently, a fast density-peak-based clustering (FDPC) algorithm has been proposed. Based on the computation of the local density and the intracluster distance of each point, the product of the two factors is sorted in decreasing order, and cluster centers are recognized as points with anomalously large values; hence, the FDPC algorithm can be considered a ranking-based clustering method. In this paper, the FDPC algorithm has been enhanced to make it suitable for hyperspectral band selection. First, the ranking score of each band is computed by weighting the normalized local density and the intracluster distance rather than equally taking them into account. Second, an exponential-based learning rule is employed to adjust the cutoff threshold for a different number of selected bands, where it is fixed in the FDPC. The proposed approach is thus named the enhanced FDPC (E-FDPC). Furthermore, an effective strategy, which is called the isolated-point-stopping criterion, is developed to automatically determine the appropriate number of bands to be selected. That is, the clustering process will be stopped by the emergence of an isolated point (the only point in one cluster). Experimental results on three real hyperspectral data demonstrate that the bands selected by our E-FDPC approach could achieve higher classification accuracy than the FDPC and other state-of-the-art band selection techniques, whereas the isolated-point-stopping criterion is a reasonable way to determine the preferable number of bands to be selected.

[1]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[2]  Chein-I. Chang Hyperspectral Data Exploitation: Theory and Applications , 2007 .

[3]  Jon Atli Benediktsson,et al.  Kernel Principal Component Analysis for the Classification of Hyperspectral Remote Sensing Data over Urban Areas , 2009, EURASIP J. Adv. Signal Process..

[4]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[5]  Luis O. Jimenez-Rodriguez,et al.  Unsupervised Linear Feature-Extraction Methods and Their Effects in the Classification of High-Dimensional Data , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[6]  A. Agarwal,et al.  Efficient Hierarchical-PCA Dimension Reduction for Hyperspectral Imagery , 2007, 2007 IEEE International Symposium on Signal Processing and Information Technology.

[7]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[8]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[9]  Jean-François Cardoso,et al.  Dependence, Correlation and Gaussianity in Independent Component Analysis , 2003, J. Mach. Learn. Res..

[10]  Chein-I Chang,et al.  Unsupervised hyperspectral image analysis with projection pursuit , 2000, IEEE Trans. Geosci. Remote. Sens..

[11]  LinLin Shen,et al.  Unsupervised Band Selection for Hyperspectral Imagery Classification Without Manual Band Removal , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Adolfo Martínez Usó,et al.  Clustering-Based Hyperspectral Band Selection Using Information Measures , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Jing Wang,et al.  Independent component analysis-based dimensionality reduction with applications in hyperspectral image analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[15]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[16]  James E. Fowler,et al.  Locality-Preserving Dimensionality Reduction and Classification for Hyperspectral Image Analysis , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[17]  J. Douglas Carroll,et al.  Two-Way Multidimensional Scaling: A Review , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[18]  Chein-I Chang,et al.  Constrained band selection for hyperspectral imagery , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Chein-I Chang,et al.  Hyperspectral Data Exploitation , 2007 .

[20]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[21]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[22]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[23]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[24]  Melba M. Crawford,et al.  Manifold-Learning-Based Feature Extraction for Classification of Hyperspectral Data: A Review of Advances in Manifold Learning , 2014, IEEE Signal Processing Magazine.

[25]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[26]  G. McLachlan,et al.  The EM Algorithm and Extensions: Second Edition , 2008 .

[27]  Daoqiang Zhang,et al.  Semisupervised Dimensionality Reduction With Pairwise Constraints for Hyperspectral Image Classification , 2011, IEEE Geoscience and Remote Sensing Letters.

[28]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[29]  Simon Fong,et al.  DBSCAN: Past, present and future , 2014, The Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT 2014).

[30]  David G. Stork,et al.  Pattern Classification , 1973 .

[31]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[32]  Peng Zhang,et al.  Dynamic Learning of SMLR for Feature Selection and Classification of Hyperspectral Data , 2008, IEEE Geoscience and Remote Sensing Letters.

[33]  Qian Du,et al.  Similarity-Based Unsupervised Band Selection for Hyperspectral Image Analysis , 2008, IEEE Geoscience and Remote Sensing Letters.

[34]  Qian Du,et al.  A joint band prioritization and band-decorrelation approach to band selection for hyperspectral image classification , 1999, IEEE Trans. Geosci. Remote. Sens..

[35]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[36]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[37]  R. Younes,et al.  Dimensionality reduction on hyperspectral images: A comparative review based on artificial datas , 2011, 2011 4th International Congress on Image and Signal Processing.

[38]  Lorenzo Bruzzone,et al.  A new search algorithm for feature selection in hyperspectral remote sensing images , 2001, IEEE Trans. Geosci. Remote. Sens..

[39]  Jon Atli Benediktsson,et al.  Recent Advances in Techniques for Hyperspectral Image Processing , 2009 .

[40]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Joydeep Ghosh,et al.  Best-bases feature extraction algorithms for classification of hyperspectral data , 2001, IEEE Trans. Geosci. Remote. Sens..

[42]  Gary A. Shaw,et al.  Hyperspectral Image Processing for Automatic Target Detection Applications , 2003 .

[43]  Valero Laparra,et al.  Kernel Structural SIMIlarity on hyperspectral images , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[44]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[45]  Lei Zhang,et al.  Band-Subset-Based Clustering and Fusion for Hyperspectral Imagery Classification , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[46]  Robert I. Damper,et al.  Band Selection for Hyperspectral Image Classification Using Mutual Information , 2006, IEEE Geoscience and Remote Sensing Letters.

[47]  Caroline Fossati,et al.  Improvement of Classification for Hyperspectral Images Based on Tensor Modeling , 2010, IEEE Geoscience and Remote Sensing Letters.

[48]  Zhen Ji,et al.  Band Selection for Hyperspectral Imagery Using Affinity Propagation , 2008, 2008 Digital Image Computing: Techniques and Applications.

[49]  John A. Richards,et al.  Remote Sensing Digital Image Analysis: An Introduction , 1999 .

[50]  Delbert Dueck,et al.  Affinity Propagation: Clustering Data by Passing Messages , 2009 .

[51]  Xiangrong Zhang,et al.  Semisupervised Dimensionality Reduction of Hyperspectral Images via Local Scaling Cut Criterion , 2013, IEEE Geoscience and Remote Sensing Letters.

[52]  Chein-I. Chang Hyperspectral Imaging: Techniques for Spectral Detection and Classification , 2003 .