Minimax Bridgeness-Based Clustering for Hyperspectral Data

Hyperspectral (HS) imaging has been used extensively in remote sensing applications like agriculture, forestry, geology and marine science. HS pixel classification is an important task to help identify different classes of materials within a scene, such as different types of crops on a farm. However, this task is significantly hindered by the fact that HS pixels typically form high-dimensional clusters of arbitrary sizes and shapes in the feature space spanned by all spectral channels. This is even more of a challenge when ground truth data is difficult to obtain and when there is no reliable prior information about these clusters (e.g., number, typical shape, intrinsic dimensionality). In this letter, we present a new graph-based clustering approach for hyperspectral data mining that does not require ground truth data nor parameter tuning. It is based on the minimax distance, a measure of similarity between vertices on a graph. Using the silhouette index, we demonstrate that the minimax distance is more suitable to identify clusters in raw hyperspectral data than two other graph-based similarity measures: mutual proximity and shared nearest neighbours. We then introduce the minimax bridgeness-based clustering approach, and we demonstrate that it can discover clusters of interest in hyperspectral data better than comparable approaches.

[1]  Lutgarde M. C. Buydens,et al.  KNN-kernel density-based clustering for high-dimensional multivariate data , 2006, Comput. Stat. Data Anal..

[2]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[3]  Jing Wang,et al.  Independent component analysis-based dimensionality reduction with applications in hyperspectral image analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Seth Pettie,et al.  An optimal minimum spanning tree algorithm , 2000, JACM.

[5]  David W. Messinger,et al.  Spectral-Density-Based Graph Construction Techniques for Hyperspectral Image Analysis , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Gabriele Moser,et al.  Partially Supervised classification of remote sensing images through SVM-based probability density estimation , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Thomas L. Ainsworth,et al.  Exploiting manifold geometry in hyperspectral imagery , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Rong Zheng,et al.  RECOME: a New Density-Based Clustering Algorithm Using Relative KNN Kernel Density , 2016, Inf. Sci..

[9]  P. Chauhan,et al.  Satellite Ocean Colour: Current Status and Future Perspective , 2019, Front. Mar. Sci..

[10]  Claude Cariou,et al.  Learning or assessment of classification algorithms relying on biased ground truth data: what interest? , 2019 .

[11]  Hui Jiang,et al.  Multivariate Density Estimation by Bayesian Sequential Partitioning , 2013 .

[12]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[13]  Stefano Amaducci,et al.  Nitrogen Status Assessment for Variable Rate Fertilization in Maize through Hyperspectral Imagery , 2014, Remote. Sens..

[14]  Raul Morais,et al.  Hyperspectral Imaging: A Review on UAV-Based Sensors, Data Processing and Applications for Agriculture and Forestry , 2017, Remote. Sens..

[15]  David J. Ketchen,et al.  THE APPLICATION OF CLUSTER ANALYSIS IN STRATEGIC MANAGEMENT RESEARCH: AN ANALYSIS AND CRITIQUE , 1996 .

[16]  Lei Wang,et al.  Parameter-free Laplacian centrality peaks clustering , 2017, Pattern Recognit. Lett..

[17]  Xuelong Li,et al.  DSets-DBSCAN: A Parameter-Free Clustering Algorithm , 2016, IEEE Transactions on Image Processing.

[18]  Chein-I Chang,et al.  Constrained band selection for hyperspectral imagery , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Heiko Balzter,et al.  Mapping Tree Species in Coastal Portugal Using Statistically Segmented Principal Component Analysis and Other Methods , 2014, IEEE Sensors Journal.

[20]  L. K. Hansen,et al.  On Clustering fMRI Time Series , 1999, NeuroImage.

[21]  Chin-Teng Lin,et al.  A review of clustering techniques and developments , 2017, Neurocomputing.

[22]  Ian J. Yule,et al.  Assessing the performance of multiple spectral–spatial features of a hyperspectral image for classification of urban land cover classes using support vector machines and artificial neural network , 2017 .

[23]  Sulan Zhang,et al.  A Novel Hybrid Clustering Algorithm Based on Minimum Spanning Tree of Natural Core Points , 2019, IEEE Access.

[24]  Dirk P. Kroese,et al.  Kernel density estimation via diffusion , 2010, 1011.2602.

[25]  Eugenio Cesario,et al.  Top-Down Parameter-Free Clustering of High-Dimensional Categorical Data , 2007, IEEE Transactions on Knowledge and Data Engineering.

[26]  Ray A. Jarvis,et al.  Clustering Using a Similarity Measure Based on Shared Near Neighbors , 1973, IEEE Transactions on Computers.

[27]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Mauro Maggioni,et al.  Unsupervised Clustering and Active Learning of Hyperspectral Images With Nonlinear Diffusion , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Yifan Xu,et al.  Fast clustering using adaptive density peak detection , 2015, Statistical methods in medical research.

[30]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[31]  Markus Schedl,et al.  Local and global scaling reduce hubs in space , 2012, J. Mach. Learn. Res..

[32]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .