Parameter-free Laplacian centrality peaks clustering

Abstract As an important tool of data mining, clustering analysis can measure similarity between different data and classify them. It is widely applied in many fields such as pattern recognition, economics and biology. In this paper, we propose a new clustering algorithm. First, original unclassified dataset is converted into a weighted complete graph in which a node represents a data point and distance between two data points is used as weight of the edge between the corresponding two nodes. Second, local importance of each node in the network is calculated and evaluated by Laplacian centrality. The cluster center has higher Laplacian centrality than surrounding neighbor nodes and relatively large distance from nodes with higher Laplacian centralities. The new algorithm is a true parameter-free clustering method. It can automatically classify the dataset without any priori parameters. In this paper, the new algorithm was compared with 8 well-known clustering algorithms in 7 real datasets. Results show that the proposed algorithm has good clustering effect.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Brendan J. Frey,et al.  Non-metric affinity propagation for unsupervised image categorization , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  S. White,et al.  A Universal Density Profile from Hierarchical Clustering , 1996, astro-ph/9611107.

[4]  Xiao Zhang,et al.  Localization and centrality in networks , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Bing He,et al.  Mining diversity subgraph in multidisciplinary scientific collaboration networks: A meso perspective , 2013, J. Informetrics.

[6]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Giulio Cimini,et al.  Estimating topological properties of weighted networks from limited information , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Konstantin Avrachenkov,et al.  Cooperative Game Theory Approaches for Network Partitioning , 2017, COCOON.

[9]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Jiwen Lu,et al.  Automatic Subspace Learning via Principal Coefficients Embedding , 2014, IEEE Transactions on Cybernetics.

[11]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[12]  Jian Yu,et al.  A parameter-free community detection method based on centrality and dispersion of nodes in complex networks , 2015 .

[13]  Hans-Peter Kriegel,et al.  Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering , 2009, TKDD.

[14]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[15]  Zhou Wang,et al.  Complex Wavelet Structural Similarity: A New Image Similarity Index , 2009, IEEE Transactions on Image Processing.

[16]  Rong Luo,et al.  A novel centrality method for weighted networks based on the Kirchhoff polynomial , 2015, Pattern Recognit. Lett..

[17]  Shuliang Wang,et al.  Clustering by Fast Search and Find of Density Peaks with Data Field , 2016 .

[18]  Cun-Quan Zhang,et al.  Laplacian centrality: A new centrality measure for weighted networks , 2012, Inf. Sci..

[19]  Mahdi Mohammadzadeh,et al.  Knowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services , 2016, Iranian journal of pharmaceutical research : IJPR.

[20]  Hae-Sang Park,et al.  A simple and fast algorithm for K-medoids clustering , 2009, Expert Syst. Appl..

[21]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[22]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[24]  Weixin Xie,et al.  Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors , 2016, Inf. Sci..

[25]  Jiwen Lu,et al.  Learning Compact Binary Face Descriptor for Face Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Pablo San Segundo,et al.  Robust Global Feature Based Data Association With a Sparse Bit Optimized Maximum Clique Algorithm , 2013, IEEE Transactions on Robotics.

[27]  Dit-Yan Yeung,et al.  Robust path-based spectral clustering , 2008, Pattern Recognit..

[28]  Jan Baumbach,et al.  Comparing the performance of biomedical clustering methods , 2015, Nature Methods.

[29]  Tian Zhang,et al.  BIRCH: A New Data Clustering Algorithm and Its Applications , 1997, Data Mining and Knowledge Discovery.

[30]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[31]  Zhang Yi,et al.  Constructing the L2-Graph for Robust Subspace Learning and Subspace Clustering , 2012, IEEE Transactions on Cybernetics.

[32]  Huaguang Zhang,et al.  A fuzzy support vector machine algorithm for classification based on a novel PIM fuzzy clustering method , 2014, Neurocomputing.

[33]  Feiping Nie,et al.  The Constrained Laplacian Rank Algorithm for Graph-Based Clustering , 2016, AAAI.