An Adaptive Method for Clustering by Fast Search-and-Find of Density Peaks: Adaptive-DP

Clustering by fast search and find of density peaks (DP) is a method in which density peaks are used to select the number of cluster centers. DP has two input parameters: 1) the cutoff distance and 2) the cluster centers. In addition, different methods are used in DP to measure the density of the underlying dataset. To overcome these limitations of DP, an Adaptive-DP method is proposed. In Adaptive-DP, heat diffusion is used to estimate density, the cutoff distance is simplified, and a novel method is used to discover the exact number of cluster centers adaptively. To validate the proposed method, we tested it on synthetic and real datasets and compared it with state-of-the-art clustering methods. The experimental results confirm the robustness and effectiveness of the proposed method.
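
For orientation, the baseline DP procedure and its two inputs can be sketched as below. This is a minimal illustration of the original cutoff-based DP, not of Adaptive-DP: the heat-diffusion density estimate and the adaptive choice of centers are not shown. The function name `density_peaks` and the parameters `cutoff` and `n_centers` are hypothetical names chosen for this sketch, and ranking candidates by the product of density and distance is one common way of picking centers, assumed here for simplicity.

```python
import numpy as np

def density_peaks(points, cutoff, n_centers):
    """Baseline DP sketch (hypothetical helper, not the Adaptive-DP method).

    Returns (labels, centers, rho, delta).
    """
    pts = np.asarray(points, dtype=float)
    n = len(pts)
    # Pairwise Euclidean distances between all points.
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    # Local density: number of other points closer than the cutoff distance.
    rho = (dist < cutoff).sum(axis=1) - 1
    # Visit points from densest to sparsest; ties are broken by index.
    order = np.argsort(-rho, kind="stable")
    delta = np.zeros(n)
    parent = np.full(n, -1)
    for rank, i in enumerate(order):
        if rank == 0:
            # The densest point has no denser neighbour; use its largest distance.
            delta[i] = dist[i].max()
            continue
        higher = order[:rank]
        j = higher[np.argmin(dist[i, higher])]
        delta[i] = dist[i, j]   # distance to the nearest denser point
        parent[i] = j
    # Assumption: centers are the points with the largest rho * delta.
    gamma = rho * delta
    centers = np.argsort(-gamma, kind="stable")[:n_centers]
    labels = np.full(n, -1)
    labels[centers] = np.arange(n_centers)
    # Each remaining point inherits the label of its nearest denser neighbour.
    for i in order:
        if labels[i] == -1:
            if parent[i] >= 0:
                labels[i] = labels[parent[i]]
            else:
                # Degenerate tie: fall back to the nearest chosen center.
                labels[i] = labels[centers[np.argmin(dist[i, centers])]]
    return labels, centers, rho, delta
```

Calling `density_peaks(points, cutoff=0.15, n_centers=2)` on two well-separated groups of points returns one label per group. Both `cutoff` and `n_centers` must be supplied by the user here, which is the dependence on input parameters that the abstract describes.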
