Manifold Density Peaks Clustering Algorithm

Cluster analysis is a popular technique in statistics and computer science with the objective of grouping similar observations in relatively distinct groups generally known as clusters. In this paper we propose an approach called Manifold Density Peaks Clustering to improve the basic density peaks clustering. It mainly concerns three aspects. First, geodesic distance is adopted to calculate manifold distance matrix. Then, cluster centers are identified automatically by the number of clusters, which improves the inconvenient way of identifying cluster centers by dragging a rectangle manually. Moreover, isometric mapping is introduced to map high dimensional datasets into lower dimension, so as to achieve the goal of dimensionality reduction. Experimental results indicate that the proposed algorithm has achieved the expected results on both low and high dimensional datasets.

[1]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  Pan Xiao,et al.  Density Sensitive Based Multi-Agent Evolutionary Clustering Algorithm , 2010 .

[4]  Jiawei Han,et al.  Regularized locality preserving indexing via spectral regression , 2007, CIKM '07.

[5]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[6]  Amit Banerjee,et al.  On evolving neighborhood parameters for fuzzy density clustering , 2013, 2013 IEEE Congress on Evolutionary Computation.

[7]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[8]  Shen Hong,et al.  Fuzzy Kernel Clustering with Outliers , 2004 .

[9]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[10]  Jiawei Han,et al.  Orthogonal Laplacianfaces for Face Recognition , 2006, IEEE Transactions on Image Processing.

[11]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[12]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[13]  Ganapati Panda,et al.  Design of computationally efficient density-based clustering algorithms , 2015, Data Knowl. Eng..

[14]  Pooja Batra Nagpal,et al.  Comparative Study of Density based Clustering Algorithms , 2011 .

[15]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[16]  Ben Shneiderman,et al.  Interactively Exploring Hierarchical Clustering Results , 2002, Computer.

[17]  Soumaya Louhichi,et al.  A density based algorithm for discovering clusters with varied density , 2014, 2014 World Congress on Computer Applications and Information Systems (WCCAIS).

[18]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[19]  Won Suk Lee,et al.  Statistical grid-based clustering over data streams , 2004, SGMD.

[20]  Sun Ji,et al.  Clustering Algorithms Research , 2008 .

[21]  J. Jost Riemannian geometry and geometric analysis , 1995 .

[22]  Ji-Gui Sun,et al.  Clustering Algorithms Research , 2008 .