Survey of density based clustering algorithms and its variants

Clustering technique is a unsupervised machine learning technique in the domain of data mining. Many of the clustering techniques are inherently sensitive to the input parameters. Different clustering techniques works differently for different types of the input datasets. Among all different varieties of clustering techniques, DBSCAN is one of the most important clustering technique whose working principle based on the density estimation while forming the clusters of the input dataset points which is basically used for spatial datasets of random shapes and sizes. It also eliminates the noise during the clustering formation process with a worst case run-time complexity of O(nA2). DBSCAN technique also produces a bad result for varied density datasets. In this paper we have discussed about different density based clustering techniques along with DBSCAN, its variants and some of its modified algorithms with respect to their input parameters and running time complexities. Also we have presented the comparison analysis of all the different variants of DBSCAN algorithms over different benchmark datasets for computing various measures.

[1]  Dhruba Kumar Bhattacharyya,et al.  DDSC : A Density Differentiated Spatial Clustering Technique , 2008, J. Comput..

[2]  Chang-Dong Wang,et al.  Ensemble clustering using factor graph , 2016, Pattern Recognit..

[3]  Hajar Rehioui,et al.  The 7 th International Conference on Ambient Systems , Networks and Technologies ( ANT 2016 ) DENCLUE-IM : A New Approach for Big Data Clustering , 2016 .

[4]  M. Parimala,et al.  A Survey on Density Based Clustering Algorithms for Mining Large Spatial Databases , 2011 .

[5]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[6]  Huan Liu,et al.  '1+1>2': merging distance and density based clustering , 2001, Proceedings Seventh International Conference on Database Systems for Advanced Applications. DASFAA 2001.

[7]  Ping Zhu,et al.  Hierarchical Clustering Problems and Analysis of Fuzzy Proximity Relation on Granular Space , 2013, IEEE Transactions on Fuzzy Systems.

[8]  Alfredo Ferro,et al.  Enhancing density-based clustering: Parameter reduction and outlier detection , 2013, Inf. Syst..

[9]  Hans-Peter Kriegel,et al.  A distribution-based clustering algorithm for mining in large spatial databases , 1998, Proceedings 14th International Conference on Data Engineering.

[10]  Sohail Asghar,et al.  Critical analysis of DBSCAN variations , 2010, 2010 International Conference on Information and Emerging Technologies.

[11]  Christian Böhm,et al.  Density-based clustering using graphics processors , 2009, CIKM.

[12]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[13]  Jin Wang,et al.  G-DBSCAN: An Improved DBSCAN Clustering Method Based On Grid , 2014 .

[14]  Ganapati Panda,et al.  Design of computationally efficient density-based clustering algorithms , 2015, Data Knowl. Eng..

[15]  Pierre Hansen,et al.  Cluster analysis and mathematical programming , 1997, Math. Program..

[16]  Deniz Yuret,et al.  Locally Scaled Density Based Clustering , 2007, ICANNGA.

[17]  Wang Peng,et al.  Grid-based DBSCAN Algorithm with Referential Parameters , 2012 .

[18]  Peder C. Pedersen,et al.  A Markov random field approach to group-wise registration/mosaicing with application to ultrasound , 2015, Medical Image Anal..

[19]  Ying Wah Teh,et al.  MuDi-Stream: A multi density clustering algorithm for evolving data stream , 2016, J. Netw. Comput. Appl..

[20]  John P. Kerekes,et al.  An Adaptive Density-Based Model for Extracting Surface Returns From Photon-Counting Laser Altimeter Data , 2015, IEEE Geoscience and Remote Sensing Letters.

[21]  Le Minh Kieu,et al.  A modified Density-Based Scanning Algorithm with Noise for spatial travel pattern analysis from Smart Card AFC data , 2015 .

[22]  Greg Hamerly,et al.  Making k-means Even Faster , 2010, SDM.

[23]  Ashish Sharma,et al.  An Enhanced Density Based Spatial Clustering of Applications with Noise , 2009, 2009 IEEE International Advance Computing Conference.

[24]  Serge Guillaume,et al.  DENDIS: A new density-based sampling for clustering algorithm , 2016, Expert Syst. Appl..

[25]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[26]  Xianchao Zhang,et al.  Novel density-based and hierarchical density-based clustering algorithms for uncertain data , 2017, Neural Networks.

[27]  Songrit Maneewongvatana,et al.  U-DBSCAN : A density-based clustering algorithm for uncertain objects , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[28]  A. Rama Mohan Reddy,et al.  A fast DBSCAN clustering algorithm by accelerating neighbor searching using Groups method , 2016, Pattern Recognit..

[29]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[30]  Derya Birant,et al.  ST-DBSCAN: An algorithm for clustering spatial-temporal data , 2007, Data Knowl. Eng..

[31]  Younghoon Kim,et al.  DBCURE-MR: An efficient density-based clustering algorithm for large data using MapReduce , 2014, Inf. Syst..

[32]  P. Viswanath,et al.  Rough-DBSCAN: A fast hybrid density based clustering method for large data sets , 2009, Pattern Recognit. Lett..

[33]  Rafael Sachetto Oliveira,et al.  G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering , 2013, ICCS.

[34]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[35]  Jing Lu,et al.  Semi-supervised fuzzy clustering: A kernel-based approach , 2009, Knowl. Based Syst..