A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory

The density peaks clustering (DPC) algorithm is not sensitive to the recognition of halo nodes. The halo nodes at the edge of the density peaks clustering algorithm has a lower local density. The outliers are distributed in halo nodes. The novel halo identification method based on density peaks clustering algorithm utilize the advantage of DBSCAN algorithm to quickly identify outliers, which improved the sensitivity to halo nodes. However, the identified halo nodes cannot be effectively assigned to adjacent clusters. Therefore, this paper will use K-nearest neighbor (KNN) algorithm to classify the identified halo nodes. K-nearest neighbor is the simplest and most efficient classification method. The KNN algorithm has the advantages of high accuracy, insensitivity to outliers and no input hypothesis data. Hence, we proposed a novel density peaks clustering halo node assignment algorithm based on K-nearest neighbor theory (KNN-HDPC). KNN-HDPC can grasp the internal relations between outliers and cluster nodes more deeply, so as to dig out the deeper relations between nodes. Experimental results demonstrate that the proposed algorithm can effectively cluster and reclassify a large number of complex data. We can quickly dig out the potential relationship between noise points and cluster points. The improved algorithm has higher clustering accuracy than the original DPC algorithm, and essentially has more robust clustering results.

[1]  Parham Moradi,et al.  Dynamic graph-based label propagation for density peaks clustering , 2019, Expert Syst. Appl..

[2]  Lei Wang,et al.  Identifying cluster centroids from decision graph automatically using a statistical outlier detection method , 2019, Neurocomputing.

[3]  Yu Jin,et al.  An Optimal Density Peak Algorithm Based on Data Field and Information Entropy , 2017, DMCIT '17.

[4]  Keqin Li,et al.  DPC-LG: Density peaks clustering based on logistic distribution and gravitation , 2019, Physica A: Statistical Mechanics and its Applications.

[5]  Raouf Boutaba,et al.  Machine Learning for Cognitive Network Management , 2018, IEEE Communications Magazine.

[6]  Chongfu Zhang,et al.  Developed Density Peak Clustering With Support Vector Data Description for Access Network Intrusion Detection , 2018, IEEE Access.

[7]  Victor C. M. Leung,et al.  Clustering Approach Based on Mini Batch Kmeans for Intrusion Detection System Over Big Data , 2018, IEEE Access.

[8]  Bo Wang,et al.  Effectively clustering by finding density backbone based-on kNN , 2016, Pattern Recognit..

[9]  Yongchuan Tang,et al.  Comparative density peaks clustering , 2018, Expert Syst. Appl..

[10]  Peter W. Tse,et al.  Clustering by defining and merging candidates of cluster centers via independence and affinity , 2018, Neurocomputing.

[11]  Wei Pang,et al.  FREDPC: A Feasible Residual Error-Based Density Peak Clustering Algorithm With the Fragment Merging Strategy , 2019, IEEE Access.

[12]  Hongjie Jia,et al.  Study on density peaks clustering based on k-nearest neighbors and principal component analysis , 2016, Knowl. Based Syst..

[13]  Xiao Xu,et al.  An improved density peaks clustering algorithm with fast finding cluster centers , 2018, Knowl. Based Syst..

[14]  Hui Li,et al.  Patient Cluster Divergence Based Healthcare Insurance Fraudster Detection , 2019, IEEE Access.

[15]  Guoyin Wang,et al.  DenPEHC: Density peak based efficient hierarchical clustering , 2016, Inf. Sci..

[16]  Rudolf Kruse,et al.  Variable density based clustering , 2016, 2016 IEEE Symposium Series on Computational Intelligence (SSCI).

[17]  Hong Rong,et al.  Efficient k-Nearest Neighbor Classification Over Semantically Secure Hybrid Encrypted Cloud Database , 2018, IEEE Access.

[18]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[19]  Bo Wu,et al.  A Fast Density and Grid Based Clustering Method for Data With Arbitrary Shapes and Noise , 2017, IEEE Transactions on Industrial Informatics.

[20]  Xiao Xu,et al.  Density peaks clustering using geodesic distances , 2017, International Journal of Machine Learning and Cybernetics.

[21]  Ankush Sharma,et al.  KNN-DBSCAN: Using k-nearest neighbor information for parameter-free density based clustering , 2017, 2017 International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT).

[22]  Yu Xue,et al.  A robust density peaks clustering algorithm using fuzzy neighborhood , 2017, International Journal of Machine Learning and Cybernetics.

[23]  Yan Yu,et al.  A Real-Time Big Data Gathering Algorithm Based on Indoor Wireless Sensor Networks for Risk Analysis of Industrial Operations , 2016, IEEE Transactions on Industrial Informatics.

[24]  Zhengming Ma,et al.  Adaptive density peak clustering based on K-nearest neighbors with aggregating strategy , 2017, Knowl. Based Syst..

[25]  Qingsheng Zhu,et al.  A Novel Cluster Validity Index Based on Local Cores , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Lin Sun,et al.  Joint neighborhood entropy-based gene selection method with fisher score for tumor classification , 2018, Applied Intelligence.

[27]  Hong He,et al.  Automatic pattern recognition of ECG signals using entropy-based adaptive dimensionality reduction and clustering , 2017, Appl. Soft Comput..

[28]  Wei Zhou,et al.  HaloDPC: An Improved Recognition Method on Halo Node for Density Peak Clustering Algorithm , 2019, Int. J. Pattern Recognit. Artif. Intell..

[29]  Hong Yan,et al.  Adaptive clustering algorithm based on kNN and density , 2018, Pattern Recognit. Lett..

[30]  Chunyan Miao,et al.  REDPC: A residual error-based density peak clustering algorithm , 2019, Neurocomputing.

[31]  Rongfang Bie,et al.  Clustering by fast search and find of density peaks via heat diffusion , 2016, Neurocomputing.

[32]  Xiao Xu,et al.  DPCG: an efficient density peaks clustering algorithm based on grid , 2018, Int. J. Mach. Learn. Cybern..

[33]  Weixin Xie,et al.  Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors , 2016, Inf. Sci..