A Self Adaptive FCM Cluster Forests Based Feature Selection

Ensemble clustering combines multiple clustering methods to produce better results. In this context, we propose a new clustering ensemble method inspired by cluster forests (CF) and based on the Self-Adaptive Fuzzy C-Means (SAFCM) method. First, an unsupervised feature selection step is performed, based on building the best variables on simulated datasets. Next, we improve the CF algorithm by integrating SAFCM, which also finds the best number of groups K. Finally, a modified version of normalized cuts spectral clustering (Ncut) is applied to obtain the overall grouping. The proposed algorithm was tested on datasets from the UCI Machine Learning Repository. The experimental results indicate that the proposed method outperforms the other clustering algorithms considered in terms of clustering quality.
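To make the idea of letting fuzzy c-means choose K concrete, the following is a minimal sketch, not the authors' SAFCM implementation. The abstract does not state which validity criterion SAFCM uses, so the Xie-Beni index is used here as a stand-in. The farthest-point initialization, the helper names (`sq_dists`, `maximin_init`, `memberships`, `fuzzy_c_means`, `xie_beni`, `select_k`), the fuzzifier `m = 2`, and the synthetic three-blob data are all assumptions made for illustration.

```python
import numpy as np


def sq_dists(X, C):
    """Squared Euclidean distances between rows of X (n, d) and rows of C (k, d)."""
    return ((X[:, None, :] - C[None, :, :]) ** 2).sum(axis=2)


def maximin_init(X, k):
    """Deterministic farthest-point initialization (an assumption, not from the paper)."""
    idx = [0]
    d2 = sq_dists(X, X[idx])[:, 0]
    for _ in range(1, k):
        nxt = int(np.argmax(d2))  # point farthest from all chosen centers
        idx.append(nxt)
        d2 = np.minimum(d2, sq_dists(X, X[[nxt]])[:, 0])
    return X[idx].copy()


def memberships(X, centers, m=2.0):
    """Standard FCM membership update; each row of U sums to 1."""
    d2 = np.maximum(sq_dists(X, centers), 1e-12)  # guard against zero distance
    inv = d2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)


def fuzzy_c_means(X, k, m=2.0, n_iter=200, tol=1e-6):
    """Plain fuzzy c-means: alternate membership and center updates."""
    centers = maximin_init(X, k)
    for _ in range(n_iter):
        W = memberships(X, centers, m) ** m
        new_centers = (W.T @ X) / W.sum(axis=0)[:, None]
        shift = np.abs(new_centers - centers).max()
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships(X, centers, m)


def xie_beni(X, centers, U, m=2.0):
    """Xie-Beni index: fuzzy compactness over minimum center separation (lower is better)."""
    compactness = ((U ** m) * sq_dists(X, centers)).sum()
    c2 = sq_dists(centers, centers)
    np.fill_diagonal(c2, np.inf)  # ignore the distance of a center to itself
    return compactness / (X.shape[0] * c2.min())


def select_k(X, candidates=range(2, 6), m=2.0):
    """Run FCM for each candidate K and keep the K with the lowest Xie-Beni score."""
    scores = {}
    for k in candidates:
        centers, U = fuzzy_c_means(X, k, m=m)
        scores[k] = xie_beni(X, centers, U, m=m)
    return min(scores, key=scores.get), scores


# Synthetic demo data (hypothetical): three tight, well-separated 2-D blobs.
rng = np.random.default_rng(0)
blob_means = np.array([[0.0, 0.0], [10.0, 0.0], [5.0, 9.0]])
X = np.vstack([mu + 0.3 * rng.standard_normal((50, 2)) for mu in blob_means])

best_k, scores = select_k(X)
print("selected K:", best_k)
```

In the pipeline the abstract describes, a selection step of this kind would run inside each base clustering of the forest, and the resulting partitions would then be aggregated by the spectral (Ncut) step; neither the feature selection nor the aggregation is shown here.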
