Abstract In the process of traditional hard clustering, the obtained data objects in clusters are certain. However, the objects in different classes do not have clear boundaries between in reality. A method of dealing with uncertain boundary objects is provided by Rough set theory. Therefore, combing two methods of rough set theory and k-means cluster the objects. At the same time, though the traditional k-means algorithm has powerful local search capability, it easily falls into local optimum. The genetic algorithm can get the global optimal solution, but its convergence is fast. So in the process of clustering, rough set theory and genetic algorithm are introduced. An efficient clustering method based on rough set theory and genetic algorithm is provided. Finally, the experimental results show that the proposed algorithm has the ability to adjust the results and obtain the higher accuracy rate.
[1]
Andreas Rudolph,et al.
Techniques of Cluster Algorithms in Data Mining
,
2002,
Data Mining and Knowledge Discovery.
[2]
Ting Liu,et al.
An Improved Genetic k-means Algorithm for Optimal Clustering
,
2006,
Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).
[3]
Pawan Lingras,et al.
Interval Set Clustering of Web Users with Rough K-Means
,
2004,
Journal of Intelligent Information Systems.
[4]
Zdzislaw Pawlak,et al.
Rough Set Theory and its Applications to Data Analysis
,
1998,
Cybern. Syst..
[5]
Yiyu Yao,et al.
A Survey on Rough Set Theory and Applications: A Survey on Rough Set Theory and Applications
,
2009
.
[6]
Jiawei Han,et al.
Data Mining: Concepts and Techniques
,
2000
.