Documents Clustering techniques
Document clustering is a technique in which relationships between sets of documents are discovered automatically and the documents are divided into groups of similar items. The groups created during clustering should be characterized by a high degree of similarity between elements of the same group and a low degree of similarity between elements of different groups. Organizing documents this way lets the user review content quickly and makes it easier to retrieve information of particular interest. This article describes the most popular document clustering techniques and the issues associated with them, such as the representation of text documents and measures of document similarity. In addition, the author introduces his own concept of a new, effective document clustering method based on the Ant System.
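The two issues named above, document representation and similarity measurement, can be illustrated with a minimal sketch. It assumes a bag-of-words term-frequency representation and cosine similarity, which are common choices but are not confirmed here as the ones the article uses. The sample documents and the function names `term_frequencies` and `cosine_similarity` are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def term_frequencies(text):
    """Represent a document as a bag-of-words term-frequency vector (assumed representation)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors (assumed measure)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical documents: the first two share a topic, the third does not.
docs = [
    "ant colony optimization finds short paths",
    "ant colony algorithms search for short paths",
    "stock markets fell sharply on friday",
]
vectors = [term_frequencies(d) for d in docs]

sim_same = cosine_similarity(vectors[0], vectors[1])  # same topic
sim_diff = cosine_similarity(vectors[0], vectors[2])  # different topics

print(f"same topic: {sim_same:.3f}, different topics: {sim_diff:.3f}")
```

A good clustering would place the first two documents in one group and the third in another, since the similarity within the group is high and the similarity across groups is low.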