DOCUMENT MANAGEMENT USING CLUSTERING ALGORITHMS
Document management systems are complex systems that offer services such as storage, versioning, metadata, and security, as well as indexing and retrieval capabilities. Large numbers of documents could be automatically grouped into classes of documents that contain similar information. Therefore, we propose using clustering methods to group the documents. Clustering is an important process in text mining, used for grouping documents based on their contents in order to extract knowledge. In this paper we present some requirements for clustering algorithms in a document management system.