A Comparative Study Of Document Clustering

Data mining or knowledge discovery means extracting the knowledge or data from large amount of knowledge or data and summarising it into useful information. Data mining software has many tools for analysing data and summarising it. One of the tool is weka .It contains many machine learning algorithms. In this paper we are studying various clustering algorithms for the documents by using weka. Clustering means collecting a set of documents into group called clusters so that the documents in the same cluster are more similar than to other clusters. Key-Words: Data mining algorithms, Weka tools, clustering algorithm .