Locality mutual clustering for document retrieval

Document retrieval is aimed at searching relevant documents in response to answer user query. To do this task, algorithms of document clustering play an important role. These algorithms are often based on frequency computation of key-phrases in both query and document, and focus on locality. It is dealt with this paper an algorithm based on locality mutual clustering is proposed to cluster documents and to find relevance documents in answer to user query. This proposed algorithm has been used for searching scientific papers in our institutions.

[1]  James Bailey,et al.  Document clustering of scientific texts using citation contexts , 2010, Information Retrieval.

[2]  Muhammad Rafi,et al.  Document Clustering based on Topic Maps , 2010, ArXiv.

[3]  James Blustein,et al.  Enhancing semi-supervised document clustering with feature supervision , 2012, SAC '12.

[4]  Tobun Dorbin Ng,et al.  Demonstration of hierarchical document clustering of digital library retrieval results , 2001, JCDL '01.

[5]  Jiawei Han,et al.  Document clustering using locality preserving indexing , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Naftali Tishby,et al.  Document clustering using word clusters via the information bottleneck method , 2000, SIGIR '00.

[7]  Jian Pei,et al.  Clustering in applications with multiple data sources - A mutual subspace clustering approach , 2012, Neurocomputing.

[8]  Charu C. Aggarwal,et al.  A Survey of Text Clustering Algorithms , 2012, Mining Text Data.

[9]  Christos Bouras,et al.  A clustering technique for news articles using WordNet , 2012, Knowl. Based Syst..

[10]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[11]  Soon Myoung Chung,et al.  Text document clustering based on neighbors , 2009, Data Knowl. Eng..

[12]  Patrick Pantel,et al.  Document clustering with committees , 2002, SIGIR '02.

[13]  Chuanyi Fu,et al.  LTPI : A Spectral Clustering Method Based on Local Topology Preserving Indexing and Its Application for Document Clustering , 2013 .