A Brief Review of Metaheuristics for Document or Text Clustering

Document clustering, which involves concepts from the fields of information retrieval, automatic topic extraction, natural language processing, and machine learning, is one of the most popular research areas in data mining. Due to the large amount of information in electronic form, fast and high-quality cluster analysis plays an important role in helping users to effectively navigate, summarize and organise this information for useful data. There are a number of techniques in the literature, which efficiently provide solutions for document clustering. However, during the last decade, researchers started to use metaheuristic algorithms for the document clustering problem because of the limitations of the existing traditional clustering algorithms. In this chapter, the authors will give a brief review of various research papers that present the area of document or text clustering approaches with different metaheuristic algorithms.

[1]  Hany Atef Kelleny,et al.  Adaptation of Cuckoo search for Documents Clustering , 2014 .

[2]  Christian Blum,et al.  Metaheuristics in combinatorial optimization: Overview and conceptual comparison , 2003, CSUR.

[3]  Gilbert Laporte,et al.  Metaheuristics: A bibliography , 1996, Ann. Oper. Res..

[4]  Thomas E. Potok,et al.  Document clustering using particle swarm optimization , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[5]  Mehrnoush Shamsfard,et al.  An improved bee colony optimization algorithm with an application to document clustering , 2015, Neurocomputing.

[6]  Tomasz Tarczynski Document Clustering - Concepts, Metrics and Algorithms , 2011 .

[7]  Franz Rothlauf,et al.  Design of Modern Heuristics , 2011, Natural Computing Series.

[8]  Thomas E. Potok,et al.  A flocking based algorithm for document clustering analysis , 2006, J. Syst. Archit..

[9]  Luis M. de Campos,et al.  An information retrieval model based on simple Bayesian networks , 2003, Int. J. Intell. Syst..

[10]  Veenu Mangat,et al.  Evaluation of text document clustering approach based on particle swarm optimization , 2013, Central European Journal of Computer Science.

[11]  Fred W. Glover,et al.  Future paths for integer programming and links to artificial intelligence , 1986, Comput. Oper. Res..

[12]  Sunita Bisht,et al.  Document Clustering: A Review , 2013 .

[13]  Ł. Machnik A document clustering method based on ant algorithms , 2007 .

[14]  Yoojin Chung,et al.  An Evolutionary Approach for Document Clustering , 2013 .

[15]  Hassan Abolhassani,et al.  Harmony K-means algorithm for document clustering , 2009, Data Mining and Knowledge Discovery.

[16]  Yanping Lu,et al.  Automatic text clustering via particle swarm optimization , 2012 .

[17]  K. Premalatha,et al.  A Literature Review on Document Clustering , 2010 .

[18]  Neepa Shah,et al.  Document Clustering: A Detailed Review , 2012 .

[19]  Moe Moe Zaw,et al.  Web Document Clustering Using Cuckoo Search Clustering Algorithm based on Levy Flight , 2013 .

[20]  K. Premalatha,et al.  Hybrid PSO and GA models for Document Clustering , 2010 .

[21]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.