Document Management with Ant Colony Optimization Metaheuristic: A Fuzzy Text Clustering Approach Using Pheromone Trails

This paper proposes an ant colony optimization (ACO) algorithm to deal with fuzzy document clustering problems. A specialized glossary and a thesaurus are used in order to extract features of the documents and to obtain a languageindependent vector representation that can be used to measure similarities between documents written in different languages. The pheromone trails obtained in the ACO process are used to determine membership values in a fuzzy clustering. To illustrate the behavior of the algorithm, it was applied to a corpus of bilingual documents in different areas of economic and management.

[1]  G. Raju,et al.  Fuzzy Clustering Methods in Data Mining: A Comparative Case Analysis , 2008, 2008 International Conference on Advanced Computer Theory and Engineering.

[2]  Dexian Zhang,et al.  A PSO-Based Web Document Classification Algorithm , 2007 .

[3]  Marco Dorigo,et al.  Swarm intelligence: from natural to artificial systems , 1999 .

[4]  L.N. de Castro,et al.  Text document classification using swarm intelligence , 2005, International Conference on Integration of Knowledge Intensive Multi-Agent Systems, 2005..

[5]  Baldo Faieta,et al.  Diversity and adaptation in populations of clustering ants , 1994 .

[6]  B. Kulkarni,et al.  An ant colony approach for clustering , 2004 .

[7]  Thomas A. Runkler Ant colony optimization of clustering models , 2005, Int. J. Intell. Syst..

[8]  Juan Julián Merelo Guervós,et al.  Parallel Problem Solving from Nature - PPSN IX: 9th International Conference, Reykjavik, Iceland, September 9-13, 2006, Proceedings , 2006, PPSN.

[9]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[10]  Hui Liu,et al.  An Ant-Based Fast Text Clustering Approach Using Pheromone , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[11]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[12]  Julia Handl,et al.  Improved Ant-Based Clustering and Sorting , 2002, PPSN.

[13]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[14]  Sun Jian-qing,et al.  A Novel SVM Decision Tree and its application to Face Detection , 2007 .