Ant-Based Document Clustering and Visualization

This paper discusses the document clustering and visualization process: analyzing documents index, clustering document, and visualizing exploration. It focuses on ant-based clustering algorithm and some significant improvements. Clusterings are formed on the plane by ants walking, picking up or dropping down projected document vectors with different probability. It is shown that the similar documents appear in spatial proximity, whereas unrelated documents are clearly separated in visual space.

[1]  Johan Himberg,et al.  A SOM based cluster visualization and its application for false coloring , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[2]  Sinan Salman,et al.  DIVA: a visualization system for exploring document databases for technology forecasting , 2002 .

[3]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[4]  Julia Handl,et al.  Improved Ant-Based Clustering and Sorting , 2002, PPSN.

[5]  Kwong-Sak Leung,et al.  Expanding Self-Organizing Map for data visualization and cluster analysis , 2004, Inf. Sci..

[6]  George K. Knopf,et al.  Visualization of randomly ordered numeric data sets using spherical self-organizing feature maps , 2003, Comput. Graph..

[7]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[8]  Baldo Faieta,et al.  Diversity and adaptation in populations of clustering ants , 1994 .

[9]  Mohamed S. Kamel,et al.  Clustering ensemble using swarm intelligence , 2003, Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS'03 (Cat. No.03EX706).

[10]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[11]  Wu Bin,et al.  CSIM: a document clustering algorithm based on swarm intelligence , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).