Simplifying and improving swarm-based clustering

Swarm-based clustering (SBC) has attracted researchers for its ability to find clusters in datasets automatically, without requiring users to specify the number of clusters. While conventional wisdom suggests that swarm intelligence contributes to this ability, recent work has offered alternative explanations based on the underlying stochastic heuristics that are actually at work. This paper shows that the working principles of several recent SBC methods can be explained using a stochastic clustering framework that is unrelated to swarm intelligence. The framework is theoretically simple and easy to implement in practice. We also incorporate a mechanism to calibrate a key parameter so as to enhance clustering performance. Despite the simplicity of the enhanced algorithm, experimental results show that it outperforms two recent SBC methods in both clustering accuracy and efficiency on the majority of the datasets used in this study.
