Particle swarm optimization algorithm with environmental factors for clustering analysis

In view of the model of bird flocking, particle swarm optimization (PSO) is a promising method to tackle optimization. This study benefits from the fact that the living environment affects behaviors of the bird flocking. That is, a well-defined environmental factor can improve the performance of PSO. Thus, the environment factor is taken into account to inspire the robust behaviors of bird flocking in depth. Specifically, it not only can carry out effective searching in limited flying space, but also can strengthen the social behaviors of individual. In the field of clustering, it can be regarded as a search optimization issue. Like the utilization of some useful information generated in its process, environment factor is considered and environment factor-inspired PSO (EPSO) is proposed in this study. To take full advantage of EPSO for solving issue of clustering, we divide its process into two stages. In the first stage, the environment factor is imported as a refined search technology to achieve the multi-local optimums with high probability. In the second stage, the manifold information, i.e., individual, swarm and environment factors, is considered to improve its global search capacity. Such an approach can effectively overcome the defect of PSO being prone to being trapped in local optima. To demonstrate the validity of our approach, EPSO, conventional PSO, genetic algorithm, $$K$$K-means, artificial bee colony and hybrid ABC are compared with benchmark document collections. The experiment results indicate that EPSO performs better than these state-of-the-art clustering algorithms in most cases.

[1]  Tao Chen,et al.  Model-based multidimensional clustering of categorical data , 2012, Artif. Intell..

[2]  Muriel Visani,et al.  A new interactive semi-supervised clustering model for large image database indexing , 2014, Pattern Recognit. Lett..

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  J. Wolfe PATTERN CLUSTERING BY MULTIVARIATE MIXTURE ANALYSIS. , 1970, Multivariate behavioral research.

[5]  Josep Carmona Projection approaches to process mining using region-based techniques , 2011, Data Mining and Knowledge Discovery.

[6]  Dervis Karaboga,et al.  A novel clustering approach: Artificial Bee Colony (ABC) algorithm , 2011, Appl. Soft Comput..

[7]  Andries Petrus Engelbrecht,et al.  Data clustering using particle swarm optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[8]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[9]  Leandro Nunes de Castro,et al.  Towards Improving Clustering Ants: An Adaptive Ant Clustering Algorithm , 2005, Informatica.

[10]  Özgür Ulusoy,et al.  Cluster searching strategies for collaborative recommendation systems , 2013, Inf. Process. Manag..

[11]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[12]  Xindong Wu,et al.  Automatic clustering using genetic algorithms , 2011, Appl. Math. Comput..

[13]  Jesus Boticario,et al.  Application of machine learning techniques to analyse student interactions and improve the collaboration process , 2011, Expert Syst. Appl..

[14]  R. Eberhart,et al.  Comparing inertia weights and constriction factors in particle swarm optimization , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[15]  Andries Petrus Engelbrecht,et al.  Niching ability of basic particle swarm optimization algorithms , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[16]  Chieh-Li Chen,et al.  Evolutionary algorithm to traveling salesman problems , 2012, Comput. Math. Appl..

[17]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[18]  Ponnuthurai N. Suganthan,et al.  A Distance-Based Locally Informed Particle Swarm Model for Multimodal Optimization , 2013, IEEE Transactions on Evolutionary Computation.

[19]  Shengyi Jiang,et al.  A generalized cluster centroid based classifier for text categorization , 2013, Inf. Process. Manag..

[20]  Vijay V. Raghavan,et al.  A clustering strategy based on a formalism of the reproductive process in natural systems , 1979, SIGIR 1979.

[21]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[22]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[23]  Ellen M. Voorhees,et al.  Implementing agglomerative hierarchic clustering algorithms for use in document retrieval , 1986, Inf. Process. Manag..

[24]  Nor Ashidi Mat Isa,et al.  Color image segmentation using histogram thresholding - Fuzzy C-means hybrid approach , 2011, Pattern Recognit..

[25]  Thomas E. Potok,et al.  Document clustering using particle swarm optimization , 2005, Proceedings 2005 IEEE Swarm Intelligence Symposium, 2005. SIS 2005..

[26]  George Karypis,et al.  Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering , 2004, Machine Learning.

[27]  Haoxiang Xia,et al.  A modified ant-based text clustering algorithm with semantic similarity measure , 2006 .

[28]  Xiaodong Li,et al.  Niching Without Niching Parameters: Particle Swarm Optimization Using a Ring Topology , 2010, IEEE Transactions on Evolutionary Computation.

[29]  Russell C. Eberhart,et al.  Evolutionary Computation Theory and Paradigms , 2001 .

[30]  Rajesh Kumar,et al.  A review on particle swarm optimization algorithms and their applications to data clustering , 2011, Artificial Intelligence Review.

[31]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[32]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[33]  Keinosuke Fukunaga,et al.  A Graph-Theoretic Approach to Nonparametric Cluster Analysis , 1976, IEEE Transactions on Computers.

[34]  Sudipto Guha,et al.  ROCK: A Robust Clustering Algorithm for Categorical Attributes , 2000, Inf. Syst..

[35]  Xiaohui Yan,et al.  A new approach for data clustering using hybrid artificial bee colony algorithm , 2012, Neurocomputing.

[36]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).