A hybrid evolutionary computation approach with its application for optimizing text document clustering

We propose a novel hybrid evolutionary computation approach for optimizing text clustering. GA improves the initialization strategy of QPSO and yields a preliminary optimization. A new position update approach is proposed to normalize the search space of particles. This approach enhances the performance evaluated by both fitness and F-measure.

Quantum-behaved particle swarm optimization (QPSO) is a promising global optimization algorithm inspired by concepts from quantum mechanics and particle swarm optimization (PSO). Because QPSO initializes its particles randomly, this blind initialization limits its capacity for complicated optimization problems. In this paper, we use a hybrid evolutionary computation approach to resolve this issue. Specifically, the robust global search ability of the genetic algorithm (GA) improves the initialization strategy of the particles in QPSO. Moreover, the original position update of QPSO places no upper bound on a particle's position, so it may generate abrupt feature values and overstep the boundary of the search space, which degrades its search for the optimum. In this study, a new position update approach is proposed to normalize the search range of the particles to a proper space, which raises the probability of finding the optimal solution. Since clustering can be regarded as a search for cluster centers by an evolutionary optimization approach, the evolution of chromosomes or particles that encode the centers simulates the process of solving the clustering problem. To evaluate the clustering performance of our approach, we conduct experiments on four subsets of the standard Reuters-21578 and 20 Newsgroups datasets. Experimental results show that our method performs better than state-of-the-art clustering algorithms in terms of fitness and F-measure.
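The two ideas above, a bounded QPSO position update and a particle that encodes cluster centers, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The update follows the commonly published QPSO rule (a local attractor plus a step scaled by the distance to the mean best position), and the bounding rule is assumed to be plain clipping to `[lower, upper]`, because the abstract does not state the exact rule. The fitness is likewise an assumption: the mean Euclidean distance from each document vector to its nearest center. The names `qpso_step`, `clustering_fitness`, `beta`, `lower`, and `upper` are hypothetical.

```python
import math
import random


def qpso_step(positions, pbest, gbest, beta, lower, upper, rng):
    """One QPSO position update, with every coordinate clipped to [lower, upper]."""
    n, dim = len(positions), len(gbest)
    # Mean best position: per-dimension average of all personal bests.
    mbest = [sum(p[d] for p in pbest) / n for d in range(dim)]
    new_positions = []
    for x, pb in zip(positions, pbest):
        new_x = []
        for d in range(dim):
            phi = rng.random()
            u = 1.0 - rng.random()  # lies in (0, 1], so log(1/u) is finite
            # Local attractor: random point between personal and global best.
            attractor = phi * pb[d] + (1.0 - phi) * gbest[d]
            step = beta * abs(mbest[d] - x[d]) * math.log(1.0 / u)
            candidate = attractor + step if rng.random() < 0.5 else attractor - step
            # Assumed bounding rule (not taken from the paper): simple clipping.
            new_x.append(min(max(candidate, lower), upper))
        new_positions.append(new_x)
    return new_positions


def clustering_fitness(particle, docs, k):
    """Mean distance from each document to its nearest encoded center (lower is better)."""
    dim = len(docs[0])
    # A particle is a flat list of k centers, each with `dim` coordinates.
    centers = [particle[i * dim:(i + 1) * dim] for i in range(k)]
    return sum(min(math.dist(doc, c) for c in centers) for doc in docs) / len(docs)
```

In this sketch a GA stage would supply the initial `positions` in place of random values, and `clustering_fitness` would score each particle when the personal and global bests are refreshed between calls to `qpso_step`.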
