Feature selection using feature ranking, correlation analysis and chaotic binary particle swarm optimization

In this paper, we propose a multi-stage feature selection algorithm, which focuses on the reduction of redundant features and the improvement of classification performance using feature ranking (FR), correlation analysis (CA) and chaotic binary particle swarm optimization (CBPSO). In the first stage, with the purpose of selecting the most effective features for classification, FR is introduced to select the top-ranked features according to the classification accuracies. In the second stage, CA is used to measure the correlation among the selected top-ranked features for reducing redundant features. In the third stage, in order to further eliminate redundant features and improve the classification performances, CBPSO is adopted to search the optimal feature subset. Ultimately, feature selection can be completed by using only some top-ranked features with less redundancy for classification. Support vector machine (SVM) with n-fold cross-validation is adopted to assess the classification performances on six datasets in the experiments. Experimental results show that the proposed algorithm can achieve better performance in terms of classification accuracy and the number of features than benchmark algorithms.

[1]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[2]  Hao Chen,et al.  A Heuristic Feature Selection Approach for Text Categorization by Using Chaos Optimization and Genetic Algorithm , 2013 .

[3]  Hao Dong,et al.  An improved particle swarm optimization for feature selection , 2011 .

[4]  Mengjie Zhang,et al.  Multi-objective particle swarm optimisation (PSO) for feature selection , 2012, GECCO '12.

[5]  Hongbin Zhang,et al.  Feature selection using tabu search method , 2002, Pattern Recognit..

[6]  Luo Zhang,et al.  A method for feature selection based on the correlation analysis , 2012, Proceedings of 2012 International Conference on Measurement, Information and Control.

[7]  Li-Yeh Chuang,et al.  Gene selection and classification using Taguchi chaotic binary particle swarm optimization , 2011, Expert Syst. Appl..

[8]  Mengjie Zhang,et al.  Particle swarm optimisation for feature selection in classification: Novel initialisation and updating mechanisms , 2014, Appl. Soft Comput..

[9]  Taghi M. Khoshgoftaar,et al.  A COMPARATIVE STUDY OF FILTER-BASED AND WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY MODELING , 2011 .

[10]  Li-Yeh Chuang,et al.  Chaotic maps based on binary particle swarm optimization for feature selection , 2011, Appl. Soft Comput..

[11]  B. Alatas,et al.  Chaos embedded particle swarm optimization algorithms , 2009 .

[12]  Mengjie Zhang,et al.  Single Feature Ranking and Binary Particle Swarm Optimisation Based Feature Subset Ranking for Feature Selection , 2012, ACSC.

[13]  Mengjie Zhang,et al.  A multi-objective particle swarm optimisation for filter-based feature selection in classification problems , 2012, Connect. Sci..