A Feature Subset Selection Algorithm Based on Feature Activity and Improved GA

Feature subset selection is an important research branch in the field of pattern recognition. Due to the traditional feature selection algorithms do not take into account the feature updating case, the paper analyzes the relationship between dataset and features, proposes a new feature activity measurement that is used to determine the influence among different features on some certain conditions. Based on the feature activity measurement, to cope with the premature convergence and the weak local search ability of classic genetic algorithm, the paper proposes a feature set selection algorithm based on adaptive feature activity and improved genetic algorithm. The proposed algorithm can dynamic guidance feature selection process, and then accelerate from multidimensional characteristics in the collection to find the optimal feature subset. Experimental results indicate the proposed method can obtain small scale feature set on the basis of higher classification accuracy and faster running time than those compared algorithms. The proposed algorithm can be better applied to the field of feature selection application.

[1]  Shian-Shyong Tseng,et al.  A two-phase feature selection method using both filter and wrapper , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[2]  Stephen A. Billings,et al.  Feature Subset Selection and Ranking for Data Dimensionality Reduction , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Witold Pedrycz,et al.  Identifying core sets of discriminatory features using particle swarm optimization , 2009, Expert Syst. Appl..

[4]  William H. Hsu,et al.  Genetic wrappers for feature selection in decision tree induction and variable ordering in Bayesian network structure learning , 2004, Inf. Sci..

[5]  Yi Liu,et al.  FS_SFS: A novel feature selection method for support vector machines , 2006, Pattern Recognit..

[6]  Yuping Wang,et al.  Inventory based two-objective job shop scheduling model and its hybrid genetic algorithm , 2013, Appl. Soft Comput..

[7]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[8]  Kuan-Cheng Lin,et al.  A Novel Feature Selection Method for Support Vector Machines Using a Lion’s Algorithm , 2014 .

[9]  Yuping Wang,et al.  An evolutionary algorithm for dynamic multi-objective optimization , 2008, Appl. Math. Comput..

[10]  S. Billings,et al.  Feature Subset Selection and Ranking for Data Dimensionality Reduction , 2007 .

[11]  Somnath Mukhopadhyay,et al.  Methods for pattern selection, class-specific feature selection and classification for automated learning. , 2013, Neural networks : the official journal of the International Neural Network Society.

[12]  Hossein Nezamabadi-pour,et al.  GSA: A Gravitational Search Algorithm , 2009, Inf. Sci..

[13]  Liu Zhen Novel spam filtering method based on fuzzy adaptive particle swarm optimization , 2011 .

[14]  Philip M. Lewis,et al.  The characteristic selection problem in recognition systems , 1962, IRE Trans. Inf. Theory.

[15]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.