A differential evolution based feature selection approach using an improved filter criterion

In filter feature selection, mutual information approaches have recently gained a high popularity among researchers. In these approaches, mutual information is commonly used to measure two components: the mutual relevance between each feature and the class labels, and the mutual redundancy between each pair of features. Despite their popularity, it has been pointed in the literature that such feature selection approaches may not fairly estimate the redundancy in high dimensional problems. To alleviate this problem, this paper proposes a new criterion, which uses the concepts of ReliefF instead of the mutual redundancy. Using the proposed criterion, a new differential evolution based filter feature selection approach is developed. The performance comparisons and analysis are conducted by comparing it with the most well-known mutual information feature selection (MIFS) criterion based on maximum-relevance and minimum-redundancy on the differential evolution framework. The results show that performing feature selection using the proposed criterion can generally achieve better classification performance and smaller feature subset size.

[1]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[2]  Xiaoming Xu,et al.  A hybrid genetic algorithm for feature selection wrapper based on mutual information , 2007, Pattern Recognit. Lett..

[3]  Asha Gowda Karegowda,et al.  Cascading GA & CFS for Feature Subset selection in Medical Data Mining , 2009, 2009 IEEE International Advance Computing Conference.

[4]  Selma Ayse Özel,et al.  A hybrid approach of differential evolution and artificial bee colony for feature selection , 2016, Expert Syst. Appl..

[5]  Jian Li,et al.  Iterative RELIEF for feature weighting , 2006, ICML.

[6]  K. Premalatha,et al.  A Survey on Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification , 2017 .

[7]  Shimon Ullman,et al.  Object recognition with informative features and linear classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[8]  Ruisheng Zhang,et al.  A BPSO-SVM algorithm based on memory renewal and enhanced mutation mechanisms for feature selection , 2017, Appl. Soft Comput..

[9]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[10]  Bing Xue,et al.  Mutual information for feature selection: estimation or counting? , 2016, Evol. Intell..

[11]  Colas Schretter,et al.  Information-Theoretic Feature Selection in Microarray Data Using Variable Complementarity , 2008, IEEE Journal of Selected Topics in Signal Processing.

[12]  Jacob Benesty,et al.  Pearson Correlation Coefficient , 2009 .

[13]  Mengjie Zhang,et al.  Filter based backward elimination in wrapper based PSO for feature selection in classification , 2014, 2014 IEEE Congress on Evolutionary Computation (CEC).

[14]  Mengjie Zhang,et al.  Binary particle swarm optimisation for feature selection: A filter based approach , 2012, 2012 IEEE Congress on Evolutionary Computation.

[15]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[16]  Dahua Lin,et al.  Conditional Infomax Learning: An Integrated Framework for Feature Extraction and Fusion , 2006, ECCV.

[17]  Max A. Little,et al.  A Simple Filter Benchmark for Feature Selection , 2010 .

[18]  Prabhat Ranjan,et al.  FRBPSO: A Fuzzy Rule Based Binary PSO for Feature Selection , 2017 .

[19]  Chong-Ho Choi,et al.  Input feature selection for classification problems , 2002, IEEE Trans. Neural Networks.

[20]  Antonio Martínez-Álvarez,et al.  Feature selection by multi-objective optimisation: Application to network anomaly detection by hierarchical self-organising maps , 2014, Knowl. Based Syst..

[21]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[22]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[23]  Bing Xue,et al.  Mutual Information Estimation for Filter Based Feature Selection Using Particle Swarm Optimization , 2016, EvoApplications.

[24]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[25]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[26]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[27]  Anuradha Purohit,et al.  Feature Selection using Modified Particle Swarm Optimization , 2017 .

[28]  P. N. Suganthan,et al.  Differential Evolution: A Survey of the State-of-the-Art , 2011, IEEE Transactions on Evolutionary Computation.

[29]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[30]  Ajith Abraham,et al.  A simplex differential evolution algorithm: development and applications , 2012 .

[31]  Mengjie Zhang,et al.  A binary ABC algorithm based on advanced similarity scheme for feature selection , 2015, Appl. Soft Comput..

[32]  Ahmed Al-Ani Ant Colony Optimization for Feature Subset Selection , 2005, WEC.

[33]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[34]  Mengjie Zhang,et al.  Multi-objective Evolutionary Algorithms for filter Based Feature Selection in Classification , 2013, Int. J. Artif. Intell. Tools.

[35]  U. Balakrishnan,et al.  An enhanced PSO-DEFS based feature selection with biometric authentication for identification of diabetic retinopathy , 2016 .

[36]  Adel Al-Jumaily,et al.  A Combined Ant Colony and Differential Evolution Feature Selection Algorithm , 2008, ANTS Conference.

[37]  Magdalene Marinaki,et al.  An Island Memetic Differential Evolution Algorithm for the Feature Selection Problem , 2013, NICSO.

[38]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Azuraliza Abu Bakar,et al.  Hybrid feature selection based on enhanced genetic algorithm for text categorization , 2016, Expert Syst. Appl..

[40]  John E. Moody,et al.  Data Visualization and Feature Selection: New Algorithms for Nongaussian Data , 1999, NIPS.

[41]  Mengjie Zhang,et al.  A multi-objective artificial bee colony approach to feature selection using fuzzy mutual information , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[42]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[43]  Mengjie Zhang,et al.  Particle Swarm Optimisation with genetic operators for feature selection , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[44]  Rich Caruana,et al.  Greedy Attribute Selection , 1994, ICML.

[45]  Mohamed A. Deriche,et al.  A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence , 2002, J. Artif. Intell. Res..