Evolutionary local improvement on genetic algorithms for feature selection

Feature selection is an extremely important matter in pattern recognition, particularly when a large set of features is available without knowledge about the discriminative information provided by each element. The key issue is to define a criterion in order to rank the features, discarding those features that are less relevant, redundant, or noisy. This depends on the particular task, the classifier and the properties of the data. A frequent approach consists on the use of genetic algorithms guided by the classification accuracy. However they are often not able to provide a solution with both a considerable reduction of dimensionality and high accuracy rate. Here we propose a modified version of a genetic algorithm, introducing a novel local improvement approach based on evolution, which is able to obtain better dimensionality-accuracy trade-off. Experimental results on different well known datasets show the advantages of our proposal.

[1]  Diego H. Milone,et al.  Multi-objective optimisation of wavelet features for phoneme recognition , 2016, IET Signal Process..

[2]  Lluís A. Belanche Muñoz,et al.  Feature selection algorithms: a survey and experimental evaluation , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[3]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[4]  J. David Schaffer,et al.  Proceedings of the third international conference on Genetic algorithms , 1989 .

[5]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[6]  Snehamoy Chatterjee,et al.  Genetic algorithms for feature selection of image analysis-based quality monitoring model: An application to an iron mine , 2011, Eng. Appl. Artif. Intell..

[7]  Terence C. Fogarty,et al.  Varying the Probability of Mutation in the Genetic Algorithm , 1989, ICGA.

[8]  Byung Ro Moon,et al.  Genetic Algorithm and Graph Partitioning , 1996, IEEE Trans. Computers.

[9]  César Hervás-Martínez,et al.  Evolutionary Generalized Radial Basis Function neural networks for improving prediction accuracy in gene classification using feature selection , 2012, Appl. Soft Comput..

[10]  Sam Kwong,et al.  Genetic algorithms and their applications , 1996, IEEE Signal Process. Mag..

[11]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[12]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[13]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Andrew Rosenberg,et al.  Classifying Skewed Data: Importance Weighting to Optimize Average Recall , 2012, INTERSPEECH.

[15]  S. R. Mahadeva Prasanna,et al.  Feature optimisation for stress recognition in speech , 2016, Pattern Recognit. Lett..

[16]  Mohamed Chetouani,et al.  Optimizing feature complementarity by evolution strategy: Application to automatic speaker verification , 2009, Speech Commun..

[17]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[18]  Jacob Scharcanski,et al.  Feature selection for face recognition based on multi-objective evolutionary wrappers , 2013, Expert Syst. Appl..

[19]  Jiawei Han,et al.  Generalized Fisher Score for Feature Selection , 2011, UAI.

[20]  Sadiq M. Sait,et al.  Evolutionary algorithms, simulated annealing and tabu search: a comparative study , 2001 .

[21]  Sébastien Gúerif,et al.  Dimensionality Reduction Through Unsupervised Features Selection , 2007 .

[22]  Hui-Huang Hsu,et al.  Hybrid feature selection by combining filters and wrappers , 2011, Expert Syst. Appl..

[23]  João Miguel da Costa Sousa,et al.  Metaheuristics for feature selection: Application to sepsis outcome prediction , 2012, 2012 IEEE Congress on Evolutionary Computation.

[24]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[25]  Marco Cristani,et al.  Infinite Feature Selection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Kazuyuki Murase,et al.  Involving New Local Search in Hybrid Genetic Algorithm for Feature Selection , 2009, ICONIP.

[27]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[28]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[29]  Witold Pedrycz,et al.  Evolutionary feature selection via structure retention , 2012, Expert Syst. Appl..

[30]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Diego H. Milone,et al.  Evolutionary Splines for Cepstral Filterbank Optimization in Phoneme Classification , 2011, EURASIP J. Adv. Signal Process..

[32]  Simone Melzi,et al.  Ranking to Learn: - Feature Ranking and Selection via Eigenvector Centrality , 2016, NFMCP@PKDD/ECML.