Multi-objective Genetic Algorithm Evaluation in Feature Selection

Feature Selection may be viewed as a search for optimal feature subsets considering one or more importance criteria. This search may be performed with Multi-objective Genetic Algorithms. In this work, we present an application of these algorithms for combining different filter approach criteria, which rely on general characteristics of the data, as feature-class correlation, to perform the search for subsets of features. We conducted experiments on public data sets and the results show the potential of this proposal when compared to mono-objective genetic algorithms and two popular filter algorithms.

[1]  Joshua D. Knowles,et al.  Multiobjective Optimization in Bioinformatics and Computational Biology , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Carlos A. Coello Coello,et al.  Evolutionary multi-objective optimization: a historical view of the field , 2006, IEEE Comput. Intell. Mag..

[4]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[5]  Terry Windeatt,et al.  Correlation-Based and Causal Feature Selection Analysis for Ensemble Classifiers , 2010, ANNPR.

[6]  Tony R. Martinez,et al.  Improved Heterogeneous Distance Functions , 1996, J. Artif. Intell. Res..

[7]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[8]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[9]  M. C. Monard,et al.  A Fractal Dimension Based Filter Algorithm to Select Features for Supervised Learning , 2006, IBERAMIA-SBIA.

[10]  Y. Ong,et al.  Feature Selection Using Single/Multi-Objective Memetic Frameworks , 2009 .

[11]  Ian Witten,et al.  Data Mining , 2000 .

[12]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[13]  Anne M. P. Canuto,et al.  Feature selection in heterogeneous structure of ensembles: A genetic algorithm approach , 2009, 2009 International Joint Conference on Neural Networks.

[14]  Deng Cai,et al.  Laplacian Score for Feature Selection , 2005, NIPS.

[15]  Lorenzo Bruzzone,et al.  A Novel Approach to the Selection of Spatially Invariant Features for the Classification of Hyperspectral Images With Improved Generalization Capability , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Taeshik Shon,et al.  Applying genetic algorithm for classifying anomalous TCP/IP packets , 2006, Neurocomputing.

[17]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[18]  Mengjie Zhang,et al.  Pareto front feature selection: using genetic programming to explore feature space , 2009, GECCO.

[19]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[20]  Sushmita Mitra,et al.  Evolutionary Rough Feature Selection in Gene Expression Data , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[21]  Weizhong Yan,et al.  Fusion in multi-criterion feature ranking , 2007, 2007 10th International Conference on Information Fusion.

[22]  Stefan Holban,et al.  A Computational Intelligence Approach for Ranking Risk Factors in Preterm Birth , 2007, 2007 4th International Symposium on Applied Computational Intelligence and Informatics.

[23]  José Manuel Benítez,et al.  Consistency measures for feature selection , 2008, Journal of Intelligent Information Systems.

[24]  Lipo Wang,et al.  Data Mining With Computational Intelligence , 2006, IEEE Transactions on Neural Networks.

[25]  Yin-Fu Huang,et al.  Evolutionary-based feature selection approaches with new criteria for data mining: A case study of credit approval data , 2009, Expert Syst. Appl..

[26]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[27]  Kalyanmoy Deb,et al.  A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimisation: NSGA-II , 2000, PPSN.

[28]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[29]  Ana Carolina Lorena,et al.  Use of Multiobjective Genetic Algorithms in Feature Selection , 2010, 2010 Eleventh Brazilian Symposium on Neural Networks.

[30]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[31]  Venkatesan Guruswami,et al.  Combinatorial feature selection problems , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[32]  Slobodan Petrovic,et al.  Improving Effectiveness of Intrusion Detection by Correlation Feature Selection , 2010, ARES.

[33]  Nicoletta Dessì,et al.  An evolutionary method for combining different feature selection criteria in microarray data classification , 2009 .

[34]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[35]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[36]  Xin Yao,et al.  Parallel Problem Solving from Nature PPSN VI , 2000, Lecture Notes in Computer Science.

[37]  Paolo Rosso,et al.  A comparison of machine learning techniques for detection of drug target articles , 2010, J. Biomed. Informatics.

[38]  Jennifer G. Dy Unsupervised Feature Selection , 2007 .

[39]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[40]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[41]  Marco Laumanns,et al.  PISA: A Platform and Programming Language Independent Interface for Search Algorithms , 2003, EMO.

[42]  Andries Petrus Engelbrecht,et al.  A decision rule-based method for feature selection in predictive data mining , 2010, Expert Syst. Appl..

[43]  Kay Chen Tan,et al.  Multi-Objective Memetic Algorithms , 2009 .

[44]  Carlos A. Coello Coello,et al.  Online Objective Reduction to Deal with Many-Objective Problems , 2009, EMO.