A Generic Multi-dimensional Feature Extraction Method Using Multiobjective Genetic Programming

In this paper, we present a generic feature extraction method for pattern classification using multiobjective genetic programming. This not only evolves the (near-)optimal set of mappings from a pattern space to a multi-dimensional decision space, but also simultaneously optimizes the dimensionality of that decision space. The presented framework evolves vector-to-vector feature extractors that maximize class separability. We demonstrate the efficacy of our approach by making statistically-founded comparisons with a wide variety of established classifier paradigms over a range of datasets and find that for most of the pairwise comparisons, our evolutionary method delivers statistically smaller misclassification errors. At very worst, our method displays no statistical difference in a few pairwise comparisons with established classifier/dataset combinations; crucially, none of the misclassification results produced by our method is worse than any comparator classifier. Although principally focused on feature extraction, feature selection is also performed as an implicit side effect; we show that both feature extraction and selection are important to the success of our technique. The presented method has the practical consequence of obviating the need to exhaustively evaluate a large family of conventional classifiers when faced with a new pattern recognition problem in order to attain a good classification accuracy.

[1]  Edwin D. de Jong,et al.  Multi-Objective Methods for Tree Size Control , 2003, Genetic Programming and Evolvable Machines.

[2]  Neal R. Harvey,et al.  Comparison of GENIE and conventional supervised classifiers for multispectral image feature extraction , 2002, IEEE Trans. Geosci. Remote. Sens..

[3]  Sean Luke,et al.  Evolving kernels for support vector machine classification , 2007, GECCO '07.

[4]  William B. Langdon,et al.  Genetic Programming and Data Structures: Genetic Programming + Data Structures = Automatic Programming! , 1998 .

[5]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[6]  Yang Zhang,et al.  A generic optimising feature extraction method using multiobjective genetic programming , 2011, Appl. Soft Comput..

[7]  Asoke K. Nandi,et al.  Feature generation using genetic programming with application to fault classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[9]  Vic Ciesielski,et al.  Representing classification problems in genetic programming , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[10]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[11]  Krzysztof Krawiec,et al.  Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks , 2002, Genetic Programming and Evolvable Machines.

[12]  Stefan Wermter,et al.  A Comparison of Feature Extraction and Selection Techniques , 2003 .

[13]  Christopher Harris,et al.  An investigation into the application of genetic programming techniques to signal analysis and feature detection , 1998 .

[14]  Satoshi Sato,et al.  Non-destructive Depth-Dependent Crossover for Genetic Programming , 1998, EuroGP.

[15]  Huan Liu,et al.  Book review: Machine Learning, Neural and Statistical Classification Edited by D. Michie, D.J. Spiegelhalter and C.C. Taylor (Ellis Horwood Limited, 1994) , 1996, SGAR.

[16]  Larry Bull,et al.  Genetic Programming with a Genetic Algorithm for Feature Construction and Selection , 2005, Genetic Programming and Evolvable Machines.

[17]  J. M. Leiva-Murillo,et al.  Comparison of Supervised Feature Extraction Methods for Multispectral Images , 2002 .

[18]  Wolfram Schiffmann,et al.  Synthesis and Performance Analysis of Multilayer Neural Network Architectures , 1992 .

[19]  D. Coomans,et al.  Potential pattern recognition in chemical and medical decision making , 1986 .

[20]  Anikó Ekárt,et al.  Selection Based on the Pareto Nondomination Criterion for Controlling Code Growth in Genetic Programming , 2001, Genetic Programming and Evolvable Machines.

[21]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[22]  Yang Zhang,et al.  Domain-independent feature extraction for multi-classification using multi-objective genetic programming , 2010, Pattern Analysis and Applications.

[23]  Marc Ebner,et al.  On the Evolution of Interest Operators using Genetic Programming , 1998 .

[24]  Yang Zhang,et al.  Evolving optimal feature extraction using multi-objective genetic programming: a methodology and preliminary study on edge detection , 2005, GECCO '05.

[25]  P. Rockett,et al.  A Generic Optimal Feature Extraction Method using Multiobjective Genetic Programming : Methodology and Applications , 2007 .

[26]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[27]  Panos M. Pardalos,et al.  A comparative study of linear and nonlinear feature extraction methods , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[28]  Ian Witten,et al.  Data Mining , 2000 .

[29]  Rajeev Kumar,et al.  Improved Sampling of the Pareto-Front in Multiobjective Genetic Optimizations by Steady-State Evolution: A Pareto Converging Genetic Algorithm , 2002, Evolutionary Computation.

[30]  Yang Zhang,et al.  Feature Extraction Using Multi-Objective Genetic Programming , 2006, Multi-Objective Machine Learning.

[31]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[32]  Marco Laumanns,et al.  Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..

[33]  Terry Ngo,et al.  Data mining: practical machine learning tools and technique, third edition by Ian H. Witten, Eibe Frank, Mark A. Hell , 2011, SOEN.

[34]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[35]  Nikhil R. Pal,et al.  A novel approach to design classifiers using genetic programming , 2004, IEEE Transactions on Evolutionary Computation.

[36]  Walter Alden Tackett,et al.  Genetic Programming for Feature Discovery and Image Discrimination , 1993, ICGA.

[37]  Martijn C. J. Bot Feature Extraction for the k-Nearest Neighbour Classifier with Genetic Programming , 2001, EuroGP.

[38]  Asoke K. Nandi,et al.  Breast Cancer Diagnosis Using Genetic Programming Generated Feature , 2005 .

[39]  Ethem Alpaydın,et al.  Combined 5 x 2 cv F Test for Comparing Supervised Classification Learning Algorithms , 1999, Neural Comput..

[40]  Yaochu Jin,et al.  Multi-Objective Machine Learning , 2006, Studies in Computational Intelligence.

[41]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[42]  Alex A. Freitas,et al.  Evolutionary Computation , 2002 .

[43]  Carlos A. Coello Coello,et al.  An updated survey of GA-based multiobjective optimization techniques , 2000, CSUR.

[44]  Andreas Zell,et al.  Evolving Task Specific Image Operator , 1999, EvoWorkshops.

[45]  Manabu Kotani,et al.  Feature extraction using evolutionary computation , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[46]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[47]  William B. Langdon,et al.  Application of Genetic Programming to Induction of Linear Classification Trees , 2000, EuroGP.

[48]  D. Wolpert The Supervised Learning No-Free-Lunch Theorems , 2002 .

[49]  Michael G. Madden,et al.  The Genetic Kernel Support Vector Machine: Description and Evaluation , 2005, Artificial Intelligence Review.

[50]  Danny Coomans,et al.  Improvements to the classification performance of RDA , 1993 .

[51]  John R. Koza,et al.  Genetic programming 2 - automatic discovery of reusable programs , 1994, Complex Adaptive Systems.