Nonlinear Feature Construction with Evolved Neural Networks for Classification Problems

Predicting the class membership of a set of patterns represented by points in a multi-dimensional space critically depends on their specific distribution. To improve the classification performance, pattern vectors may be transformed. There is a range of linear methods for feature construction, but these are often limited in their performance. Nonlinear methods are a more recent development in this field, but these pose difficult optimization problems. Evolutionary approaches have been used to optimize both linear and nonlinear functions for feature construction. For nonlinear feature construction, a particular problem is how to encode the function in order to limit the huge search space while preserving enough flexibility to evolve effective solutions. In this paper, we present a new method for generating a nonlinear function for feature construction using multi-layer perceptrons whose weights are shaped by evolution. By pre-defining the architecture of the neural network we can directly influence the computational capacity of the function and the number of features to be constructed. We evaluate the suggested neural feature construction on four commonly used data sets and report an improvement in classification accuracy ranging from 4 to 13 percentage points over the performance on the original pattern set. A version of this report has been published as “Tobias Berka and Helmut A. Mayer: Nonlinear Feature Construction with Evolved Neural Networks for Classification Problems. Proceedings of the First International Conference on Pattern Recognition Applications and Methods (ICPRAM’12), 2012.”

[1]  datasets,et al.  Breast Cancer Diagnosis , 1967, Nature.

[2]  Roland Schwaiger,et al.  Differentiation of neuron types by evolving activation function templates for artificial neural networks , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[3]  Daniel Weingaertner,et al.  Evolving heterogeneous neural networks for classification problems , 2001 .

[4]  Bernhard Schölkopf,et al.  Iterative kernel principal component analysis for image modeling , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  David B. Fogel,et al.  Evolutionary algorithms in theory and practice , 1997, Complex.

[6]  Michael Schlosser,et al.  Discovery of Relevant New Features by Generating Non-Linear Decision Trees , 1996, KDD.

[7]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[10]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[12]  Helmut A. Mayer,et al.  Multi-chromosomal representations in neuroevolution , 2006, Computational Intelligence.

[13]  Mikael Bodén,et al.  An experimental comparison of weight evolution in neural control architectures for a 'Garbage-Collecting' Khepera robot , 1999 .

[14]  Asoke K. Nandi,et al.  Breast Cancer Diagnosis Using Genetic Programming Generated Feature , 2005, 2005 IEEE Workshop on Machine Learning for Signal Processing.

[15]  Tat-Jun Chin,et al.  Incremental Kernel PCA for Efficient Non-linear Feature Extraction , 2006, BMVC.

[16]  X. Yao Evolving Artificial Neural Networks , 1999 .

[17]  Paulo Cortez,et al.  Modeling wine preferences by data mining from physicochemical properties , 2009, Decis. Support Syst..

[18]  Kevin Baker,et al.  Classification of radar returns from the ionosphere using neural networks , 1989 .

[19]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[20]  Olvi L. Mangasarian,et al.  Nuclear feature extraction for breast tumor diagnosis , 1993, Electronic Imaging.

[21]  Mahesan Niranjan,et al.  Realisable Classifiers: Improving Operating Performance on Variable Cost Problems , 1998, BMVC.

[22]  Charu C. Aggarwal,et al.  The Generalized Dimensionality Reduction Problem , 2010, SDM.