Feature Selection for Modular Neural Network Classifiers

An N-class problem can be fully decomposed into N-independent small neural networks called modules (or sub-problems) in a modular neural network classifier. Each sub-problem is a two-class ( 'yes ' or 'no ' ) problem. Hence, the optimal input feature space for each module is also likely to be a subset of the original feature space. Therefore, feature selection plays an important role in finding these useful features. Some feature selection techniques have been developed from different perspectives but are not suitable, however, for the two-class problems resulting from complete task decomposition. In this paper, we propose two feature selection techniques— Relative Importance Factor (RIF) and Relative FLD Weight Analysis (RFWA) for modular neural network classifiers. Our approaches involve the use of Fisher's linear discriminant (FLD) function to obtain the importance of each feature and to find the correlation among features. In RIF, the input features are classified as relevant and irrelevant based on their contribution in classification. In RFWA, the irrelevant features are further classified into noise or redundant features based on the correlation among features. The proposed techniques have been applied to several classification problems. The results show that these techniques can successfully detect the irrelevant features in each module and improve accuracy while reducing computation effort. Reprint requests to: Both authors are with the Department of Electrical and Computer Engineering, National University of Singapore. 10 Kent Ridge Crescent, Singapore 119260; e-mail address: eleguans@nus.edu.sg

[1]  Mikko Lehtokangas Modelling with constructive backpropagation , 1999, Neural Networks.

[2]  S. Sjogaard Generalization in cascade-correlation networks , 1992, Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop.

[3]  Antonio González Muñoz,et al.  Table Ii Tc Pattern Recognition Result for 120 Eir Satellite Image Cases Selection of Relevant Features in a Fuzzy Genetic Learning Algorithm , 2001 .

[4]  Kenneth W. Bauer,et al.  Determining input features for multilayer perceptrons , 1995, Neurocomputing.

[5]  Timur Ash,et al.  Dynamic node creation in backpropagation networks , 1989 .

[6]  Dit-Yan Yeung A Neural Network Approach to Constructive Induction , 1991, ML.

[7]  M. T. Schilling,et al.  Fault location in electrical power systems using intelligent systems techniques , 2001 .

[8]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[9]  Huan Liu,et al.  Neural-network feature selector , 1997, IEEE Trans. Neural Networks.

[10]  Kenneth W. Bauer,et al.  Integrated Feature and Architecture Selection for Radial Basis Neural Networks , 2003 .

[11]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[12]  Hugo Guterman,et al.  Feature selection and chromosome classification using a multilayer perceptron neural network , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[13]  D. W. Tufts,et al.  Principal-feature classification , 1995, Proceedings of 1995 IEEE Workshop on Neural Networks for Signal Processing.

[14]  Kishan G. Mehrotra,et al.  Efficient classification for multiclass problems using modular neural networks , 1995, IEEE Trans. Neural Networks.

[15]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[16]  Steven Guan,et al.  Parallel growing and training of neural networks using output parallelism , 2002, IEEE Trans. Neural Networks.

[17]  Chong-Ho Choi,et al.  Input feature selection for classification problems , 2002, IEEE Trans. Neural Networks.

[18]  Mohamed S. Kamel,et al.  Modular neural network architectures for classification , 1996, Proceedings of International Conference on Neural Networks (ICNN'96).

[19]  Steven K. Rogers,et al.  Bayesian selection of important features for feedforward neural networks , 1993, Neurocomputing.

[20]  Lutz Prechelt,et al.  Investigation of the CasCor Family of Learning Algorithms , 1997, Neural Networks.