Rule Extraction from Minimal Neural Networks for Credit Card Screening

While feedforward neural networks have been widely accepted as effective tools for solving classification problems, the issue of finding the best network architecture remains unresolved, particularly so in real-world problem settings. We address this issue in the context of credit card screening, where it is important to not only find a neural network with good predictive performance but also one that facilitates a clear explanation of how it produces its predictions. We show that minimal neural networks with as few as one hidden unit provide good predictive accuracy, while having the added advantage of making it easier to generate concise and comprehensible classification rules for the user. To further reduce model size, a novel approach is suggested in which network connections from the input units to this hidden unit are removed by a very straightaway pruning procedure. In terms of predictive accuracy, both the minimized neural networks and the rule sets generated from them are shown to compare favorably with other neural network based classifiers. The rules generated from the minimized neural networks are concise and thus easier to validate in a real-life setting.

[1]  Bart Baesens,et al.  Using Neural Network Rule Extraction and Decision Tables for Credit - Risk Evaluation , 2003, Manag. Sci..

[2]  Ehud D. Karnin,et al.  A simple procedure for pruning back-propagation trained neural networks , 1990, IEEE Trans. Neural Networks.

[3]  C. Ledoux,et al.  Two original weight pruning methods based on statistical tests and rounding techniques , 1994 .

[4]  Snjezana Soltic,et al.  Knowledge Extraction from Evolving Spiking Neural Networks with Rank Order Population Coding , 2010, Int. J. Neural Syst..

[5]  Daniel S. Yeung,et al.  Hidden neuron pruning of multilayer perceptrons using a quantified sensitivity measure , 2006, Neurocomputing.

[6]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[7]  Narasimhan Sundararajan,et al.  A generalized growing and pruning RBF (GGAP-RBF) neural network for function approximation , 2005, IEEE Transactions on Neural Networks.

[8]  De-Shuang Huang,et al.  A Hybrid Forward Algorithm for RBF Neural Network Construction , 2006, IEEE Transactions on Neural Networks.

[9]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[10]  Rudy Setiono,et al.  A Penalty-Function Approach for Pruning Feedforward Neural Networks , 1997, Neural Computation.

[11]  Lutz Prechelt,et al.  PROBEN 1 - a set of benchmarks and benchmarking rules for neural network training algorithms , 1994 .

[12]  Roelof K. Brouwer Fuzzy rule extraction from a feed forward neural network by training a representative fuzzy neural network using gradient descent , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[13]  Bart Baesens,et al.  Credit Risk Management: Basic Concepts: Financial Risk Components, Rating Analysis, Models, Economic and Regulatory Capital , 2009 .

[14]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[15]  Jean-Pierre Nadal,et al.  Study of a Growth Algorithm for a Feedforward Network , 1989, Int. J. Neural Syst..

[16]  Khashayar Khorasani,et al.  New training strategies for constructive neural networks with application to regression problems , 2004, Neural Networks.

[17]  Bart Baesens,et al.  Recursive Neural Network Rule Extraction for Data With Mixed Attributes , 2008, IEEE Transactions on Neural Networks.

[18]  Daming Shi,et al.  Entropy Learning and Relevance Criteria for Neural Network Pruning , 2003, Int. J. Neural Syst..

[19]  Adnan Khashman,et al.  A Neural Network Model for Credit Risk Evaluation , 2009, Int. J. Neural Syst..

[20]  Michael C. Mozer,et al.  Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic , 2003, ICML.

[21]  Randall S. Sexton,et al.  Knowledge discovery using a neural network simultaneous optimization algorithm on a real world classification problem , 2006, Eur. J. Oper. Res..

[22]  Alexander Gammerman,et al.  Machine-learning algorithms for credit-card applications , 1992 .

[23]  Panagiotis Patrinos,et al.  Variable Selection in Nonlinear Modeling Based on RBF Networks and Evolutionary Computation , 2010, Int. J. Neural Syst..

[24]  R. Palmer,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[25]  Barry P. Haynes,et al.  Pruning Artificial Neural Networks Using Neural Complexity Measures , 2008, Int. J. Neural Syst..

[26]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[27]  Vijay S. Desai,et al.  A comparison of neural networks and linear scoring models in the credit union environment , 1996 .

[28]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[29]  Giovanna Castellano,et al.  Variable selection using neural-network models , 2000, Neurocomputing.

[30]  Bruce Curry,et al.  Model selection in Neural Networks: Some difficulties , 2006, Eur. J. Oper. Res..

[31]  Tommy W. S. Chow,et al.  Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information , 2005, IEEE Transactions on Neural Networks.

[32]  A. Steenackers,et al.  A credit scoring model for personal loans , 1989 .

[33]  Daming Shi,et al.  Sensitivity analysis applied to the construction of radial basis function networks , 2005, Neural Networks.

[34]  Rudy Setiono,et al.  Use of a quasi-Newton method in a feedforward neural network construction algorithm , 1995, IEEE Trans. Neural Networks.

[35]  Huan Liu,et al.  Neural-network feature selector , 1997, IEEE Trans. Neural Networks.

[36]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[37]  Monica Lam,et al.  Neural network techniques for financial performance prediction: integrating fundamental and technical analysis , 2004, Decis. Support Syst..

[38]  Hui Li,et al.  Predicting business failure using multiple case-based reasoning combined with support vector machine , 2009, Expert Syst. Appl..

[39]  Melody Y. Kiang,et al.  A comparative assessment of classification methods , 2003, Decis. Support Syst..

[40]  Ryotaro Kamimura Multi-Layered Greedy Network-Growing Algorithm: Extension of Greedy Network-Growing Algorithm to Multi-Layered Networks , 2004, Int. J. Neural Syst..

[41]  Hans Henrik Thodberg,et al.  Improving Generalization of Neural Networks Through Pruning , 1991, Int. J. Neural Syst..

[42]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[43]  Andries Petrus Engelbrecht,et al.  A new pruning heuristic based on variance analysis of sensitivity information , 2001, IEEE Trans. Neural Networks.

[44]  Michael T. Manry,et al.  An integrated growing-pruning method for feedforward network training , 2008, Neurocomputing.

[45]  Chih-Fong Tsai,et al.  Feature selection in bankruptcy prediction , 2009, Knowl. Based Syst..

[46]  Alfred Jean Philippe Lauret,et al.  A node pruning algorithm based on a Fourier amplitude sensitivity test method , 2006, IEEE Transactions on Neural Networks.

[47]  Daniel S. Yeung,et al.  Feature selection using localized generalization error for supervised classification problems using RBFNN , 2008, Pattern Recognit..

[48]  Huan Liu,et al.  Improving backpropagation learning with feature selection , 1996, Applied Intelligence.

[49]  Rudy Setiono,et al.  A note on knowledge discovery using neural networks and its application to credit card screening , 2009, Eur. J. Oper. Res..

[50]  Kee S. Kim,et al.  Predicting bond ratings using publicly available information , 2005, Expert Syst. Appl..

[51]  Melody Y. Kiang,et al.  Predicting Bank Failures: A neural network approach , 1990, Appl. Artif. Intell..

[52]  Guido Bologna,et al.  A Study on Rule Extraction from Several Combined Neural Networks , 2001, Int. J. Neural Syst..

[53]  Jacek M. Zurada,et al.  Computational intelligence methods for rule-based data understanding , 2004, Proceedings of the IEEE.

[54]  Léon Personnaz,et al.  Neural-network construction and selection in nonlinear modeling , 2003, IEEE Trans. Neural Networks.

[55]  David J. Hand,et al.  Construction of a k-nearest-neighbour credit-scoring system , 1997 .

[56]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[57]  Daniel W. C. Ho,et al.  A new training and pruning algorithm based on node dependence and Jacobian rank deficiency , 2006, Neurocomputing.

[58]  Kwok-Wo Wong,et al.  A Local Training and Pruning Approach for Neural Networks , 2000, Int. J. Neural Syst..