A Methodology for simplification and interpretation of backpropagation-based neural network models

Abstract A new methodology for building inductive expert systems known as neural networks has emerged as one of the most promising applications of artificial intelligence in the 1990s. The primary advantages of a neural network approach for modeling expert decision processes are: (1) the ability of the network to learn from examples of experts' decisions, which avoids the costly, time-consuming, and error-prone task of trying to extract knowledge of a problem domain directly from an expert, and (2) the ability of the network to handle the noisy, incomplete, and distorted data typically found in decision making under conditions of uncertainty. Unfortunately, a major limitation of neural network-based models has been the opacity of the inference process. Unlike conventional expert system decision support tools, decision makers are generally unable to understand the basis of neural network decisions, which often makes such systems undesirable for decision support applications. A new methodology is presented that allows the development of highly simplified backpropagation neural network models. This methodology simplifies networks by eliminating input variables that are not contributing to the network's ability to produce accurate predictions. Elimination of unnecessary input variables directly reduces the number of network parameters that must be estimated and consequently the complexity of the network structure. A primary benefit of this development methodology is that it is based on a variable importance measure that addresses the problem of producing an interpretation of a neural network's functioning. Decision makers may easily understand the resulting networks in terms of the proportional contribution each input variable makes to the production of accurate predictions. Furthermore, in actual application the accuracy of these simplified models should be comparable to or better than that of the more complex models developed with the standard approach.
This new methodology is demonstrated by two classification problems based on sets of actual data.
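The core idea of ranking inputs by their proportional contribution and pruning the rest can be sketched in code. The following is a minimal illustration, not the paper's exact procedure: it assumes a trained single-hidden-layer backpropagation network and uses a connection-weight importance measure in the spirit of Garson's method, with a hypothetical `threshold` parameter chosen by the analyst.

```python
import numpy as np

def garson_importance(w_ih, w_ho):
    """Proportional contribution of each input variable, derived from
    connection weights (a Garson-style measure; one plausible choice
    of importance metric, not necessarily the paper's own).

    w_ih: (n_inputs, n_hidden) input-to-hidden weights
    w_ho: (n_hidden,) hidden-to-output weights for a single output
    """
    abs_ih = np.abs(w_ih)
    abs_ho = np.abs(w_ho)
    # Each input's share of every hidden unit's incoming weight,
    # scaled by that hidden unit's absolute output weight.
    contrib = (abs_ih / abs_ih.sum(axis=0)) * abs_ho
    scores = contrib.sum(axis=1)
    return scores / scores.sum()  # proportions that sum to 1

def inputs_to_keep(w_ih, w_ho, threshold=0.05):
    """Indices of inputs whose contribution exceeds the (hypothetical)
    threshold; the simplified network would then be retrained using
    only these variables."""
    return np.flatnonzero(garson_importance(w_ih, w_ho) >= threshold)

# Example: three inputs, two hidden units; input 1 carries tiny weights.
w_ih = np.array([[4.0, 3.0],
                 [0.1, 0.2],
                 [2.0, 1.0]])
w_ho = np.array([1.0, 1.0])
print(garson_importance(w_ih, w_ho))   # input 1 gets a near-zero share
print(inputs_to_keep(w_ih, w_ho))      # input 1 is dropped
```

Because the importances are normalized proportions, a decision maker can read them directly as "variable i accounts for x% of the network's predictive behavior," which is the interpretability benefit the abstract describes.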
