论文信息 - Extracting Provably Correct Rules from Artificial Neural Networks

Extracting Provably Correct Rules from Artificial Neural Networks

Although connectionist learning procedures have been applied successfully to a variety of real-world scenarios, artificial neural networks have often b en criticized for exhibiting a low degree of comprehensibility. Mechanisms that automatically compile neural networks into symbolic rules offer a promising perspective to overcome this practical shortcoming of neural network repres entations. This paper describes an approach to neural network rule extraction based on Validity Interval Analysis (VI-Analysis). VI-Analysis is a generic tool for extracting symbolic knowledge from Backpropagation-style artificial neural networks. It does this by propagating whole intervals of activations through the network in both the forward and backward directions. In the context of rule extraction, these intervals are used to prove or disprove the correctness of conjectured rules . We describe techniques for generating and testing rule hypotheses, and demonstrate these using some simple classification tasks including the MONK’s benchmark problems. Rules extracted by VI-Analysis are provably correct. No assumpt ions are made about the topology of the network at hand, as well as the procedure employed for training the network.

Sebastian Thrun | S. Thrun

[1] Ryszard S. Michalski,et al. Comparing learning paradigms via diagrammatic visualization: a case study in single concept learning using symbolic, neural net and genetic algorithm methods , 1991 .

[2] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[3] Sebastian Thrun,et al. The MONK''s Problems-A Performance Comparison of Different Learning Algorithms, CMU-CS-91-197, Sch , 1991 .

[4] C. L. Giles,et al. Rule refinement with recurrent neural networks , 1993, IEEE International Conference on Neural Networks.

[5] Jude W. Shavlik,et al. Interpretation of Artificial Neural Networks: Mapping Knowledge-Based Neural Networks into Rules , 1991, NIPS.

[6] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[7] Terrence J. Sejnowski,et al. NETtalk: a parallel network that learns to read aloud , 1988 .

[8] Clayton McMillan. Rule induction in a neural network through integrated symbolic and subsymbolic processing , 1992 .

[9] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..

[10] John H. Holland,et al. Genetic Algorithms and Adaptation , 1984 .

[11] Michael C. Mozer,et al. Rule Induction through Integrated Symbolic and Subsymbolic Processing , 1991, NIPS.