论文信息 - in Advances in Neural Information Processing

in Advances in Neural Information Processing

A significant limitation of neural networks is that the representations they learn are usually incomprehensible to humans. We present a novel algorithm, TREPAN, for extracting comprehensible, symbolic representations from trained neural networks. Our algorithm uses queries to induce a decision tree that approximates the concept represented by a given network. Our experiments demonstrate that TREPAN is able to produce decision trees that maintain a high level of fidelity to their respective networks while being comprehensible and accurate. Unlike previous work in this area, our algorithm is general in its applicability and scales well to large networks and problems with high-dimensional input spaces.

Jude W. Shavlik | Mark W. Craven | J. Shavlik

[1] M. Kendall. Probability and Statistical Inference , 1956, Nature.

[2] Robert V. Hogg,et al. Probability and Statistical Inference , 1978, An R Companion for the Third Edition of The Fundamentals of Political Science Research.

[3] R. Nakano,et al. Medical diagnostic expert system based on PDP model , 1988, IEEE 1988 International Conference on Neural Networks.

[4] P. J. Green,et al. Density Estimation for Statistics and Data Analysis , 1987 .

[5] Yoichi Hayashi,et al. A Neural Expert System with Automated Extraction of Fuzzy If-Then Rules , 1990, NIPS.

[6] Michael C. Mozer,et al. Rule Induction through Integrated Symbolic and Subsymbolic Processing , 1991, NIPS.

[7] M. Pazzani,et al. ID2-of-3: Constructive Induction of M-of-N Concepts for Discriminators in Decision Trees , 1991 .

[8] LiMin Fu,et al. Rule Learning by Searching on Adapted Nets , 1991, AAAI.

[9] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[10] Jude W. Shavlik,et al. Learning Symbolic Rules Using Artificial Neural Networks , 1993, ICML.

[11] Ishwar K. Sethi,et al. Extraction of diagnostic rules using neural networks , 1993, [1993] Computer-Based Medical Systems-Proceedings of the Sixth Annual IEEE Symposium.

[12] Mark Craven,et al. Learning to predict reading frames in E. coli DNA sequences , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[13] Stephen I. Gallant,et al. Neural network learning and expert systems , 1993 .

[14] Sebastian Thrun,et al. Extracting Rules from Artifical Neural Networks with Distributed Representations , 1994, NIPS.

[15] Michael C. Mozer,et al. Template-Based Algorithms for Connectionist Rule Extraction , 1994, NIPS.

[16] Jude W. Shavlik,et al. Using Sampling and Queries to Extract Rules from Trained Neural Networks , 1994, ICML.

[17] Jean-Gabriel Ganascia,et al. A Bayesian Framework to Integrate Symbolic and Neural Learning , 1994, ICML.

[18] Huan Liu,et al. Understanding Neural Networks via Rule Extraction , 1995, IJCAI.