A Full Explanation Facility for a MLP Network that Classifies Low-Back-Pain Patients and for Predicting its Reliability

This paper presents a full explanation facility developed for any standard MLP network with binary input neurons that performs a classification task. The interpretation of any input case is represented by a non-linear ranked data relationship of key inputs, in both text and graphical forms. The knowledge that the MLP has learned is represented either by average ranked class profiles or by a set of rules induced from all training cases. The facility discovers the MLP knowledge bounds as the hidden layer decision regions that contain classified training examples. A novel input is detected when the input case falls in a decision region outside the knowledge bounds. Results using the facility are presented for a real-world MLP with a 48-dimensional input space that classifies low-back-pain patients. Using the facility, it is shown that the MLP preserves the continuity of the classifications in separate contiguous threads of decision regions across the 48-dimensional input space, thereby demonstrating the consistency and predictability of the classifications within the knowledge bounds.
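The novelty detection described above can be illustrated with a minimal sketch. It is not the facility's actual implementation. It assumes that a hidden layer decision region can be identified by the side of each hidden neuron's hyperplane on which an input falls, and that the knowledge bounds are the set of such regions occupied by training cases. The function names (`hidden_region`, `knowledge_bounds`, `is_novel`), the toy weights, and the three-input network are all hypothetical.

```python
from typing import Sequence, Set, Tuple

Region = Tuple[int, ...]  # one 0/1 flag per hidden neuron


def hidden_region(x: Sequence[int],
                  weights: Sequence[Sequence[float]],
                  biases: Sequence[float]) -> Region:
    """Label the decision region of x by the side of each hidden hyperplane.

    Assumption: a region is identified by the sign of each hidden neuron's
    net input (1 if positive, otherwise 0).
    """
    flags = []
    for w, b in zip(weights, biases):
        net = sum(wi * xi for wi, xi in zip(w, x)) + b
        flags.append(1 if net > 0 else 0)
    return tuple(flags)


def knowledge_bounds(train_inputs: Sequence[Sequence[int]],
                     weights: Sequence[Sequence[float]],
                     biases: Sequence[float]) -> Set[Region]:
    """Collect every region that contains at least one training case."""
    return {hidden_region(x, weights, biases) for x in train_inputs}


def is_novel(x: Sequence[int],
             bounds: Set[Region],
             weights: Sequence[Sequence[float]],
             biases: Sequence[float]) -> bool:
    """An input is novel if its region holds no training case."""
    return hidden_region(x, weights, biases) not in bounds


# Hypothetical toy network: 3 binary inputs, 2 hidden neurons.
W = [[1.0, -1.0, 0.5], [-0.5, 1.0, 1.0]]
B = [-0.25, -0.75]
TRAIN = [(1, 0, 0), (1, 0, 1), (0, 1, 1)]
BOUNDS = knowledge_bounds(TRAIN, W, B)

print(is_novel((0, 1, 0), BOUNDS, W, B))  # unseen case in an occupied region
print(is_novel((1, 1, 1), BOUNDS, W, B))  # case in a region with no training data
```

In this sketch an input that was never seen in training is still treated as familiar when it shares a region with a training case, which is the sense in which the regions, rather than the individual cases, bound the network's knowledge.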
