ExOpaque: A Framework to Explain Opaque Machine Learning Models Using Inductive Logic Programming

In this paper we developed an Inductive Logic Programming (ILP) based framework ExOpaque that is able to extract a set of Horn clauses from an arbitrary opaque machine learning model, to describe the behavior of the opaque model with high fidelity while maintaining the simplicity of the Horn clauses for human interpretations.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[3]  Sholom M. Weiss,et al.  Rule-based Machine Learning Methods for Functional Prediction , 1995, J. Artif. Intell. Res..

[4]  Omar Farooq,et al.  Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition , 2002, INTERSPEECH.

[5]  Stephen Muggleton,et al.  Efficient Induction of Logic Programs , 1990, ALT.

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[7]  Dominique Genoud,et al.  POLYCOST: A telephone-speech database for speaker recognition , 2000, Speech Commun..

[8]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[9]  Rich Caruana,et al.  Model compression , 2006, KDD '06.

[10]  Stephen Muggleton,et al.  Inverse entailment and progol , 1995, New Generation Computing.

[11]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[12]  Stephen Muggleton,et al.  The Effect of Relational Background Knowledge on Learning of Protein Three-Dimensional Fold Signatures , 2001, Machine Learning.

[13]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[14]  Nikos Fakotakis,et al.  Wavelet Packets Based Speaker Verification , 2004 .

[15]  Stefan Wrobel,et al.  Scalability Issues in Inductive Logic Programming , 1998, ALT.

[16]  D.E.D. Torres,et al.  Extracting trees from trained SVM models using a TREPAN based approach , 2005 .

[17]  Luc De Raedt,et al.  Probabilistic Inductive Logic Programming , 2004, Probabilistic Inductive Logic Programming.

[18]  Jude W. Shavlik,et al.  in Advances in Neural Information Processing , 1996 .

[19]  Stephen Muggleton,et al.  Application of abductive ILP to learning metabolic network inhibition from temporal data , 2006, Machine Learning.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[22]  Francesco Bergadano,et al.  Inductive Logic Programming: From Machine Learning to Software Engineering , 1995 .

[23]  Sholom M. Weiss,et al.  Solving regression problems with rule-based ensemble classifiers , 2001, KDD '01.

[24]  A. Walden,et al.  Wavelet Methods for Time Series Analysis , 2000 .

[25]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[26]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[27]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[28]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .