Towards Interpretable Deep Neural Networks: An Exact Transformation to Multi-Class Multivariate Decision Trees

Deep neural networks (DNNs) are commonly labelled as black-boxes lacking interpretability; thus, hindering human's understanding of DNNs' behaviors. A need exists to generate a meaningful sequential logic for the production of a specific output. Decision trees exhibit better interpretability and expressive power due to their representation language and the existence of efficient algorithms to generate rules. Growing a decision tree based on the available data could produce larger than necessary trees or trees that do not generalise well. In this paper, we introduce two novel multivariate decision tree (MDT) algorithms for rule extraction from a DNN: an Exact-Convertible Decision Tree (EC-DT) and a Deep C-Net algorithm to transform a neural network with Rectified Linear Unit activation functions into a representative tree which can be used to extract multivariate rules for reasoning. While the EC-DT translates the DNN in a layer-wise manner to represent exactly the decision boundaries implicitly learned by the hidden layers of the network, the Deep C-Net inherits the decompositional approach from EC-DT and combines with a C5 tree learning algorithm to construct the decision rules. The results suggest that while EC-DT is superior in preserving the structure and the accuracy of DNN, C-Net generates the most compact and highly effective trees from DNN. Both proposed MDT algorithms generate rules including combinations of multiple attributes for precise interpretation of decision-making processes.

[1]  Manomita Chakraborty,et al.  Rule Extraction from Neural Network Using Input Data Ranges Recursively , 2018, New Generation Computing.

[2]  Johanna D. Moore,et al.  Explanations in knowledge systems: design for explainable expert systems , 1991, IEEE Expert.

[3]  W. Lewis Johnson,et al.  Agents that Learn to Explain Themselves , 1994, AAAI.

[4]  Joshua B. Tenenbaum,et al.  Learning with Hierarchical-Deep Models , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Roziati Zainuddin,et al.  Skin segmentation based on multi pixel color clustering models , 2012, Digit. Signal Process..

[6]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[7]  Trevor Darrell,et al.  Generating Visual Explanations , 2016, ECCV.

[8]  Joydeep Ghosh,et al.  Symbolic Interpretation of Artificial Neural Networks , 1999, IEEE Trans. Knowl. Data Eng..

[9]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[10]  Jude W. Shavlik,et al.  Using Sampling and Queries to Extract Rules from Trained Neural Networks , 1994, ICML.

[11]  LiMin Fu,et al.  Rule Generation from Neural Networks , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[12]  Jude W. Shavlik,et al.  Extracting refined rules from knowledge-based neural networks , 2004, Machine Learning.

[13]  Shogo Nishida,et al.  Implementation and refinement of decision trees using neural networks for hybrid knowledge acquisition , 1995, Artif. Intell. Eng..

[14]  Jude W. Shavlik,et al.  in Advances in Neural Information Processing , 1996 .

[15]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[16]  Hussein A. Abbass,et al.  C-Net: A Method for Generating Non-deterministic and Dynamic Multivariate Decision Trees , 2001, Knowledge and Information Systems.

[17]  Yoichi Hayashi,et al.  Understanding consumer heterogeneity: A business intelligence application of neural networks , 2010, Knowl. Based Syst..

[18]  Fei Tan,et al.  Face detection in complex background based on skin color features and improved AdaBoost algorithms , 2010, 2010 IEEE International Conference on Progress in Informatics and Computing.

[19]  Lars Niklasson,et al.  Evolving decision trees using oracle guides , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[20]  Wojciech Samek,et al.  Explainable AI: Interpreting, Explaining and Visualizing Deep Learning , 2019, Explainable AI.

[21]  Donald C. Wunsch,et al.  Neural network explanation using inversion , 2007, Neural Networks.

[22]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[23]  Madhuri Jha ANN-DT : An Algorithm for Extraction of Decision Trees from Artificial Neural Networks , 2013 .

[24]  Amit Gupta,et al.  Generalized Analytic Rule Extraction for Feedforward Neural Networks , 1999, IEEE Trans. Knowl. Data Eng..

[25]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[26]  Michael van Lent,et al.  An Explainable Artificial Intelligence System for Small-unit Tactical Behavior , 2004, AAAI.

[27]  A. E. Amin,et al.  A novel classification model for cotton yarn quality based on trained neural network using genetic algorithm , 2013, Knowl. Based Syst..

[28]  Nelson F. F. Ebecken,et al.  Extracting rules from multilayer perceptrons in classification problems: A clustering-based approach , 2006, Neurocomputing.

[29]  Carla E. Brodley,et al.  Multivariate decision trees , 2004, Machine Learning.

[30]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[31]  J. Ross Quinlan,et al.  Generating Production Rules from Decision Trees , 1987, IJCAI.

[32]  Yoichi Hayashi,et al.  Greedy rule generation from discrete data and its use in neural network rule extraction , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[33]  Huan Liu,et al.  Symbolic Representation of Neural Networks , 1996, Computer.

[34]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[35]  Vineeth N. Balasubramanian,et al.  Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[36]  Michael C. Mozer,et al.  The Connectionist Scientist Game: Rule Extraction and Refinement in a Neural Network , 1991 .

[37]  Mark Craven,et al.  Extracting comprehensible models from trained neural networks , 1996 .

[38]  Simon Kasif,et al.  A System for Induction of Oblique Decision Trees , 1994, J. Artif. Intell. Res..

[39]  Masumi Ishikawa Rule extraction by successive regularization , 2000, Neural Networks.

[40]  Xiang Li,et al.  An enhanced deep feature representation for person re-identification , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[41]  Michael C. Mozer,et al.  Template-based procedures for neural network interpretation , 1999, Neural Networks.

[42]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[43]  Hui Cheng,et al.  Multimedia event recounting with concept based representation , 2012, ACM Multimedia.

[44]  P. Peer,et al.  Human skin color clustering for face detection , 2003, The IEEE Region 8 EUROCON 2003. Computer as a Tool..

[45]  Yi Yang,et al.  DevNet: A Deep Event Network for multimedia event detection and evidence recounting , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Fei Wang,et al.  Efficient tree classifiers for large scale datasets , 2018, Neurocomputing.

[47]  Olcay Boz,et al.  Extracting decision trees from trained neural networks , 2002, KDD.

[48]  J. R. Quinlan,et al.  Data Mining Tools See5 and C5.0 , 2004 .

[49]  R. Kennedy,et al.  Defense Advanced Research Projects Agency (DARPA). Change 1 , 1996 .

[50]  Serge N. Demidenko,et al.  Multivariate alternating decision trees , 2016, Pattern Recognit..

[51]  Vladimir Vezhnevets,et al.  A Survey on Pixel-Based Skin Color Detection Techniques , 2003 .

[52]  Shlomo Geva,et al.  Rule extraction from local cluster neural nets , 2002, Neurocomputing.

[53]  Amina Adadi,et al.  Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) , 2018, IEEE Access.

[54]  Huan Liu,et al.  NeuroLinear: From neural networks to oblique decision rules , 1997, Neurocomputing.

[55]  Terrence J. Sejnowski,et al.  The Computational Brain , 1996, Artif. Intell..