An Exact Transformation from Deep Neural Networks to Multi-Class Multivariate Decision Trees

Deep neural networks (DNNs) are commonly labelled as black boxes that lack interpretability, which hinders human understanding of their behaviour. There is a need to generate a meaningful sequential logic that explains how a specific output is produced. Decision trees exhibit better interpretability and expressive power owing to their representation language and the existence of efficient algorithms for generating rules. However, growing a decision tree directly from the available data can produce trees that are larger than necessary or that do not generalise well. In this paper, we introduce two novel multivariate decision tree (MDT) algorithms for rule extraction from a DNN: an Exact-Convertible Decision Tree (EC-DT) and a Deep C-Net algorithm. Both transform a neural network with Rectified Linear Unit (ReLU) activation functions into a representative tree from which multivariate rules can be extracted for reasoning. EC-DT translates the DNN in a layer-wise manner to represent exactly the decision boundaries implicitly learned by the hidden layers of the network, whereas Deep C-Net inherits the decompositional approach from EC-DT and combines it with a C5 tree learning algorithm to construct the decision rules. The results suggest that while EC-DT is superior in preserving the structure and the accuracy of the DNN, C-Net generates the most compact and highly effective trees from the DNN. Both proposed MDT algorithms generate rules that combine multiple attributes, allowing precise interpretation of the decision-making process.
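To illustrate why an exact, layer-wise translation of a ReLU network into multivariate rules is possible, the following minimal sketch traces a single input through a small fully connected network. It is not the EC-DT or Deep C-Net algorithm from the paper; it only demonstrates the underlying property that each hidden unit contributes one linear test on the input attributes, and that within the region selected by those tests the network output is an exact affine function of the input. The function names (`forward`, `extract_path`), the weight layout, and the example weights are all assumptions made for illustration.

```python
# Illustrative sketch only (not the paper's EC-DT implementation).
# Assumed layout: `layers` is a list of (W, b) pairs, W is a list of rows,
# hidden layers use ReLU, and the last layer is linear.

def forward(layers, x):
    """Ordinary forward pass through the network."""
    h = list(x)
    for i, (W, b) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, h)) + bi for row, bi in zip(W, b)]
        h = z if i == len(layers) - 1 else [max(0.0, v) for v in z]
    return h

def extract_path(layers, x):
    """Return (rules, A, c) for the linear region that contains x.

    rules: one (coeffs, bias, active) triple per hidden unit, read as
           coeffs . x + bias > 0 when active, and <= 0 otherwise.
    A, c:  the network output equals A x + c everywhere in that region.
    """
    n = len(x)
    # Start from the identity map: the "layer 0" output is x itself.
    A = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    c = [0.0] * n
    rules = []
    for idx, (W, b) in enumerate(layers):
        # Compose the affine map of this layer with the map built so far.
        new_A = [[sum(W[r][k] * A[k][j] for k in range(len(A)))
                  for j in range(n)] for r in range(len(W))]
        new_c = [sum(W[r][k] * c[k] for k in range(len(c))) + b[r]
                 for r in range(len(W))]
        if idx < len(layers) - 1:  # hidden layer: one linear test per unit
            for r in range(len(W)):
                pre = sum(a * v for a, v in zip(new_A[r], x)) + new_c[r]
                active = pre > 0
                rules.append((new_A[r], new_c[r], active))
                if not active:  # an inactive ReLU outputs zero in this region
                    new_A[r] = [0.0] * n
                    new_c[r] = 0.0
        A, c = new_A, new_c
    return rules, A, c

if __name__ == "__main__":
    # Hypothetical 2-2-2 network used only for demonstration.
    layers = [([[1.0, -1.0], [0.5, 1.0]], [0.0, -1.0]),
              ([[1.0, -2.0], [-1.0, 1.0]], [0.5, 0.0])]
    rules, A, c = extract_path(layers, [2.0, 1.0])
    for coeffs, bias, active in rules:
        print(coeffs, bias, ">" if active else "<=", 0)
    print("output = A x + c with A =", A, "and c =", c)
```

Each triple in `rules` is a multivariate test over all input attributes, and the conjunction of the tests along one path plays the role of a root-to-leaf branch in a multivariate tree. Enumerating every reachable activation pattern, rather than following a single input as this sketch does, is what would turn the idea into a full tree.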
