Interpretable Machine Learning: Convolutional Neural Networks with RBF Fuzzy Logic Classification Rules

A convolutional neural network (CNN) learning structure is proposed, with added interpretability-oriented layers, in the form of Fuzzy Logic-based rules. This is achieved by creating a classification layer based on a Neural-Fuzzy classifier, and integrating it into the overall learning mechanism within the deep learning structure. Using this new structure, one could extract linguistic Fuzzy Logic-based rules from the deep learning structure directly, which enhances the interpretability of the overall system. The classification layer is realised via a Radial Basis Function (RBF) Neural-Network, that is a direct equivalent of a class of Fuzzy Logic-based systems. In this work, the development of the RBF neural-fuzzy system and its integration into the deep-learning CNN is presented. The proposed hybrid CNN RBF-NF structure can from a fundamental building block, towards building more complex deep-learning structures with Fuzzy Logic-based interpretability. Using simulation results on a benchmark data-driven modelling and classification problem (labelled handwriting digits, MNIST 70000 samples) we show that the proposed learning structure maintains a good level of forecasting/prediction accuracy (> 96% on unseen data) compared to state-of-the-art CNN deep learning structures, while providing linguistic interpretability to the classification layer.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Derek A. Linkens,et al.  A systematic neuro-fuzzy modeling framework with application to material property prediction , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[3]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[5]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[6]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[7]  Tom Schaul,et al.  Unit Tests for Stochastic Optimization , 2013, ICLR.

[8]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11]  Luka Eciolaza,et al.  Spot welding monitoring system based on fuzzy classification and deep learning , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[12]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[13]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[14]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[15]  Razvan Pascanu,et al.  Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[16]  George Panoutsos,et al.  A neural-fuzzy modelling framework based on granular computing: Concepts and applications , 2010, Fuzzy Sets Syst..

[17]  John Lach,et al.  Deepmotion: a deep convolutional neural network on inertial body sensors for gait assessment in multiple sclerosis* , 2016, 2016 IEEE Wireless Health (WH).

[18]  Alessandra Caggiano,et al.  Vibration Sensor Monitoring of Nickel-Titanium Alloy Turning for Machinability Evaluation , 2017, Sensors.

[19]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[20]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.

[21]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[22]  Youyong Kong,et al.  A Hierarchical Fused Fuzzy Deep Neural Network for Data Classification , 2017, IEEE Transactions on Fuzzy Systems.

[23]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).