Advanced linguistic explanations of classifier decisions for users' annotation support

We propose several new concepts for providing enhanced explanations of classifier decisions in linguistic (human readable) form. These are intended to help operators to better understand the decision process and support them during sample annotation to improve their certainty and consistency in successive labeling cycles. This is expected to lead to better, more consistent data sets (streams) for use in training and updating classifiers. The enhanced explanations are composed of 1) grounded reasons for classification decisions, represented as linguistically readable fuzzy rules, 2) a classifier's level of uncertainty in relation to its decisions and possible alternative suggestions, 3) the degree of novelty of current samples and 4) the levels of impact of the input features on the current classification response. The last of these are also used to reduce the lengths of the rules to a maximum of 3 to 4 antecedent parts to ensure readability for operators and users. The proposed techniques were embedded within an annotation GUI and applied to a real-world application scenario from the field of visual inspection. The usefulness of the proposed linguistic explanations was evaluated based on experiments conducted with six operators. The results indicate that there is approximately an 80% chance that operator/ user labeling behavior improves significantly when enhanced linguistic explanations are provided, whereas this chance drops to 10% when only the classifier responses are shown.

[1]  Lihong Li,et al.  Unbiased online active learning in data streams , 2011, KDD.

[2]  Edwin Lughofer,et al.  Reliable All-Pairs Evolving Fuzzy Classifiers , 2013, IEEE Transactions on Fuzzy Systems.

[3]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[4]  Hung T. Nguyen,et al.  Theoretical aspects of fuzzy control , 1995 .

[5]  Motoaki Kawanabe,et al.  How to Explain Individual Classification Decisions , 2009, J. Mach. Learn. Res..

[6]  Edwin Lughofer,et al.  Assessment of the influence of adaptive components in trainable surface inspection systems , 2010, Machine Vision and Applications.

[7]  Jaume Amores,et al.  Multiple instance classification: Review, taxonomy and comparative study , 2013, Artif. Intell..

[8]  Vice President,et al.  An Introduction to Expert Systems , 1989 .

[9]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[10]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[11]  Plamen P. Angelov,et al.  Evolving Fuzzy-Rule-Based Classifiers From Data Streams , 2008, IEEE Transactions on Fuzzy Systems.

[12]  Plamen P. Angelov,et al.  Evolving classification of agents’ behaviors: a general approach , 2010, Evol. Syst..

[13]  Edwin Lughofer,et al.  Learning in Non-Stationary Environments , 2012 .

[14]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[15]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[16]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[17]  Mark Last,et al.  Predictive Maintenance with Multi-target Classification Models , 2010, ACIIDS.

[18]  János Abonyi,et al.  Learning Fuzzy Classification Rules from Data , 2001 .

[19]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[20]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[21]  Chun-Houh Chen,et al.  Handbook of Data Visualization , 2016 .

[22]  Shan Ling Pan,et al.  Improved SOM Labeling Methodology for Data Mining Applications , 2008, Soft Computing for Knowledge Discovery and Data Mining.

[23]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[24]  Plamen P. Angelov,et al.  PANFIS: A Novel Incremental Learning Machine , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Ludmila I. Kuncheva,et al.  Fuzzy Classifier Design , 2000, Studies in Fuzziness and Soft Computing.

[26]  Edwin Lughofer,et al.  Single-pass active learning with conflict and ignorance , 2012, Evolving Systems.

[27]  Edwin Lughofer,et al.  Evolving Fuzzy Systems - Methodologies, Advanced Concepts and Applications , 2011, Studies in Fuzziness and Soft Computing.

[28]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[29]  K. Ming Leung,et al.  Learning Vector Quantization , 2017, Encyclopedia of Machine Learning and Data Mining.

[30]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[31]  Eyke Hüllermeier,et al.  Learning valued preference structures for solving classification problems , 2008, Fuzzy Sets Syst..

[32]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation (3rd Edition) , 2007 .

[33]  T. Kohonen,et al.  Statistical pattern recognition with neural networks: benchmarking studies , 1988, IEEE 1988 International Conference on Neural Networks.

[34]  Edwin Lughofer,et al.  Integrating new classes on the fly in evolving fuzzy classifier designs and their application in visual inspection , 2015, Appl. Soft Comput..

[35]  Plamen P. Angelov,et al.  Creating Evolving User Behavior Profiles Automatically , 2012, IEEE Transactions on Knowledge and Data Engineering.

[36]  Marko Robnik-Sikonja,et al.  Explaining Classifications For Individual Instances , 2008, IEEE Transactions on Knowledge and Data Engineering.

[37]  U. Rajendra Acharya,et al.  Machine Learning in Healthcare Informatics , 2013, Machine Learning in Healthcare Informatics.

[38]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[39]  Edwin Lughofer,et al.  On improving performance of surface inspection systems by online active learning and flexible classifier updates , 2015, Machine Vision and Applications.

[40]  Edwin Lughofer,et al.  Human–Machine Interaction Issues in Quality Control Based on Online Image Classification , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[41]  Giovanna Castellano,et al.  Distinguishability quantification of fuzzy sets , 2007, Inf. Sci..

[42]  János Abonyi,et al.  Learning fuzzy classification rules from labeled data , 2003, Inf. Sci..

[43]  Hisao Ishibuchi,et al.  A weighted fuzzy classifier and its application to image processing tasks , 2007, Fuzzy Sets Syst..

[44]  E. Lughofer,et al.  Evolving fuzzy classifiers using different model architectures , 2008, Fuzzy Sets Syst..

[45]  Philip D. Wasserman,et al.  Advanced methods in neural computing , 1993, VNR computer library.

[46]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[47]  Chuen-Chien Lee FUZZY LOGIC CONTROL SYSTEMS: FUZZY LOGIC CONTROLLER - PART I , 1990 .

[48]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[49]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[50]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[51]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[52]  Edwin Lughofer,et al.  Learning in Non-Stationary Environments: Methods and Applications , 2012 .