Beneficial and harmful explanatory machine learning

Given the recent successes of Deep Learning in AI, there has been increased interest in the role of, and need for, explanations in machine-learned theories. A distinct notion in this context is Michie's definition of Ultra-Strong Machine Learning (USML). USML is demonstrated by a measurable increase in human performance on a task after the human is provided with a symbolic machine-learned theory for performing that task. A recent paper demonstrates the beneficial effect of a machine-learned logic theory for a classification task, yet no existing work has examined the potential harmfulness of a machine's involvement in human learning. This paper investigates the explanatory effects of a machine-learned theory in the context of simple two-person games and proposes a framework, based on the Cognitive Science literature, for identifying the harmfulness of machine explanations. The approach involves a cognitive window consisting of two quantifiable bounds and is supported by empirical evidence collected from human trials. Our quantitative and qualitative results indicate that human learning aided by a symbolic machine-learned theory that satisfies the cognitive window achieves significantly higher performance than human self-learning. The results also demonstrate that human learning aided by a symbolic machine-learned theory that fails to satisfy this window leads to significantly worse performance than unaided human learning.
