Psychological Foundations of Explainability and Interpretability in Artificial Intelligence

In this paper, we make the case that interpretability and explainability are distinct requirements for machine learning systems. To make this case, we provide an overview of the literature in experimental psychology pertaining to interpretation (especially of numerical stimuli) and comprehension. We find that interpretation refers to the ability to contextualize a model’s output in a manner that relates it to the system’s designed functional purpose, and the goals, values, and preferences of end users. In contrast, explanation refers to the ability to accurately describe the mechanism, or implementation, that led to an algorithm’s output, often so that the algorithm can be improved in some way. Beyond these definitions, our review shows that humans differ from one another in systematic ways, that affect the extent to which they prefer to make decisions based on detailed explanations versus less precise interpretations. These individual differences, such as personality traits and skills, are associated with their abilities to derive meaningful interpretations from precise explanations of model output. This implies that system output should be tailored to different types of users.

[1]  Timothy Baldwin,et al.  Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality , 2014, EACL.

[2]  A. Tutt An FDA for Algorithms , 2016 .

[3]  Chris Russell,et al.  Explaining Explanations in AI , 2018, FAT.

[4]  C. Coping with complexity , 2006, Nature.

[5]  Haiyi Zhu,et al.  Explaining Decision-Making Algorithms through UI: Strategies to Help Non-Expert Stakeholders , 2019, CHI.

[6]  David A. Broniatowski Building the tower without climbing it: Progress in engineering systems , 2018, Syst. Eng..

[7]  Morton Ann Gernsbacher,et al.  The Structure-Building Framework: What it is, what it might also be, and why. , 1995 .

[8]  Ankur Taly,et al.  Explainable machine learning in deployment , 2020, FAT*.

[9]  David A. Broniatowski,et al.  The Role of Individual User Differences in Interpretable and Explainable Machine Learning Systems , 2020, ArXiv.

[10]  Gary Klein,et al.  Metrics for Explainable AI: Challenges and Prospects , 2018, ArXiv.

[11]  V. Reyna,et al.  The science of false memory , 2005 .

[12]  P. Ubel,et al.  Measuring Numeracy without a Math Test: Development of the Subjective Numeracy Scale , 2007, Medical decision making : an international journal of the Society for Medical Decision Making.

[13]  Tim Miller,et al.  Explainable AI: Beware of Inmates Running the Asylum Or: How I Learnt to Stop Worrying and Love the Social and Behavioural Sciences , 2017, ArXiv.

[14]  T. Lombrozo The structure and function of explanations , 2006, Trends in Cognitive Sciences.

[15]  F. Mathy,et al.  What’s magic about magic numbers? Chunking and data compression in short-term memory , 2012, Cognition.

[16]  Gary Klein,et al.  Explaining Explanation, Part 3: The Causal Landscape , 2018, IEEE Intelligent Systems.

[17]  A. Tversky,et al.  Advances in prospect theory: Cumulative representation of uncertainty , 1992 .

[18]  David A. Broniatowski,et al.  Germs Are Germs, and Why Not Take a Risk? Patients’ Expectations for Prescribing Antibiotics in an Inner-City Emergency Department , 2015, Medical decision making : an international journal of the Society for Medical Decision Making.

[19]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[20]  C. K. Mertz,et al.  PSYCHOLOGICAL SCIENCE Research Article Numeracy and Decision Making , 2022 .

[21]  Russ Abbott,et al.  Emergence explained: Abstractions: Getting epiphenomena to do real work , 2006, Complex..

[22]  V. Reyna,et al.  Fuzzy-trace theory and false memory: new frontiers. , 1998, Journal of experimental child psychology.

[23]  Cynthia Rudin,et al.  Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead , 2018, Nature Machine Intelligence.

[24]  Melissa L. Finucane,et al.  Risk as Analysis and Risk as Feelings: Some Thoughts about Affect, Reason, Risk, and Rationality , 2004, Risk analysis : an official publication of the Society for Risk Analysis.

[25]  Johannes Gehrke,et al.  Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[26]  Timothy O'Connor,et al.  A Companion to the Philosophy of Action , 2010 .

[27]  Gary Klein,et al.  Explaining Explanation, Part 1: Theoretical Foundations , 2017, IEEE Intelligent Systems.

[28]  William J. Clancey,et al.  Explaining Explanation, Part 4: A Deep Dive on Deep Nets , 2018, IEEE Intelligent Systems.

[29]  V. Reyna,et al.  Fuzzy-trace theory: dual processes in memory, reasoning, and cognitive neuroscience. , 2001, Advances in child development and behavior.

[30]  T. Trabasso Causal Cohesion and Story Coherence. , 1982 .

[31]  M. Yuan,et al.  Dimension reduction and parameter estimation for additive index models , 2010 .

[32]  Lynn Hasher,et al.  Is memory schematic , 1983 .

[33]  Gary Klein,et al.  Explaining Explanation, Part 2: Empirical Foundations , 2017, IEEE Intelligent Systems.

[34]  V. Reyna Of Viruses, Vaccines, and Variability: Qualitative Meaning Matters , 2020, Trends in Cognitive Sciences.

[35]  W. Shadish,et al.  Experimental and Quasi-Experimental Designs for Generalized Causal Inference , 2001 .

[36]  Murray Singer,et al.  Veridical and false memory for text: A multiprocess analysis ☆ , 2008 .

[37]  Marlone D. Henderson,et al.  There Are Many Ways to See the Forest for the Trees , 2013, Perspectives on psychological science : a journal of the Association for Psychological Science.

[38]  N. Pennington,et al.  The story model for juror decision making , 1993 .

[39]  Sameer Singh,et al.  Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods , 2020, AIES.

[40]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[41]  V. Reyna,et al.  Individual Differences in Numeracy and Cognitive Reflection, with Implications for Biases and Fallacies in Probability Judgment. , 2012, Journal of behavioral decision making.

[42]  Michael Gilead,et al.  Above and beyond the concrete: The diverse representational substrates of the predictive brain , 2019, Behavioral and Brain Sciences.

[43]  D. Kahneman,et al.  Heuristics and Biases: The Psychology of Intuitive Judgment , 2002 .

[44]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[45]  H. E. Pople,et al.  Internist-I, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine , 1982 .

[46]  Jacob Feldman,et al.  Conceptual complexity and the bias/variance tradeoff , 2011, Cognition.

[47]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.

[48]  David A. Broniatowski,et al.  Categorical Risk Perception Drives Variability in Antibiotic Prescribing in the Emergency Department: A Mixed Methods Observational Study , 2017, Journal of General Internal Medicine.

[49]  Kenneth R. Hammond,et al.  Coherence and correspondence theories in judgment and decision making. , 2000 .

[50]  David A. Broniatowski,et al.  Does gist drive NASA experts’ design decisions? , 2020, Syst. Eng..

[51]  Zebin Yang,et al.  Enhancing Explainability of Neural Networks Through Architecture Constraints , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[52]  James Shanteau,et al.  Emerging Perspectives on Judgment and Decision Research , 2003 .

[53]  Jun Zhao,et al.  'It's Reducing a Human Being to a Percentage': Perceptions of Justice in Algorithmic Decisions , 2018, CHI.

[54]  Joshua A. Kroll The fallacy of inscrutability , 2018, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[55]  Niklas Elmqvist,et al.  The human touch: How non-expert users perceive, interpret, and fix topic models , 2017, Int. J. Hum. Comput. Stud..

[56]  Margo I. Seltzer,et al.  Scalable Bayesian Rule Lists , 2016, ICML.

[57]  G. Klein,et al.  Decision Making in Action: Models and Methods , 1993 .

[58]  Xiangen Hu,et al.  The development and analysis of tutorial dialogues in AutoTutor Lite , 2013, Behavior research methods.

[59]  Konstantinos V. Katsikopoulos,et al.  Decision Methods for Design: Insights from Psychology , 2012 .

[60]  R. Axelrod Structure of decision : the cognitive maps of political elites , 2015 .

[61]  David A. Broniatowski,et al.  Gist and Verbatim in Narrative Memory , 2013, CMN.

[62]  Daniel G. Goldstein,et al.  Manipulating and Measuring Model Interpretability , 2018, CHI.

[63]  Jie Chen,et al.  Explainable Neural Networks based on Additive Index Models , 2018, ArXiv.

[64]  Lalana Kagal,et al.  Explaining Explanations: An Overview of Interpretability of Machine Learning , 2018, 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA).

[65]  David A. Broniatowski,et al.  A Formal Model of Fuzzy-Trace Theory: Variations on Framing Effects and the Allais Paradox , 2018, Decision.

[66]  David A. Broniatowski,et al.  Assessing causal claims about complex engineered systems with quantitative data: internal, external, and construct validity , 2017, Syst. Eng..

[67]  J. J. Diehl,et al.  Story Recall and Narrative Coherence of High-Functioning Children with Autism Spectrum Disorders , 2006, Journal of abnormal child psychology.

[68]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[69]  Andrew W. Martin,et al.  What a Story? , 2017, Speaking through the Mask.

[70]  Evan A. Wilhelms,et al.  Gist Representations and Communication of Risks about HIV-AIDS: A Fuzzy-Trace Theory Approach. , 2015, Current HIV research.

[71]  Paul van den Broek,et al.  Using Texts in Science Education: Cognitive Processes and Knowledge Representation , 2010 .

[72]  V. Reyna A new intuitionism: Meaning, memory, and development in Fuzzy-Trace Theory. , 2012, Judgment and decision making.

[73]  C. F. Kao,et al.  The efficient assessment of need for cognition. , 1984, Journal of personality assessment.

[74]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[75]  A. Berztiss,et al.  Requirements Engineering , 2002, J. Object Technol..

[76]  V. Reyna,et al.  Fuzzy-Trace Theory and False Memory , 2002 .

[77]  I. Nonaka,et al.  SECI, Ba and Leadership: a Unified Model of Dynamic Knowledge Creation , 2000 .

[78]  M. Polanyi,et al.  Tacit Knowing: Its Bearing on Some Problems of Philosophy , 1962 .

[79]  J. Reidenberg,et al.  Accountable Algorithms , 2016 .

[80]  Michelle Gaddy Everson,et al.  Effects of Causal Text Revisions on More- and Less-Skilled Readers' Comprehension of Easy and Difficult Texts , 2000 .

[81]  Konstantinos V. Katsikopoulos,et al.  Coherence and correspondence in engineering design: informing the conversation and connecting with judgment and decision-making research , 2009, Judgment and Decision Making.

[82]  T. Trabasso,et al.  Causal thinking and the representation of narrative events , 1985 .

[83]  Valerie F Reyna,et al.  Fuzzy‐Trace Theory, Risk Communication, and Product Labeling in Sexually Transmitted Diseases , 2003, Risk analysis : an official publication of the Society for Risk Analysis.

[84]  Jon M. Kleinberg,et al.  Simplicity Creates Inequity: Implications for Fairness, Stereotypes, and Interpretability , 2018, EC.

[85]  Evan A. Wilhelms,et al.  Influence of Explanatory Images on Risk Perceptions and Treatment Preference , 2018, Arthritis care & research.

[86]  Christopher L. Magee,et al.  Engineering Systems: Meeting Human Needs in a Complex Technological World , 2011 .

[87]  K. Fujita,et al.  Psychological distance can improve decision making under information overload via gist memory. , 2013, Journal of experimental psychology. General.

[88]  Jens Rasmussen,et al.  The role of hierarchical knowledge representation in decisionmaking and system management , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[89]  Petru Lucian Curşeu Need for cognition and rationality in decision-making , 2006 .

[90]  I. Nonaka A Dynamic Theory of Organizational Knowledge Creation , 1994 .

[91]  M. Gernsbacher,et al.  Investigating differences in general comprehension skill. , 1990, Journal of experimental psychology. Learning, memory, and cognition.

[92]  J. Cacioppo,et al.  DISPOSITIONAL DIFFERENCES IN COGNITIVE MOTIVATION : THE LIFE AND TIMES OF INDIVIDUALS VARYING IN NEED FOR COGNITION , 1996 .

[93]  Amina Adadi,et al.  Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) , 2018, IEEE Access.

[94]  D. Gallo Associative Illusions of Memory: False Memory Research in DRM and Related Tasks , 2006 .

[95]  Valerie F Reyna,et al.  Developmental Reversals in Risky Decision Making , 2014, Psychological science.

[96]  A. Tversky,et al.  The framing of decisions and the psychology of choice. , 1981, Science.

[97]  Rebecca K. Helm,et al.  From Meaning to Money: Translating Injury Into Dollars , 2018, Law and human behavior.

[98]  Freddy Lécué,et al.  Explainable AI: The New 42? , 2018, CD-MAKE.

[99]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[100]  T. Trabasso,et al.  Constructing inferences during narrative text comprehension. , 1994, Psychological review.

[101]  V. Reyna CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE How People Make Decisions That Involve Risk A Dual-Processes Approach , 2022 .

[102]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[103]  J. Feldman,et al.  Bayes and the Simplicity Principle in Perception Simplicity versus Likelihood Principles in Perception , 2022 .

[104]  J. R. Thomas,et al.  Relation of knowledge and performance in boys' tennis: age and expertise. , 1989, Journal of experimental child psychology.

[105]  J. Murphy The General Data Protection Regulation (GDPR) , 2018, Irish medical journal.

[106]  P. Bauer,et al.  Coherence of Personal Narratives Across the Lifespan: A Multidimensional Model and Coding Method , 2011, Journal of cognition and development : official journal of the Cognitive Development Society.

[107]  V. Reyna,et al.  Risk and Rationality in Adolescent Decision Making , 2006, Psychological science in the public interest : a journal of the American Psychological Society.

[108]  N Moray,et al.  A lattice theory approach to the structure of mental models. , 1990, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[109]  Eldar Shafir,et al.  Deep thoughts and shallow frames; on the susceptibility to framing effects. , 2003 .

[110]  Valerie F. Reyna,et al.  Coherence and Correspondence Criteria for Rationality: Experts' Estimation of Risks of Sexually Transmitted Infections. , 2005 .

[111]  Russ Abbott,et al.  Putting complex systems to work , 2007, Complex..

[112]  Been Kim,et al.  Towards A Rigorous Science of Interpretable Machine Learning , 2017, 1702.08608.

[113]  野中 郁次郎,et al.  The Knowledge-Creating Company: How , 1995 .

[114]  Rolf A. Zwaan,et al.  Situation models in language comprehension and memory. , 1998, Psychological bulletin.

[115]  V. Reyna,et al.  Physician decision making and cardiac risk: effects of knowledge, risk perception, risk tolerance, and fuzzy processing. , 2006, Journal of experimental psychology. Applied.

[116]  V. Reyna,et al.  Fuzzy-trace theory: An interim synthesis , 1995 .

[117]  Galit Shmueli,et al.  To Explain or To Predict? , 2010, 1101.0891.

[118]  Gary Klein,et al.  Naturalistic Decision Making , 2008, Hum. Factors.

[119]  Valerie F Reyna,et al.  Abstraction: An alternative neurocognitive account of recognition, prediction, and decision making , 2020, Behavioral and Brain Sciences.

[120]  Tilmann Betsch,et al.  Framing the framing effect: the impact of context cues on solutions to the ‘Asian disease’ problem , 1998 .

[121]  W. Kintsch The representation of meaning in memory , 1974 .

[122]  Carlos Guestrin,et al.  Anchors: High-Precision Model-Agnostic Explanations , 2018, AAAI.

[123]  C. P. Goodman,et al.  The Tacit Dimension , 2003 .

[124]  David A. Broniatowski,et al.  Do design decisions depend on “dictators”? , 2018, Research in engineering design.

[125]  S. Frederick Journal of Economic Perspectives—Volume 19, Number 4—Fall 2005—Pages 25–42 Cognitive Reflection and Decision Making , 2022 .

[126]  T. Yarkoni,et al.  Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning , 2017, Perspectives on psychological science : a journal of the Association for Psychological Science.

[127]  David A. Broniatowski,et al.  Patients’ and Clinicians’ Perceptions of Antibiotic Prescribing for Upper Respiratory Infections in the Acute Care Setting , 2018, Medical decision making : an international journal of the Society for Medical Decision Making.

[128]  V. Reyna,et al.  A Theory of Medical Decision Making and Health: Fuzzy Trace Theory , 2008, Medical decision making : an international journal of the Society for Medical Decision Making.

[129]  Tim Miller,et al.  Explanation in Artificial Intelligence: Insights from the Social Sciences , 2017, Artif. Intell..

[130]  Valerie F. Reyna,et al.  When Irrational Biases Are Smart: A Fuzzy-Trace Theory of Complex Decision Making , 2018, Journal of Intelligence.

[131]  Michael Veale,et al.  Enslaving the Algorithm: From a “Right to an Explanation” to a “Right to Better Decisions”? , 2018, IEEE Security & Privacy.

[132]  Y. Trope,et al.  Construal-level theory of psychological distance. , 2010, Psychological review.

[133]  Anne Leitch,et al.  Mental models: an interdisciplinary synthesis of theory and methods , 2011 .