On Pearl’s Hierarchy and the Foundations of Causal Inference

Cause-and-effect relationships play a central role in how we perceive and make sense of the world around us, how we act upon it, and ultimately, how we under­ stand ourselves. Almost two decades ago, computer scientist Judea Pearl made a breakthrough in understanding causality by discovering and systematically study­ ing the “Ladder of Causation,” a framework that highlights the distinct roles of seeing, doing, and imagining. In honor of this landmark discovery, we name this the Pearl Causal Hierarchy (PCH). In this chapter, we develop a novel and comprehensive treatment of the PCH through two complementary lenses: one logical-probabilistic and another inferential-graphical. Following Pearl’s own pre­ sentation of the hierarchy, we begin by showing how the PCH organically emerges from a well-specified collection of causal mechanisms (a structural causal model, or SCM). We then turn to the logical lens. Our first result, the Causal Hierarchy Theorem (CHT), demonstrates that the three layers of the hierarchy almost always separate in a measure-theoretic sense. Roughly speaking, the CHT says that data at one layer virtually always underdetermines information at higher layers. As in most practical settings the scientist does not have access to the precise form of the underlying causal mechanisms—only to data generated by them with respect to some of the PCH’s layers—this motivates us to study inferences within the PCH through the graphical lens. Specifically, we explore a set of methods known as causal inference that enable inferences bridging the PCH’s layers given a partial 510 Chapter 27 On Pearl’s Hierarchy and the Foundations of Causal Inference specification of the SCM. For instance, one may want to infer what would happen had an intervention been performed in the environment (second-layer statement) when only passive observations (first-layer data) are available. We introduce a fam­ ily of graphical models that allows the scientist to represent such a partial speci­ fication of the SCM in a cognitively meaningful and parsimonious way. Finally, we investigate an inferential system known as do-calculus, showing how it can be suf­ ficient, and in many cases necessary, to allow inferences across the PCH’s layers. We believe that connecting with the essential dimensions of human experience as delineated by the PCH is a critical step toward creating the next generation of arti­ ficial intelligence (AI) systems that will be safe, robust, human-compatible, and aligned with the social good. 27.

[1]  Thomas F. Icard,et al.  Calibrating generative models: The probabilistic Chomsky–Schützenberger hierarchy , 2020 .

[2]  Michael I. Jordan Graphical Models , 2018, Biomarker Analysis in Clinical Trials with R.

[3]  Thomas Icard,et al.  Probabilistic Reasoning across the Causal Hierarchy , 2020, AAAI.

[4]  B. Schölkopf,et al.  Causality for Machine Learning , 2019, Probabilistic and Causal Inference.

[5]  Thomas Icard,et al.  On Open-Universe Causal Reasoning , 2019, UAI.

[6]  J. Pearl,et al.  Note on ‘‘Generalizability of Study Results‘‘ , 2019, Epidemiology.

[7]  Thomas Icard,et al.  On the Conditional Logic of Simulation Models , 2018, IJCAI.

[8]  Elias Bareinboim,et al.  Fairness in Decision-Making - The Causal Explanation Formula , 2018, AAAI.

[9]  Bernhard Schölkopf,et al.  Elements of Causal Inference: Foundations and Learning Algorithms , 2017 .

[10]  Robert H. Strotz,et al.  Recursive versus non-recursive systems: An attempt at a synthesis , 2017 .

[11]  Jonathan Cottrell Ideas, Evidence, and Method: Hume's Skepticism and Naturalism Concerning Knowledge and Causation , 2017 .

[12]  Elias Bareinboim,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[13]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[14]  Jiji Zhang A Lewisian Logic of Causal Counterfactuals , 2013, Minds and Machines.

[15]  Elias Bareinboim,et al.  Causal Inference by Surrogate Experiments: z-Identifiability , 2012, UAI.

[16]  Judea Pearl,et al.  Aspects of Graphical Models Connected with Causality , 2011 .

[17]  Elias Bareinboim,et al.  Local Characterizations of Causal Bayesian Networks , 2011, GKR.

[18]  J. Pearl The Mediation Formula: A Guide to the Assessment of Causal Pathways in Nonlinear Models , 2011 .

[19]  Joseph Y. Halpern FROM CAUSAL MODELS TO COUNTERFACTUAL STRUCTURES , 2010, The Review of Symbolic Logic.

[20]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[21]  Derek C. Penn,et al.  Causal cognition in human and nonhuman animals: a comparative, critical review. , 2007, Annual review of psychology.

[22]  Marco Valtorta,et al.  Identifiability in Causal Bayesian Networks: A Sound and Complete Algorithm , 2006, AAAI.

[23]  Judea Pearl,et al.  Identification of Joint Interventional Distributions in Recursive Semi-Markovian Causal Models , 2006, AAAI.

[24]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[25]  Jin Tian,et al.  On the Testable Implications of Causal Models with Hidden Variables , 2002, UAI.

[26]  Jin Tian,et al.  A general identification condition for causal effects , 2002, AAAI/IAAI.

[27]  D. Allen Making things happen. , 2000, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[28]  Manabu Kuroki,et al.  IDENTIFIABILITY CRITERIA FOR CAUSAL EFFECTS OF JOINT INTERVENTIONS , 1999 .

[29]  Joseph Y. Halpern Axiomatizing Causal Reasoning , 1998, UAI.

[30]  J. Pearl Causal diagrams for empirical research , 1995 .

[31]  James M. Robins,et al.  Probabilistic evaluation of sequential plans from causal models with hidden variables , 1995, UAI.

[32]  Judea Pearl,et al.  Testing Identifiability of Causal Effects , 1995, UAI.

[33]  D. Braddon-Mitchell NATURE'S CAPACITIES AND THEIR MEASUREMENT , 1991 .

[34]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[35]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[36]  Ronald Fagin,et al.  A logic for reasoning about probabilities , 1988, [1988] Proceedings. Third Annual Information Symposium on Logic in Computer Science.

[37]  Robert L. Shook,et al.  The Book of Why , 1983, Journal of MultiDisciplinary Evaluation.

[38]  J. Mackie,et al.  The cement of the universe : a study of causation , 1977 .

[39]  Larry J. Stockmeyer,et al.  The Polynomial-Time Hierarchy , 1976, Theor. Comput. Sci..

[40]  Daniel Thalmann,et al.  Autonomy , 2005, SIGGRAPH Courses.

[41]  Noam Chomsky,et al.  On Certain Formal Properties of Grammars , 1959, Inf. Control..

[42]  J. I The Design of Experiments , 1936, Nature.

[43]  Elias Bareinboim,et al.  General Identifiability with Arbitrary Surrogate Experiments , 2019, UAI.

[44]  Bernhard Schölkopf,et al.  Causal Consistency of Structural Equation Models , 2017, UAI.

[45]  J. Böhnke Explanation in causal inference: Methods for mediation and interaction. , 2016, Quarterly journal of experimental psychology.

[46]  Rachael Briggs,et al.  Interventionist counterfactuals , 2012 .

[47]  Gavin Brown,et al.  Uncertainty in Artificial Intelligence , 2010, UAI 2011.

[48]  J. Bennett,et al.  Enquiry Concerning Human Understanding , 2010 .

[49]  D. Rubin,et al.  Matched Sampling for Causal Effects: The Central Role of the Propensity Score in Observational Studies for Causal Effects , 2006 .

[50]  F. Keil,et al.  Explanation and understanding , 2015 .

[51]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[52]  Patrick Suppes,et al.  When are probabilistic explanations possible? , 2005, Synthese.

[53]  J. Pearl Bayesianism and Causality, or, Why I am Only a Half-Bayesian , 2001 .

[54]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[55]  J. Pearl,et al.  An Axiomatic Characterization of Causal Counterfactuals , 1998 .

[56]  D. Nute,et al.  Counterfactuals , 1992 .

[57]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[58]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[59]  H. Simon,et al.  Causal Ordering and Identifiability , 1977 .

[60]  David Hume A Treatise of Human Nature: Being an Attempt to introduce the experimental Method of Reasoning into Moral Subjects , 1972 .

[61]  Evert W. Beth,et al.  On Padoa’s Method in the Theory of Definition , 1953 .

[62]  T. Haavelmo The Statistical Implications of a System of Simultaneous Equations , 1943 .

[63]  J. Locke An Essay concerning Human Understanding , 1924, Nature.