The role of exchangeability in causal inference

The notion of exchangeability has been recognized in the causal inference literature in various guises, but only rarely in the original Bayesian meaning as a symmetry property between individual units in statistical inference. Since the latter is a standard ingredient in Bayesian inference, we argue that in Bayesian causal inference it is natural to link the causal model, including the notion of confounding and definition of causal contrasts of interest, to the concept of exchangeability. Here we relate the Bayesian notion of exchangeability to alternative conditions for unconfounded inferences, commonly stated using potential outcome variables, and define causal contrasts in the presence of exchangeability in terms of limits of posterior predictive expectations for further exchangeable units. We demonstrate that this reasoning also carries over to longitudinal settings where parametric inferences are susceptible to the so-called null paradox. We interpret the paradox in terms of an exchangeability assumption made on too coarse a scale.

[1]  L. M. M.-T. Theory of Probability , 1929, Nature.

[2]  L. J. Savage,et al.  Symmetric measures on Cartesian products , 1955 .

[3]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[4]  M. R. Novick,et al.  The Role of Exchangeability in Inference , 1981 .

[5]  James L. Powell,et al.  A comparison of the Box-Cox maximum likelihood estimator and the non-linear two-stage least squares estimator , 1981 .

[6]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[7]  J M Robins,et al.  Identifiability, exchangeability, and epidemiological confounding. , 1986, International journal of epidemiology.

[8]  P. Holland Statistics and Causal Inference , 1985 .

[9]  B. M. Hill,et al.  Theory of Probability , 1990 .

[10]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[11]  James M. Robins,et al.  Estimation of Effects of Sequential Treatments by Reparameterizing Directed Acyclic Graphs , 1997, UAI.

[12]  J. Pearl,et al.  Confounding and Collapsibility in Causal Inference , 1999 .

[13]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[14]  A. Dawid Causal Inference without Counterfactuals , 2000 .

[15]  J. Robins,et al.  Marginal Structural Models to Estimate the Joint Causal Effect of Nonrandomized Treatments , 2001 .

[16]  D. Lindley Seeing and Doing: the Concept of Causation , 2002 .

[17]  S. Greenland Quantifying Biases in Causal Models: Classical Confounding vs Collider-Stratification Bias , 2003, Epidemiology.

[18]  E. Arjas,et al.  Causal Reasoning from Longitudinal Data * , 2004 .

[19]  Erica Moodie,et al.  Dynamic treatment regimes. , 2004, Clinical trials.

[20]  J. Robins,et al.  Estimating causal effects from epidemiological data , 2006, Journal of Epidemiology and Community Health.

[21]  Juha Karvanen,et al.  Estimation of quantile mixtures via L-moments and trimmed L-moments , 2006, Comput. Stat. Data Anal..

[22]  Siddhartha Chib,et al.  Analysis of treatment response data without the joint distribution of potential outcomes , 2007 .

[23]  Andrew Gelman,et al.  Struggles with survey weighting and regression modeling , 2007, 0710.5005.

[24]  P. McCullagh Sampling bias and logistic models , 2008 .

[25]  Tyler J. VanderWeele,et al.  Marginal Structural Models for the Estimation of Direct and Indirect Effects , 2009, Epidemiology.

[26]  R. Joyce,et al.  1 Mesoamerica : A Working Model for Archaeology , 2003 .

[27]  Stephen R Cole,et al.  The consistency statement in causal inference: a definition or an assumption? , 2009, Epidemiology.

[28]  J. Robins,et al.  Intervening on risk factors for coronary heart disease: an application of the parametric g-formula. , 2009, International journal of epidemiology.

[29]  Tyler J. VanderWeele,et al.  Concerning the consistency assumption in causal inference. , 2009, Epidemiology.

[30]  Judea Pearl,et al.  On the Consistency Rule in Causal Inference: Axiom, Definition, Assumption, or Theorem? , 2010, Epidemiology.

[31]  A. Philip Dawid,et al.  Identifying the consequences of dynamic treatment strategies: A decision-theoretic overview , 2010, ArXiv.

[32]  K. Røysland A martingale approach to continuous-time marginal structural models , 2009, 0901.2593.

[33]  Juha Karvanen,et al.  Secondary Analysis under Cohort Sampling Designs Using Conditional Likelihood , 2012 .

[34]  Sonia Petrone,et al.  Predictive construction of priors in Bayesian nonparametrics , 2012 .

[35]  Elja Arjas,et al.  Causal Inference from Observational Data: A Bayesian Predictive Approach , 2012 .

[36]  S. Greenland Causal Inference as a Prediction Problem: Assumptions, Identification and Evidence Synthesis , 2012 .

[37]  Jessica G. Young,et al.  The parametric g‐formula to estimate the effect of highly active antiretroviral therapy on incident AIDS or death , 2012, Statistics in medicine.

[38]  S. Vansteelandt,et al.  On model selection and model misspecification in causal inference , 2012, Statistical methods in medical research.

[39]  Stuart G. Baker,et al.  Causal inference, probability theory, and graphical insights , 2013, Statistics in medicine.

[40]  Stephen R Cole,et al.  The Parametric g-Formula for Time-to-event Data: Intuition and a Worked Example , 2014, Epidemiology.

[41]  Anne Gégout-Petit,et al.  The stochastic system approach for estimating dynamic treatments effect , 2015, Lifetime Data Analysis.

[42]  Niels Keiding,et al.  Standardization and Control for Confounding in Observational Studies: A Historical Perspective , 2015, 1503.02853.

[43]  J. A. Ferreira,et al.  Some models and methods for the analysis of observational data , 2015 .

[44]  Olli Saarela,et al.  On Bayesian estimation of marginal structural models , 2015, Biometrics.

[45]  James M Robins,et al.  Smoking cessation and long-term weight gain in the Framingham Heart Study: an application of the parametric g-formula for a continuous outcome , 2016, European Journal of Epidemiology.

[46]  Monica Musio,et al.  From Statistical Evidence to Evidence of Causality , 2013, 1311.7513.

[47]  Léo R. Belzile,et al.  A Bayesian view of doubly robust causal inference , 2016, 1701.04093.

[48]  Pekka Martikainen,et al.  Unemployment and subsequent depression: A mediation analysis using the parametric G-formula. , 2017, Social science & medicine.

[49]  Michael D. Attfield,et al.  Diesel Exhaust, Respirable Dust, and Ischemic Heart Disease: An Application of the Parametric g-formula , 2019, Epidemiology.

[50]  J. A. Ferreira Causality from the Point of View of Statistics , 2019, 1908.07301.

[51]  Daniel Commenges,et al.  Causality without potential outcomes and the dynamic approach , 2019, 1905.01195.

[52]  J. A. Ferreira,et al.  Causality from the Point of View of Classical Statistics , 2019 .

[53]  Jianying Hu,et al.  G-Computation and Hierarchical Models for Estimating Multiple Causal Effects From Observational Disease Registries With Irregular Visits. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[54]  P. Bühlmann,et al.  Invariance, Causality and Robustness , 2018, Statistical Science.

[55]  P. Dawid Decision-theoretic foundations for statistical causality , 2020, Journal of Causal Inference.