A Review of Some Recent Advances in Causal Inference

We give a selective review of some recent developments in causal inference, intended for researchers who are not familiar with graphical models and causality, and with a focus on methods that are applicable to large data sets. We mainly address the problem of estimating causal effects from observational data. For example, one can think of estimating the effect of single or multiple gene knockouts from wild-type gene expression data, that is, from gene expression measurements that were obtained without doing any gene knockout experiments. We assume that the observational data are generated from a causal structure that can be represented by a directed acyclic graph (DAG). First, we discuss estimation of causal effects when the underlying causal DAG is known. In large-scale networks, however, the causal DAG is often unknown. Next, we therefore discuss causal structure learning, that is, learning information about the causal structure from observational data. We then combine these two parts and discuss methods to estimate (bounds on) causal effects from observational data when the causal structure is unknown. We also illustrate this method on a yeast gene expression data set. We close by mentioning several extensions of the discussed work.

[1]  Bernhard Schölkopf,et al.  Nonlinear causal discovery with additive noise models , 2008, NIPS.

[2]  M. Maathuis,et al.  Estimating the effect of joint interventions from observational data in sparse high-dimensional settings , 2014, 1407.2451.

[3]  Constantin F. Aliferis,et al.  Local Causal and Markov Blanket Induction for Causal Discovery and Feature Selection for Classification Part II: Analysis and Extensions , 2010, J. Mach. Learn. Res..

[4]  Elias Bareinboim,et al.  Transportability from Multiple Environments with Limited Experiments: Completeness Results , 2014, NIPS.

[5]  P. Spirtes,et al.  Causation, Prediction, and Search, 2nd Edition , 2001 .

[6]  Peter Bühlmann,et al.  p-Values for High-Dimensional Regression , 2008, 0811.2177.

[7]  Johannes Textor,et al.  A Complete Generalized Adjustment Criterion , 2015, UAI.

[8]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[9]  Alain Hauser,et al.  Understanding consistency in hybrid causal structure learning , 2015 .

[10]  Naftali Harris,et al.  PC algorithm for nonparanormal graphical models , 2013, J. Mach. Learn. Res..

[11]  Joseph Ramsey A PC-Style Markov Blanket for High Dimensional Datasets , 2006 .

[12]  Richard Scheines,et al.  A Statistical Problem for Inference to Regulatory Structure from Associations of Gene Expression Measurements with Microarrays , 2003, Bioinform..

[13]  Patrik O. Hoyer,et al.  Estimation of causal effects using linear non-Gaussian causal models with hidden variables , 2008, Int. J. Approx. Reason..

[14]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[15]  Judea Pearl,et al.  Comment: Graphical Models, Causality and Intervention , 2016 .

[16]  Using Causal Discovery Algorithms to Learn About Our Planet’s Climate , 2015 .

[17]  J. Robins,et al.  Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. , 2000, Epidemiology.

[18]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[19]  J. Pearl Causal inference in statistics: An overview , 2009 .

[20]  Peter Bühlmann,et al.  Predicting causal effects in large-scale systems from observational data , 2010, Nature Methods.

[21]  T. Heskes,et al.  Learning Sparse Causal Models is not NP-hard , 2013, UAI.

[22]  P. Bühlmann,et al.  Causal Stability Ranking , 2011 .

[23]  Peter Bühlmann,et al.  Causal Inference Using Graphical Models with the R Package pcalg , 2012 .

[24]  Franz von Kutschera,et al.  Causation , 1993, J. Philos. Log..

[25]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[26]  Judea Pearl,et al.  Identification of Conditional Interventional Distributions , 2006, UAI.

[27]  P. Kemmeren,et al.  De-Novo Learning of Genome-Scale Regulatory Networks in S. cerevisiae , 2014, PloS one.

[28]  Constantin F. Aliferis,et al.  Local Causal and Markov Blanket Induction for Causal Discovery and Feature Selection for Classification Part I: Algorithms and Empirical Evaluation , 2010, J. Mach. Learn. Res..

[29]  Mark W. Woolrich,et al.  Network modelling methods for FMRI , 2011, NeuroImage.

[30]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[31]  Constantin F. Aliferis,et al.  Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[32]  Maciej Liskiewicz,et al.  Constructing Separators and Adjustment Sets in Ancestral Graphs , 2014, CI@UAI.

[33]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[34]  Dario Floreano,et al.  Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods , 2009, J. Comput. Biol..

[35]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[36]  James M. Robins,et al.  On the Validity of Covariate Adjustment for Estimating Causal Effects , 2010, UAI.

[37]  J. Peters,et al.  Identifiability of Gaussian structural equation models with equal error variances , 2012, 1205.2536.

[38]  Jiji Zhang,et al.  On the completeness of orientation rules for causal discovery in the presence of latent confounders and selection bias , 2008, Artif. Intell..

[39]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[40]  Russell A. Poldrack,et al.  Six problems for causal inference from fMRI , 2010, NeuroImage.

[41]  Clark Glymour,et al.  Atypical Effective Connectivity of Social Brain Networks in Individuals with Autism , 2013, Brain Connect..

[42]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[43]  Ioannis G. Tollis,et al.  Learning Causal Structure from Overlapping Variable Sets , 2010, AISTATS.

[44]  Aapo Hyvärinen,et al.  A direct method for estimating a causal ordering in a linear non-Gaussian acyclic model , 2009, UAI.

[45]  Diego Colombo,et al.  Order-independent constraint-based causal structure learning , 2012, J. Mach. Learn. Res..

[46]  Imme Ebert-Uphoff,et al.  Causal Discovery for Climate Research Using Graphical Models , 2012 .

[47]  Clark Glymour,et al.  Search for Additive Nonlinear Time Series Causal Models , 2008, J. Mach. Learn. Res..

[48]  Peter Bühlmann,et al.  Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm , 2007, J. Mach. Learn. Res..

[49]  Steven L. Scott,et al.  Inferring causal impact using Bayesian structural time-series models , 2015, 1506.00356.

[50]  Thomas S. Richardson,et al.  Causal Inference in the Presence of Latent Variables and Selection Bias , 1995, UAI.

[51]  Peter Bühlmann,et al.  Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs (Abstract) , 2011, UAI.

[52]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[53]  M. Maathuis,et al.  Estimating high-dimensional intervention effects from observational data , 2008, 0810.4214.

[54]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[55]  Aapo Hyvärinen,et al.  Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity , 2010, J. Mach. Learn. Res..

[56]  Jonas Peters,et al.  Causal inference by using invariant prediction: identification and confidence intervals , 2015, 1501.01332.

[57]  Bernhard Schölkopf,et al.  On Causal Discovery with Cyclic Additive Noise Models , 2011, NIPS.

[58]  Vincenzo Lagani,et al.  Towards Integrative Causal Analysis of Heterogeneous Data Sets and Studies , 2012, J. Mach. Learn. Res..

[59]  D. Katz The American Statistical Association , 2000 .

[60]  Thomas S. Richardson,et al.  A Discovery Algorithm for Directed Cyclic Graphs , 1996, UAI.

[61]  Thomas S. Richardson,et al.  Learning high-dimensional directed acyclic graphs with latent and selection variables , 2011, 1104.5617.

[62]  P. Hoyer,et al.  On Causal Discovery from Time Series Data using FCI , 2010 .

[63]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[64]  J. Pearl Causal diagrams for empirical research , 1995 .

[65]  P. Smyth,et al.  Cyclic Causal Discovery from Continuous Equilibrium Data , 2013, UAI.

[66]  P. Games Correlation and Causation: A Logical Snafu , 1990 .

[67]  Daniel Chicharro,et al.  Algorithms of causal inference for the analysis of effective connectivity among brain regions , 2014, Front. Neuroinform..

[68]  David Danks,et al.  Integrating Locally Learned Causal Structures with Overlapping Variables , 2008, NIPS.

[69]  M. Maathuis,et al.  A Generalized Back-door Criterion 1 , 2015 .

[70]  R. P. McDonald,et al.  Structural Equations with Latent Variables , 1989 .

[71]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[72]  Korbinian Strimmer,et al.  From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data , 2007, BMC Systems Biology.