[Causal Inference in Medicine Part II. Directed acyclic graphs--a useful method for confounder selection, categorization of potential biases, and hypothesis specification].

Confounding is frequently a primary concern in epidemiological studies. With the increasing complexity of hypothesized relationships among exposures, outcomes, and covariates, it becomes very difficult to present these hypotheses lucidly and comprehensively. Graphical models are of great benefit in this regard. In this article, we focuse on directed acyclic graphs (DAGs), and review their value for confounder selection, categorization of potential biases, and hypothesis specification. We also discuss the importance of considering causal structures before selecting the covariates to be included in a statistical model and the potential biases introduced by inappropriately adjusting statistical models for covariates. DAGs are nonparametric and qualitative tools for visualizing research hypotheses regarding an exposure, an outcome, and covariates. Causal structures represented in DAGs will rarely be perfectly "correct" owing to the uncertainty about the underlying causal relationships. Nevertheless, to the extent that using DAGs forces greater clarity about causal assumptions, we are able to consider key sources of bias and uncertainty when interpreting study results. In summary, in this article, we review the following three points. (1) Although researchers have not adopted a consistent definition of confounders, using DAGs and the rules of d-separation we are able to identify clearly which variables we must condition on or adjust for in order to test a causal hypothesis under a set of causal assumptions. (2) We also show that DAGs should accurately correspond to research hypotheses of interest. To obtain a valid causal interpretation, research hypotheses should be defined explicitly from the perspective of a counterfactual model before drawing DAGs. A proper interpretation of the coefficients of a statistical model for addressing a specific research hypothesis relies on an accurate specification of a causal DAG reflecting the underlying causal structure. Unless DAGs correspond to research hypotheses, we cannot reliably reach proper conclusions testing the research hypotheses. Finally, (3) we have briefly reviewed other approaches to causal inference, and illustrate how these models are connected.

[1]  S. Schneeweiss,et al.  Causation of Bias: The Episcope , 2001, Epidemiology.

[2]  J M Robins,et al.  The role of model selection in causal inference from nonexperimental data. , 1986, American journal of epidemiology.

[3]  Y. Matsuyama,et al.  Marginal Structural Models as a Tool for Standardization , 2003, Epidemiology.

[4]  M. Glymour,et al.  Does childhood schooling affect old age memory or mental status? Using state schooling laws as natural experiments , 2008, Journal of Epidemiology & Community Health.

[5]  M. Hernán,et al.  Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. , 2002, American journal of epidemiology.

[6]  J. Robins Data, Design, and Background Knowledge in Etiologic Inference , 2001, Epidemiology.

[7]  Causal interpretation based on DAGs. , 2008, Epidemiology.

[8]  J. Robins,et al.  When is baseline adjustment useful in analyses of change? An example with education and cognitive change. , 2005, American journal of epidemiology.

[9]  J. Robins,et al.  Four Types of Effect Modification: A Classification Based on Directed Acyclic Graphs , 2007, Epidemiology.

[10]  Sander Greenland,et al.  Causal Diagrams , 2011, International Encyclopedia of Statistical Science.

[11]  Colin L. Soskolne,et al.  The Future of Epidemiology , 2009, Academic medicine : journal of the Association of American Medical Colleges.

[12]  S. Greenland,et al.  Estimating causal effects. , 2002, International journal of epidemiology.

[13]  Wiebe R. Pestman,et al.  Instrumental Variables: Application and Limitations , 2006, Epidemiology.

[14]  I. Kawachi,et al.  Work-based social networks and health status among Japanese employees , 2009, Journal of Epidemiology & Community Health.

[15]  P. Heagerty,et al.  Potential Confounding by Exposure History and Prior Outcomes: An Example From Perinatal Epidemiology , 2007, Epidemiology.

[16]  Marshall M Joffe,et al.  On the estimation and use of propensity scores in case-control and case-cohort studies. , 2007, American journal of epidemiology.

[17]  Sander Greenland,et al.  Causation and Causal Inference , 2021, International Encyclopedia of Statistical Science.

[18]  C Hill,et al.  [Introduction to regression models]. , 2000, Bulletin du cancer.

[19]  J. Robins,et al.  A Structural Approach to Selection Bias , 2004, Epidemiology.

[20]  M. Hernán A definition of causal effect for epidemiological research , 2004, Journal of Epidemiology and Community Health.

[21]  J. Robins,et al.  Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. , 2000, Epidemiology.

[22]  C R Weinberg,et al.  Invited commentary: Barker meets Simpson. , 2005, American journal of epidemiology.

[23]  J. Avorn,et al.  Variable selection for propensity score models. , 2006, American journal of epidemiology.

[24]  M. Hernán,et al.  The birth weight "paradox" uncovered? , 2006, American journal of epidemiology.

[25]  A. Wilcox,et al.  Invited commentary: the perils of birth weight--a lesson from directed acyclic graphs. , 2006, American journal of epidemiology.

[26]  James M Robins,et al.  Directed acyclic graphs, sufficient causes, and the properties of conditioning on a common effect. , 2007, American journal of epidemiology.

[27]  M Susser,et al.  Choosing a future for epidemiology: II. From black box to Chinese boxes and eco-epidemiology. , 1996, American journal of public health.

[28]  F. Speizer Interpreting Epidemiologic Evidence: Strategies for Study Design and Analysis , 2004 .

[29]  P. Holland Statistics and Causal Inference , 1985 .

[30]  J M Robins,et al.  Confidence intervals for causal parameters. , 1988, Statistics in medicine.

[31]  M. Glymour,et al.  USING CAUSAL DIAGRAMS TO UNDERSTAND COMMON PROBLEMS IN SOCIAL EPIDEMIOLOGY , 2006 .

[32]  A. Roux,et al.  Using directed acyclic graphs to guide analyses of neighbourhood health effects: an introduction , 2008, Journal of Epidemiology & Community Health.

[33]  S. Greenland Quantifying Biases in Causal Models: Classical Confounding vs Collider-Stratification Bias , 2003, Epidemiology.

[34]  J. Robins,et al.  Instruments for Causal Inference: An Epidemiologist's Dream? , 2006, Epidemiology.

[35]  Ira Tager,et al.  Effects of body composition and leisure-time physical activity on transitions in physical functioning in the elderly. , 2005, American journal of epidemiology.

[36]  J. Pearl,et al.  Causal diagrams for epidemiologic research. , 1999, Epidemiology.

[37]  M. Parascandola,et al.  Causation in epidemiology , 2001, Journal of epidemiology and community health.

[38]  Sander Greenland,et al.  An overview of relations among causal modelling methods. , 2002, International journal of epidemiology.

[39]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[40]  H. Morgenstern,et al.  Confounding in health research. , 2001, Annual review of public health.