Bayesian Effect Estimation Accounting for Adjustment Uncertainty

Model-based estimation of the effect of an exposure on an outcome is generally sensitive to the choice of which confounding factors are included in the model. We propose a new approach, which we call Bayesian adjustment for confounding (BAC), to estimate the effect of an exposure of interest on the outcome, while accounting for the uncertainty in the choice of confounders. Our approach is based on specifying two models: (1) the outcome as a function of the exposure and the potential confounders (the outcome model); and (2) the exposure as a function of the potential confounders (the exposure model). We consider Bayesian variable selection on both models and link the two by introducing a dependence parameter, ω, denoting the prior odds of including a predictor in the outcome model, given that the same predictor is in the exposure model. In the absence of dependence (ω= 1), BAC reduces to traditional Bayesian model averaging (BMA). In simulation studies, we show that BAC, with ω > 1, estimates the exposure effect with smaller bias than traditional BMA, and improved coverage. We, then, compare BAC, a recent approach of Crainiceanu, Dominici, and Parmigiani (2008, Biometrika 95, 635-651), and traditional BMA in a time series data set of hospital admissions, air pollution levels, and weather variables in Nassau, NY for the period 1999-2005. Using each approach, we estimate the short-term effects of on emergency admissions for cardiovascular diseases, accounting for confounding. This application illustrates the potentially significant pitfalls of misusing variable selection methods in the context of adjustment uncertainty.

[1]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[2]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[3]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[4]  F. Dominici,et al.  Combining evidence on air pollution and daily mortality from the 20 largest US cities: a hierarchical modelling strategy , 2000 .

[5]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[6]  Paul Gustafson,et al.  Bayesian Propensity Score Analysis for Observational Data , 2006 .

[7]  Merlise A. Clyde,et al.  Model uncertainty and health effect studies for particulate matter , 2000 .

[8]  Ciprian M. Crainiceanu,et al.  Adjustment uncertainty in effect estimation , 2008 .

[9]  J. Avorn,et al.  Variable selection for propensity score models. , 2006, American journal of epidemiology.

[10]  Guido Consonni,et al.  Compatibility of prior specifications across linear models , 2008, 1102.2981.

[11]  F. Dominici,et al.  Trends in Air Pollution and Mortality: An Approach to the Assessment of Unmeasured Confounding , 2007, Epidemiology.

[12]  James M. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models: Rejoinder , 1999 .

[13]  F. Dominici,et al.  Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. , 2006, JAMA.

[14]  J. Robins,et al.  Estimating exposure effects by modelling the expectation of exposure conditional on confounders. , 1992, Biometrics.

[15]  A. Raftery Bayesian Model Selection in Social Research , 1995 .

[16]  T. Louis,et al.  Model choice in time series studies of air pollution and mortality , 2006 .

[17]  Gary Koop,et al.  Measuring the health effects of air pollution: to what extent can we really say that people are dying from bad air? , 2004 .

[18]  Variable selection versus shrinkage in the control of multiple confounders. Response. , 2008 .

[19]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[20]  Edward I. George,et al.  Bayesian Treed Models , 2002, Machine Learning.

[21]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[22]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[23]  Donald Rubin,et al.  Estimating Causal Effects from Large Data Sets Using Propensity Scores , 1997, Annals of Internal Medicine.

[24]  Adrian E. Raftery,et al.  Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data , 2005, Bioinform..

[25]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[26]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[27]  Keming Yu,et al.  Bayesian Mode Regression , 2012, 1208.0579.

[28]  S Greenland,et al.  The impact of confounder selection criteria on effect estimation. , 1989, American journal of epidemiology.

[29]  T. Hastie,et al.  Improved Semiparametric Time Series Models of Air Pollution and Mortality , 2004 .

[30]  S. Richardson,et al.  Variable selection and Bayesian model averaging in case‐control studies , 2001, Statistics in medicine.

[31]  Chris Hans,et al.  Model uncertainty and variable selection in Bayesian lasso regression , 2010, Stat. Comput..

[32]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.