A comparison of full model specification and backward elimination of potential confounders when estimating marginal and conditional causal effects on binary outcomes from observational data

A common view in epidemiology is that automated confounder selection methods, such as backward elimination, should be avoided as they can lead to biased effect estimates and underestimation of their variance. Nevertheless, backward elimination remains regularly applied. We investigated if and under which conditions causal effect estimation in observational studies can improve by using backward elimination on a prespecified set of potential confounders. An expression was derived that quantifies how variable omission relates to bias and variance of effect estimators. Additionally, 3960 scenarios were defined and investigated by simulations comparing bias and mean squared error (MSE) of the conditional log odds ratio, log(cOR), and the marginal log risk ratio, log(mRR), between full models including all prespecified covariates and backward elimination of these covariates. Applying backward elimination resulted in a mean bias of 0.03 for log(cOR) and 0.02 for log(mRR), compared to 0.56 and 0.52 for log(cOR) and log(mRR), respectively, for a model without any covariate adjustment, and no bias for the full model. In less than 3% of the scenarios considered, the MSE of the log(cOR) or log(mRR) was slightly lower (max 3%) when backward elimination was used compared to the full model. When an initial set of potential confounders can be specified based on background knowledge, there is minimal added value of backward elimination. We advise not to use it and otherwise to provide ample arguments supporting its use.

[1]  Kellyn F Arnold,et al.  Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations , 2020, International journal of epidemiology.

[2]  G. Heinze,et al.  Selection of variables for multivariable models: Opportunities and limitations in quantifying model stability by resampling , 2020, Statistics in medicine.

[3]  P. Kolh,et al.  Routine preoperative aortic computed tomography angiography is associated with reduced risk of stroke in coronary artery bypass grafting: a propensity-matched analysis. , 2019, European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery.

[4]  Willi Sauerbrei,et al.  State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues , 2019, Diagnostic and Prognostic Research.

[5]  Els Goetghebeur,et al.  Formulating causal questions and principled statistical answers , 2019, Statistics in medicine.

[6]  D. Talbot,et al.  A descriptive review of variable selection methods in four epidemiologic journals: there is still room for improvement , 2019, European Journal of Epidemiology.

[7]  T. VanderWeele Principles of confounder selection , 2019, European Journal of Epidemiology.

[8]  Vanessa Didelez,et al.  Covariate selection strategies for causal inference: Classification and comparison , 2018, Biometrical journal. Biometrische Zeitschrift.

[9]  John Hsu,et al.  A Second Chance to Get Causal Inference Right: A Classification of Data Science Tasks , 2018, CHANCE.

[10]  Georg Heinze,et al.  Variable selection – A review and recommendations for the practicing statistician , 2018, Biometrical journal. Biometrische Zeitschrift.

[11]  Michael J Crowther,et al.  Using simulation studies to evaluate statistical methods , 2017, Statistics in medicine.

[12]  Georg Heinze,et al.  Firth's logistic regression with rare events: accurate effect estimates and predictions? , 2017, Statistics in medicine.

[13]  Georg Heinze,et al.  Five myths about variable selection , 2017, Transplant international : official journal of the European Society for Organ Transplantation.

[14]  Douglas G. Altman,et al.  No rationale for 1 variable per 10 events criterion for binary logistic regression analysis , 2016, BMC Medical Research Methodology.

[15]  Sander Greenland,et al.  Outcome modelling strategies in epidemiology: traditional methods and basic alternatives , 2016, International journal of epidemiology.

[16]  Ashkan Ertefaie,et al.  Variable Selection in Causal Inference using a Simultaneous Penalization Method , 2015, 1511.08501.

[17]  Rolf H H Groenwold,et al.  Reporting of covariate selection and balance assessment in propensity score analysis is suboptimal: a systematic review. , 2015, Journal of clinical epidemiology.

[18]  Brian J Reich,et al.  Confounder selection via penalized credible regions , 2014, Biometrics.

[19]  G. Heinze,et al.  Augmented Backward Elimination: A Pragmatic and Purposeful Way to Develop Statistical Models , 2014, PloS one.

[20]  Luke W. Miratrix,et al.  To Adjust or Not to Adjust? Sensitivity Analysis of M-Bias and Butterfly-Bias , 2014, 1408.0324.

[21]  Victor Chernozhukov,et al.  Post-Selection Inference for Generalized Linear Models With Many Controls , 2013, 1304.3969.

[22]  A. Buja,et al.  Valid post-selection inference , 2013, 1306.1059.

[23]  S. Vansteelandt,et al.  On model selection and model misspecification in causal inference , 2012, Statistical methods in medical research.

[24]  I. Shpitser,et al.  A New Criterion for Confounder Selection , 2011, Biometrics.

[25]  E. Steyerberg,et al.  [Regression modeling strategies]. , 2011, Revista espanola de cardiologia.

[26]  Galit Shmueli,et al.  To Explain or To Predict? , 2010, 1101.0891.

[27]  H. Tiemeier,et al.  Variable selection: current practice in epidemiological studies , 2009, European Journal of Epidemiology.

[28]  A. Hoes,et al.  Poor quality of reporting confounding bias in observational intervention studies: a systematic review. , 2008, Annals of epidemiology.

[29]  Sander Greenland,et al.  Invited commentary: variable selection versus shrinkage in the control of multiple confounders. , 2007, American journal of epidemiology.

[30]  Jesse A Berlin,et al.  Relative risks and confidence intervals were easily computed indirectly from multivariable logistic regression. , 2007, Journal of clinical epidemiology.

[31]  P. Rochon,et al.  Readers should systematically assess methods used to identify, measure and analyze confounding in observational cohort studies. , 2007, Journal of clinical epidemiology.

[32]  B. M. Pötscher,et al.  MODEL SELECTION AND INFERENCE: FACTS AND FICTION , 2005, Econometric Theory.

[33]  Sander Greenland,et al.  Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies. , 2004, American journal of epidemiology.

[34]  S. Greenland Quantifying Biases in Causal Models: Classical Confounding vs Collider-Stratification Bias , 2003, Epidemiology.

[35]  M. Schemper,et al.  A solution to the problem of separation in logistic regression , 2002, Statistics in medicine.

[36]  G W Sun,et al.  Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. , 1996, Journal of clinical epidemiology.

[37]  D. Firth Bias reduction of maximum likelihood estimates , 1993 .

[38]  P. McCullagh,et al.  Bias Correction in Generalized Linear Models , 1991 .

[39]  J. Ioannidis,et al.  Interpretation of epidemiologic studies very often lacked adequate consideration of confounding. , 2018, Journal of clinical epidemiology.

[40]  E. Hak,et al.  Quality of reporting of confounding remained suboptimal after the STROBE guideline. , 2016, Journal of clinical epidemiology.

[41]  Peter C Austin,et al.  Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model. , 2010, Journal of clinical epidemiology.

[42]  R. Schaefer Bias correction in maximum likelihood logistic regression. , 1985, Statistics in medicine.