Quantifying Biases in Causal Models: Classical Confounding vs Collider-Stratification Bias

It has long been known that stratifying on variables affected by the study exposure can create selection bias. More recently it has been shown that stratifying on a variable that precedes exposure and disease can induce confounding, even if there is no confounding in the unstratified (crude) estimate. This paper examines the relative magnitudes of these biases under some simple causal models in which the stratification variable is graphically depicted as a collider (a variable directly affected by two or more other variables in the graph). The results suggest that bias from stratifying on variables affected by exposure and disease may often be comparable in size with bias from classical confounding (bias from failing to stratify on a common cause of exposure and disease), whereas other biases from collider stratification may tend to be much smaller.

[1]  J BERKSON,et al.  Limitations of the application of fourfold table analysis to hospital data. , 1946, Biometrics.

[2]  E. Kitagawa,et al.  Components of a Difference Between Two Rates , 1955 .

[3]  I NICOLETTI,et al.  The Planning of Experiments , 1936, Rivista di clinica pediatrica.

[4]  E. C. Hammond,et al.  Smoking and lung cancer: recent evidence and a discussion of some questions. , 1959, Journal of the National Cancer Institute.

[5]  I. Bross,et al.  Pertinency of an extraneous variable. , 1967, Journal of chronic diseases.

[6]  H Checkoway,et al.  Bias due to misclassification in the estimation of relative risk. , 1977, American journal of epidemiology.

[7]  J. Schlesselman Assessing effects of confounding variables. , 1978, American journal of epidemiology.

[8]  A R Feinstein,et al.  Alternative analytic methods for case-control studies of estrogens and endometrial cancer. , 1979, The New England journal of medicine.

[9]  A. Feinstein,et al.  Alternative Analytic Methods for Case-Control Studies of Estrogens and Endometrial Cancer , 1978 .

[10]  Correcting a bias? , 1978, The New England journal of medicine.

[11]  R Neutra,et al.  Control of confounding in the assessment of medical technology. , 1980, International journal of epidemiology.

[12]  N. Breslow,et al.  Statistical methods in cancer research: volume 1- The analysis of case-control studies , 1980 .

[13]  R Neutra,et al.  An analysis of detection bias and proposed corrections in the study of estrogens and endometrial cancer. , 1981, Journal of chronic diseases.

[14]  Takashi Yanagawa,et al.  Case-control studies: Assessing the effect of a confounding factor , 1984 .

[15]  M. Gail,et al.  Indirect corrections for confounding under multiplicative and additive risk models. , 1988, American journal of industrial medicine.

[16]  S Greenland,et al.  Randomization, Statistics, and Causal Inference , 1990, Epidemiology.

[17]  W. Flanders,et al.  Indirect Assessment of Confounding: Graphic Description and Limits on Effect of Adjusting for Covariates , 1990, Epidemiology.

[18]  S Greenland Reducing mean squared error in the analysis of stratified epidemiologic studies. , 1991, Biometrics.

[19]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[20]  C R Weinberg,et al.  Toward a clearer definition of confounding. , 1993, American journal of epidemiology.

[21]  S Greenland,et al.  The interpretation of multiplicative-model parameters as standardized parameters. , 1994, Statistics in medicine.

[22]  J. Pearl Causal diagrams for empirical research , 1995 .

[23]  M. Szklo,et al.  Epidemiology: Beyond the Basics , 1999 .

[24]  G. Shaw,et al.  Maternal pesticide exposure from multiple sources and selected congenital anomalies. , 1999 .

[25]  J. Pearl,et al.  Causal diagrams for epidemiologic research. , 1999, Epidemiology.

[26]  J. Pearl,et al.  Confounding and Collapsibility in Causal Inference , 1999 .

[27]  J. Robins Data, Design, and Background Knowledge in Etiologic Inference , 2001, Epidemiology.

[28]  P. Simpson,et al.  Statistical methods in cancer research , 2001, Journal of surgical oncology.

[29]  J. Kaufman,et al.  Assessment of Structured Socioeconomic Effects on Health , 2001, Epidemiology.

[30]  S. Cole,et al.  Fallibility in estimating direct effects. , 2002, International journal of epidemiology.

[31]  M. Hernán,et al.  Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. , 2002, American journal of epidemiology.

[32]  Sander Greenland,et al.  An overview of relations among causal modelling methods. , 2002, International journal of epidemiology.

[33]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.