Controlling for Continuous Confounders in Epidemiologic Research

Multiple regression models are commonly used to control for confounding in epidemiologic research. Parametric regression models, such as multiple logistic regression, are powerful tools to control for multiple covariates provided that the covariate‐risk associations are correctly specified. Residual confounding may result, however, from inappropriate specification of the confounder‐risk association. In this paper, we illustrate the order of magnitude of residual confounding that may occur with traditional approaches to control for continuous confounders in multiple logistic regression, such as inclusion of a single linear term or categorization of the confounder, under a variety of assumptions on the confounder‐risk association. We show that inclusion of the confounder as a single linear term often provides satisfactory control for confounding even in situations in which the model assumptions are clearly violated. In contrast, categorization of the confounder may often lead to serious residual confounding if the number of categories is small. Alternative strategies to control for confounding, such as polynomial regression or linear spline regression, are a useful supplement to the more traditional approaches.

[1]  S W Lagakos,et al.  Effects of mismodelling and mismeasuring explanatory variables on tests of their association with a response variable. , 1988, Statistics in medicine.

[2]  S Greenland,et al.  Avoiding power loss associated with categorization and ordinal scores in dose-response and trend analysis. , 1995, Epidemiology.

[3]  J. Manson,et al.  Body weight and mortality. A 27-year follow-up of middle-aged men. , 1993, JAMA.

[4]  K Ulm,et al.  A statistical method for assessing a threshold in epidemiological studies. , 1991, Statistics in medicine.

[5]  M. Graffar [Modern epidemiology]. , 1971, Bruxelles medical.

[6]  S Greenland,et al.  Tests for trend and dose response: misinterpretations and alternatives. , 1992, American journal of epidemiology.

[7]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[8]  D Wartenberg,et al.  Defining exposure in case-control studies: a new approach. , 1991, American journal of epidemiology.

[9]  W. G. Cochran The effectiveness of adjustment by subclassification in removing bias in observational studies. , 1968, Biometrics.

[10]  D.,et al.  Regression Models and Life-Tables , 2022 .

[11]  H. Becher,et al.  The concept of residual confounding in regression models and some applications. , 1992, Statistics in medicine.

[12]  L. P. Zhao,et al.  Efficiency loss from categorizing quantitative exposures into qualitative exposures in case-control studies. , 1992, American journal of epidemiology.

[13]  P. Boyle,et al.  Analysis of quantitative data by quantiles in epidemiologic studies: classification according to cases, noncases, or all subjects? , 1991, Epidemiology.

[14]  K Poikolainen,et al.  Alcohol and mortality: a review. , 1995, Journal of clinical epidemiology.

[15]  M Schumacher,et al.  Outcome-oriented cutpoints in analysis of quantitative exposures. , 1994, American journal of epidemiology.

[16]  S. Greenland Dose‐Response and Trend Analysis in Epidemiology: Alternatives to Categorical Analysis , 1995, Epidemiology.