Simulation study of confounder-selection strategies.

In the absence of prior knowledge about population relations, investigators frequently employ a strategy that uses the data to help them decide whether to adjust for a variable. The authors compared the performance of several such strategies for fitting multiplicative Poisson regression models to cohort data: 1) the "change-in-estimate" strategy, in which a variable is controlled if the adjusted and unadjusted estimates differ by some important amount; 2) the "significance-test-of-the-covariate" strategy, in which a variable is controlled if its coefficient is significantly different from zero at some predetermined significance level; 3) the "significance-test-of-the-difference" strategy, which tests the difference between the adjusted and unadjusted exposure coefficients; 4) the "equivalence-test-of-the-difference" strategy, which significance-tests the equivalence of the adjusted and unadjusted exposure coefficients; and 5) a hybrid strategy that takes a weighted average of adjusted and unadjusted estimates. Data were generated from 8,100 population structures at each of several sample sizes. The performance of the different strategies was evaluated by computing bias, mean squared error, and coverage rates of confidence intervals. At least one variation of each strategy that was examined performed acceptably. The change-in-estimate and equivalence-test-of-the-difference strategies performed best when the cut-point for deciding whether crude and adjusted estimates differed by an important amount was set to a low value (10%). The significance test strategies performed best when the alpha level was set to much higher than conventional levels (0.20).

[1]  S. Greenland,et al.  A Comparison of the Performance of Model‐Based Confidence Intervals When the Correct Model Form Is Unknown: Coverage of Asymptotic Means , 1994, Epidemiology.

[2]  S Greenland,et al.  Methods for epidemiologic analyses of multiple exposures: a review and comparative study of maximum-likelihood, preliminary-testing, and empirical-Bayes regression. , 1993, Statistics in medicine.

[3]  S Greenland Reducing mean squared error in the analysis of stratified epidemiologic studies. , 1991, Biometrics.

[4]  R. Regal,et al.  The effects of model selection on confidence intervals for the size of a closed population. , 1991, Statistics in medicine.

[5]  Clifford M. Hurvich,et al.  The impact of model selection on inference in linear regression , 1990 .

[6]  L A Kalish Reducing mean squared error in the analysis of pair-matched case-control studies. , 1990, Biometrics.

[7]  Sander Greenland,et al.  Comment: Cautions in the use of preliminary‐test estimators , 1989 .

[8]  David Kriebel,et al.  Research Methods in Occupational Epidemiology , 1989 .

[9]  S Greenland,et al.  Modeling and variable selection in epidemiologic analysis. , 1989, American journal of public health.

[10]  S Greenland,et al.  The impact of confounder selection criteria on effect estimation. , 1989, American journal of epidemiology.

[11]  J. E. Glynn,et al.  Numerical Recipes: The Art of Scientific Computing , 1989 .

[12]  Clive Osmond,et al.  Modern Statistical Methods in Chronic Disease Epidemiology. , 1988 .

[13]  Norman E. Breslow,et al.  The design and analysis of cohort studies , 1987 .

[14]  Ronald A. Thisted,et al.  Elements of statistical computing , 1986 .

[15]  J. Fleiss,et al.  Significance tests have a role in epidemiologic research: reactions to A. M. Walker. , 1986, American journal of public health.

[16]  W W Hauck,et al.  A proposal for interpreting and reporting negative studies. , 1986, Statistics in medicine.

[17]  L. Devroye Non-Uniform Random Variate Generation , 1986 .

[18]  J M Robins,et al.  The role of model selection in causal inference from nonexperimental data. , 1986, American journal of epidemiology.

[19]  On pooling across strata when frequency matching has been followed in a cohort study. , 1985, Biometrics.

[20]  J M Robins,et al.  Estimation of a common effect parameter from sparse follow-up data. , 1985, Biometrics.

[21]  H. Morgenstern,et al.  Epidemiologic Research: Principles and Quantitative Methods. , 1983 .

[22]  J. Schlesselman,et al.  Case-Control Studies: Design, Conduct, Analysis , 1982 .

[23]  I. D. Hill,et al.  Correction: Algorithm AS 183: An Efficient and Portable Pseudo-Random Number Generator , 1982 .

[24]  N. Breslow,et al.  Statistical methods in cancer research. Vol. 1. The analysis of case-control studies. , 1981 .

[25]  R Neutra,et al.  Control of confounding in the assessment of medical technology. , 1980, International journal of epidemiology.

[26]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .

[27]  L. Dales,et al.  An improper use of statistical significance testing in studying covariables. , 1978, International journal of epidemiology.

[28]  A. Whittemore Collapsibility of Multidimensional Contingency Tables , 1978 .

[29]  O S Miettinen,et al.  Stratification by a multivariate confounder score. , 1976, American journal of epidemiology.

[30]  M. Graffar [Modern epidemiology]. , 1971, Bruxelles medical.