Estimating exposure effects by modelling the expectation of exposure conditional on confounders.

In order to estimate the causal effects of one or more exposures or treatments on an outcome of interest, one has to account for the effect of "confounding factors" which both covary with the exposures or treatments and are independent predictors of the outcome. In this paper we present regression methods which, in contrast to standard methods, adjust for the confounding effect of multiple continuous or discrete covariates by modelling the conditional expectation of the exposures or treatments given the confounders. In the special case of a univariate dichotomous exposure or treatment, this conditional expectation is identical to what Rosenbaum and Rubin have called the propensity score. They have also proposed methods to estimate causal effects by modelling the propensity score. Our methods generalize those of Rosenbaum and Rubin in several ways. First, our approach straightforwardly allows for multivariate exposures or treatments, each of which may be continuous, ordinal, or discrete. Second, even in the case of a single dichotomous exposure, our approach does not require subclassification or matching on the propensity score so that the potential for "residual confounding," i.e., bias, due to incomplete matching is avoided. Third, our approach allows a rather general formalization of the idea that it is better to use the "estimated propensity score" than the true propensity score even when the true score is known. The additional power of our approach derives from the fact that we assume the causal effects of the exposures or treatments can be described by the parametric component of a semiparametric regression model. To illustrate our methods, we reanalyze the effect of current cigarette smoking on the level of forced expiratory volume in one second in a cohort of 2,713 adult white males. We compare the results with those obtained using standard methods.

[1]  P. Robinson ROOT-N-CONSISTENT SEMIPARAMETRIC REGRESSION , 1988 .

[2]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[3]  Charles F. Manski,et al.  Analog estimation methods in econometrics , 1988 .

[4]  J. Robins Correcting for non-compliance in randomized trials using structural nested mean models , 1994 .

[5]  J. Robins,et al.  The foundations of confounding in epidemiology , 1987 .

[6]  W. Newey,et al.  Semiparametric Efficiency Bounds , 1990 .

[7]  D. Pregibon,et al.  Graphical Methods for Assessing Logistic Regression Models , 1984 .

[8]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[9]  J. Robins The control of confounding by intermediate variables. , 1989, Statistics in medicine.

[10]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[11]  W. J. Hall,et al.  Information and Asymptotic Efficiency in Parametric-Nonparametric Models , 1983 .

[12]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[13]  M. Gail,et al.  Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , 1984 .

[14]  D. Pierce The Asymptotic Effect of Substituting Estimators for Parameters in Certain Types of Statistics , 1982 .

[15]  T. Louis,et al.  Cumulative and reversible effects of lifetime smoking on simple tests of lung function in adults. , 1988, The American review of respiratory disease.

[16]  G. Chamberlain Asymptotic efficiency in estimation with conditional moment restrictions , 1987 .

[17]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[18]  P. Rosenbaum Permutation Tests for Matched Pairs with Adjustments for Covariates , 1988 .

[19]  J. Robins,et al.  G-Estimation of the Effect of Prophylaxis Therapy for Pneumocystis carinii Pneumonia on the Survival of AIDS Patients , 1992, Epidemiology.

[20]  B. Efron,et al.  Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information , 1978 .

[21]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[22]  M. Kendall Theoretical Statistics , 1956, Nature.

[23]  J. Robins Estimation of the time-dependent accelerated failure time model in the presence of confounding factors , 1992 .

[24]  P. W. Bowman,et al.  PHS Public Health Service , 1963 .

[25]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[26]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[27]  P. Rosenbaum Conditional Permutation Tests and the Propensity Score in Observational Studies , 1984 .