Causal inference with measurement error in outcomes: Bias analysis and estimation methods

Inverse probability weighting estimation has been popularly used to consistently estimate the average treatment effect. Its validity, however, is challenged by the presence of error-prone variables. In this paper, we explore the inverse probability weighting estimation with mismeasured outcome variables. We study the impact of measurement error for both continuous and discrete outcome variables and reveal interesting consequences of the naive analysis which ignores measurement error. When a continuous outcome variable is mismeasured under an additive measurement error model, the naive analysis may still yield a consistent estimator; when the outcome is binary, we derive the asymptotic bias in a closed-form. Furthermore, we develop consistent estimation procedures for practical scenarios where either validation data or replicates are available. With validation data, we propose an efficient method for estimation of average treatment effect; the efficiency gain is substantial relative to usual methods of using validation data. To provide protection against model misspecification, we further propose a doubly robust estimator which is consistent even when either the treatment model or the outcome model is misspecified. Simulation studies are reported to assess the performance of the proposed methods. An application to a smoking cessation dataset is presented.

[1]  Jared K Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. , 2017, Statistics in medicine.

[2]  J. Haukoos,et al.  The Propensity Score. , 2015, JAMA.

[3]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[4]  Els Goetghebeur,et al.  Comparison of causal effect estimators under exposure misclassification , 2010 .

[5]  C Frost,et al.  Correcting for measurement error in binary and continuous variables using replicates , 2001, Statistics in medicine.

[6]  Li‐Pang Chen Statistical analysis with measurement error or misclassification: Strategy, method and application. Grace Y. Yi. New York: Springer‐Verlag. , 2019, Biometrics.

[7]  W. Velicer,et al.  Biochemical verification of tobacco use and cessation. , 2002, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[8]  Wenqing He,et al.  Methods for Bivariate Survival Data with Mismeasured Covariates Under an Accelerated Failure Time Model , 2006 .

[9]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[10]  Raymond J Carroll,et al.  Functional and Structural Methods With Mixed Measurement Error and Misclassification in Covariates , 2015, Journal of the American Statistical Association.

[11]  J. R. Lockwood,et al.  Inverse probability weighting with error-prone covariates. , 2013, Biometrika.

[12]  Roger Logan,et al.  Estimation and Inference for Logistic Regression with Covariate Misclassification and Measurement Error in Main Study/Validation Study Designs , 2000 .

[13]  Peter C Austin,et al.  The performance of different propensity score methods for estimating marginal odds ratios, Statistics in Medicine 2007; 26:3078–3094 , 2008 .

[14]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[15]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[16]  Danielle Braun,et al.  Using Validation Data to Adjust the Inverse Probability Weighting Estimator for Misclassified Treatment , 2016 .

[17]  Andrew W. Roddam,et al.  Measurement Error in Nonlinear Models: a Modern Perspective , 2008 .

[18]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[19]  P. Morley-Forster,et al.  The Effectiveness of a Perioperative Smoking Cessation Program: A Randomized Clinical Trial , 2013, Anesthesia and analgesia.

[20]  Grace Y. Yi,et al.  A NOTE ON MIS-SPECIFIED ESTIMATING FUNCTIONS , 2010 .

[21]  J. Neuhaus Bias and efficiency loss due to misclassified responses in binary regression , 1999 .

[22]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[23]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[24]  Grace Y. Yi,et al.  Statistical Analysis with Measurement Error or Misclassification , 2017 .

[25]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[26]  C. Heyde Quasi-likelihood and its application : a general approach to optimal parameter estimation , 1998 .

[27]  L. Magder,et al.  Logistic regression when the outcome is measured with uncertainty. , 1997, American journal of epidemiology.

[28]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[29]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .