Propensity scores with misclassified treatment assignment: a likelihood‐based adjustment

Summary Propensity score methods are widely used in comparative effectiveness research using claims data. In this context, the inaccuracy of procedural or billing codes in claims data frequently misclassifies patients into treatment groups, that is, the treatment assignment (T) is often measured with error. In the context of a validation data where treatment assignment is accurate, we show that misclassification of treatment assignment can impact three distinct stages of a propensity score analysis: (i) propensity score estimation; (ii) propensity score implementation; and (iii) outcome analysis conducted conditional on the estimated propensity score and its implementation. We examine how the error in T impacts each stage in the context of three common propensity score implementations: subclassification, matching, and inverse probability of treatment weighting (IPTW). Using validation data, we propose a two‐step likelihood‐based approach which fully adjusts for treatment misclassification bias under subclassification. This approach relies on two common measurement error‐assumptions; non‐differential measurement error and transportability of the measurement error model. We use simulation studies to assess the performance of the adjustment under subclassification, and also investigate the method's performance under matching or IPTW. We apply the methods to Medicare Part A hospital claims data to estimate the effect of resection versus biopsy on 1‐year mortality among 10284 Medicare beneficiaries diagnosed with brain tumors. The ICD9 billing codes from Medicare Part A inaccurately reflect surgical treatment, but SEER‐Medicare validation data are available with more accurate information.

[1]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[2]  James Lo,et al.  Scaling Roll Call Votes with wnominate in R , 2008 .

[3]  KyungMann Kim,et al.  Contrasting treatment‐specific survival using double‐robust estimators , 2012 .

[4]  William R. Shadish,et al.  On the Importance of Reliable Covariate Measurement in Selection Bias Adjustments Using Propensity Scores , 2011 .

[5]  J. Neuhaus,et al.  Binomial Regression with Misclassification , 2003, Biometrics.

[6]  E. Stuart,et al.  Using full matching to estimate causal effects in nonexperimental studies: examining the relationship between adolescent marijuana use and adult outcomes. , 2008, Developmental psychology.

[7]  K C Stange,et al.  Agreement of Medicare claims and tumor registry data for assessment of cancer-related treatment. , 2000, Medical care.

[8]  Til Stürmer,et al.  Adjusting effect estimates for unmeasured confounding with validation data using propensity score calibration. , 2005, American journal of epidemiology.

[9]  John P. Buonaccorsi,et al.  Measurement Error: Models, Methods, and Applications , 2010 .

[10]  Elizabeth A Stuart,et al.  Propensity score techniques and the assessment of measured covariate balance to test causal associations in psychological research. , 2010, Psychological methods.

[11]  Margaret S. Pepe,et al.  Inference using surrogate outcome data and a validation sample , 1992 .

[12]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[13]  Neetu Chawla,et al.  Limited validity of diagnosis codes in Medicare claims for identifying cancer metastases and inferring stage. , 2014, Annals of epidemiology.

[14]  E P Steinberg,et al.  Accuracy of Medicare Claims Data for Estimation of Cancer Incidence and Resection Rates Among Elderly Americans , 1991, Medical care.

[15]  Corwin M Zigler,et al.  A Cautionary Note on the Effect of Treatment Misclassification on the Average Treatment Effect , 2016 .

[16]  John P. Buonaccorsi,et al.  Measurement error in the response in the general linear model , 1996 .

[17]  Kara E Rudolph,et al.  An imputation-based solution to using mismeasured covariates in propensity score analysis , 2017, Statistical methods in medical research.

[18]  Els Goetghebeur,et al.  Comparison of causal effect estimators under exposure misclassification , 2010 .

[19]  Lisa I. Iezzoni,et al.  Risk Adjustment of Medicare Capitation Payments Using the CMS-HCC Model , 2004, Health care financing review.

[20]  J. R. Lockwood,et al.  Inverse probability weighting with error-prone covariates. , 2013, Biometrika.

[21]  D. Thomas,et al.  Exposure measurement error: influence on exposure-disease. Relationships and methods of correction. , 1993, Annual review of public health.

[22]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[23]  Gary King,et al.  MatchIt: Nonparametric Preprocessing for Parametric Causal Inference , 2011 .

[24]  J. R. Cook,et al.  Simulation-Extrapolation Estimation in Parametric Measurement Error Models , 1994 .

[25]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[26]  J. Goodwin,et al.  Information on radiation treatment in patients with breast cancer: the advantages of the linked medicare and SEER data. Surveillance, Epidemiology and End Results. , 1999, Journal of clinical epidemiology.