Estimating treatment effects on healthcare costs under exogeneity: is there a ‘magic bullet’?

Methods for estimating average treatment effects (ATEs), under the assumption of no unmeasured confounders, include regression models; propensity score (PS) adjustments using stratification, weighting, or matching; and doubly robust estimators (a combination of both). Researchers continue to debate which estimator is best for outcomes such as health care costs, which are usually characterized by an asymmetric distribution and heterogeneous treatment effects. The challenges of finding the right specification for regression models are well documented in the literature, and propensity score estimators have been proposed as alternatives for overcoming these challenges. Using simulations, we find that in moderate-size samples (n = 5,000), balancing on PSs estimated from saturated specifications can balance the covariate means across treatment arms but fails to balance higher-order moments and covariances among covariates. Therefore, although PS estimators, unlike regression models, do not require a formal model for outcomes, they can be inefficient at best and biased at worst for health care cost data. Our simulation study, designed to take a ‘proof by contradiction’ approach, demonstrates that no one estimator can be considered the best under all data generating processes for outcomes such as costs. The inverse-propensity weighted estimator is the most likely to be unbiased under alternative data generating processes, but it is prone to bias when the PS model is misspecified and is inefficient compared to an unbiased regression estimator. Our results show that there are no ‘magic bullets’ when it comes to estimating treatment effects on health care costs. Care should be taken before naively applying any one estimator to estimate ATEs in these data. We illustrate the performance of the alternative methods using a cost dataset on breast cancer treatment.
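As a rough illustration of the inverse-propensity weighted estimator and the balance diagnostics discussed above, the following is a minimal sketch only. The data generating process (one normal covariate, a logistic treatment model, gamma-distributed costs with a log-scale mean), all coefficient values, and the function names `fit_logistic`, `ipw_ate`, `moment_gap`, and `simulate` are assumptions made for this example; they are not the simulation design or code used in the study.

```python
import numpy as np


def fit_logistic(X, d, n_iter=25):
    """Fit a logistic PS model by Newton-Raphson; X must include an intercept."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (d - p)
        hess = (X * (p * (1.0 - p))[:, None]).T @ X
        beta += np.linalg.solve(hess, grad)
    return beta


def ipw_ate(y, d, ps):
    """Normalized inverse-propensity weighted estimate of the ATE."""
    w1 = d / ps
    w0 = (1 - d) / (1 - ps)
    return np.sum(w1 * y) / np.sum(w1) - np.sum(w0 * y) / np.sum(w0)


def moment_gap(x, d, ps, power=1):
    """Weighted difference in E[x**power] between arms (0 means balanced)."""
    return ipw_ate(x ** power, d, ps)


def simulate(n=5000, seed=0):
    """Hypothetical skewed-cost data; every parameter here is illustrative."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)
    ps_true = 1.0 / (1.0 + np.exp(-0.5 * x))
    d = rng.binomial(1, ps_true)
    mu0 = np.exp(1.0 + 0.5 * x)   # mean cost if untreated
    mu1 = np.exp(1.3 + 0.5 * x)   # mean cost if treated
    shape = 2.0                   # gamma shape gives right-skewed costs
    y0 = rng.gamma(shape, mu0 / shape)
    y1 = rng.gamma(shape, mu1 / shape)
    y = np.where(d == 1, y1, y0)
    true_ate = np.mean(mu1 - mu0)
    return x, d, y, true_ate


if __name__ == "__main__":
    x, d, y, true_ate = simulate()
    X = np.column_stack([np.ones_like(x), x])
    ps_hat = 1.0 / (1.0 + np.exp(-X @ fit_logistic(X, d)))
    print("true ATE:", true_ate)
    print("naive difference in means:", y[d == 1].mean() - y[d == 0].mean())
    print("IPW estimate:", ipw_ate(y, d, ps_hat))
    print("weighted gap in mean of x:", moment_gap(x, d, ps_hat, 1))
    print("weighted gap in mean of x^2:", moment_gap(x, d, ps_hat, 2))
```

Because the PS model in this sketch is correctly specified, the weighted estimate should land close to the simulated ATE; replacing the PS design matrix with a misspecified one is a simple way to explore the sensitivity described in the abstract. The `moment_gap` helper applies the same weights to powers of the covariate, so balance in means and in higher-order moments can be inspected separately.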
