Healthcare Cost Regressions: Going Beyond the Mean to Estimate the Full Distribution.

Understanding the data generating process behind healthcare costs remains a key empirical issue. Although much research to date has focused on the prediction of the conditional mean cost, this can potentially miss important features of the full distribution such as tail probabilities. We conduct a quasi-Monte Carlo experiment using the English National Health Service inpatient data to compare 14 approaches in modelling the distribution of healthcare costs: nine of which are parametric and have commonly been used to fit healthcare costs, and five others are designed specifically to construct a counterfactual distribution. Our results indicate that no one method is clearly dominant and that there is a trade-off between bias and precision of tail probability forecasts. We find that distributional methods demonstrate significant potential, particularly with larger sample sizes where the variability of predictions is reduced. Parametric distributions such as log-normal, generalised gamma and generalised beta of the second kind are found to estimate tail probabilities with high precision but with varying bias depending upon the cost threshold being considered.

[1]  D. Cox Regression Models and Life-Tables , 1972 .

[2]  C. Morris,et al.  A Comparison of Alternative Models for the Demand for Medical Care , 1983 .

[3]  Willard G. Manning,et al.  Monte Carlo evidence on the choice between sample selection and two-part models , 1987 .

[4]  D. Andrews Chi-Square Diagnostic Tests for Econometric Models: Introduction and Applications , 1988 .

[5]  J. Hausman,et al.  Flexible parametric estimation of duration and competing risk models , 1990 .

[6]  F. Peracchi,et al.  The Conditional Distribution of Excess Returns: An Empirical Analysis , 1994 .

[7]  P. Deb,et al.  Demand for Medical Care by the Elderly: A Finite Mixture Approach , 1997 .

[8]  M C Hornbrook,et al.  Modeling risk using generalized linear models. , 1999, Journal of health economics.

[9]  Randall P. Ellis,et al.  Risk Adjustment in Competitive Health Plan Markets , 2000 .

[10]  Donna B. Gilleskie,et al.  Estimating the Effects of Covariates on Health Expenditures , 2000 .

[11]  W. Manning,et al.  Estimating Log Models: To Transform or Not to Transform? , 1999, Journal of health economics.

[12]  J. Heckman Micro Data, Heterogeneity, and the Evaluation of Public Policy: Nobel Lecture , 2001, Journal of Political Economy.

[13]  A. Briggs,et al.  Something old, something new, something borrowed, something blue: a framework for the marriage of health econometrics and cost-effectiveness analysis. , 2002, Health economics.

[14]  P. Deb,et al.  A Quasi-experimental Comparison of Econometric Models for Health Care Expenditures , 2003 .

[15]  W. Manning,et al.  Improving Risk Adjustment for Medicare Capitated Reimbursement Using Nonlinear Models , 2003, Medical care.

[16]  Scott L. Zeger,et al.  Disease cases and their medical costs attributable to smoking: an analysis of the national medical expenditure survey , 2003 .

[17]  Melinda Beeuwkes Buntin,et al.  Too much ado about two-part models and transformation? Comparing methods of modeling Medicare expenditures. , 2004, Journal of health economics.

[18]  Anirban Basu,et al.  Comparing alternative models: log vs Cox proportional hazard? , 2004, Health economics.

[19]  J. Mata,et al.  Counterfactual decomposition of changes in wage distributions using quantile regression , 2005 .

[20]  Anirban Basu,et al.  Generalized Modeling Approaches to Risk Adjustment of Skewed Outcomes Data , 2003, Journal of health economics.

[21]  W. Walls Modelling heavy tails and skewness in film returns , 2005 .

[22]  B. Melly Decomposition of differences in distribution using quantile regression , 2005 .

[23]  B. Arondekar,et al.  Scale of interest versus scale of estimation: comparing alternative estimators for the incremental costs of a comorbidity. , 2006, Health economics.

[24]  David J. Vanness,et al.  Perspectives on Mean-based Evaluation of Health Care , 2006 .

[25]  Y. Pentsak Addressing skewness and kurtosis in health care econometrics , 2007 .

[26]  N. Fortin,et al.  Unconditional Quantile Regressions , 2007 .

[27]  A. Monfort,et al.  Fourth Order Pseudo Maximum Likelihood Methods , 2008 .

[28]  Christopher Cox,et al.  The generalized F distribution: An umbrella for parametric survival analysis , 2008, Statistics in medicine.

[29]  Willard G. Manning,et al.  Issues for the Next Generation of Health Care Cost Analyses , 2009, Medical care.

[30]  John Mullahy,et al.  Econometric Modeling of Health Care Costs and Expenditures: A Survey of Analytical Issues and Related Policy Considerations , 2009, Medical care.

[31]  V. Chernozhukov,et al.  Inference on Counterfactual Distributions , 2009, 0904.0951.

[32]  W. Manning,et al.  Measuring racial/ethnic disparities across the distribution of health care expenditures. , 2009, Health services research.

[33]  Steven C Hill,et al.  Health expenditure estimation and functional form: applications of the generalized gamma and extended estimating equations models. , 2009, Health economics.

[34]  Andrew M. Jones Models for Health Care , 2010 .

[35]  Peter C Smith,et al.  A person based formula for allocating commissioning funds to general practices in England: development of a statistical model , 2011, BMJ : British Medical Journal.

[36]  S. Jenkins GB2FIT: Stata module to fit Generalized Beta of the Second Kind distribution by maximum likelihood , 2012 .

[37]  C. Meyerhoefer,et al.  The Medical Care Costs of Obesity: an Instrumental Variables Approach , 2010, Journal of health economics.

[38]  M. Koopmanschap,et al.  Health Expenditure Growth: Looking Beyond the Average Through Decomposition of the Full Distribution , 2012, Journal of health economics.

[39]  Nigel Rice,et al.  A quasi-Monte Carlo comparison of developments in parametric and semi-parametric regression methods for heavy tailed and non-normal data: with an application to healthcare costs , 2013 .

[40]  Patrick Turley,et al.  Skewness and Kurtosis Properties of Income Distribution Models , 2013 .

[41]  T. Mora,et al.  The influence of obesity and overweight on medical costs: a panel data perspective , 2013, The European Journal of Health Economics.

[42]  Andrew M. Jones,et al.  APPLYING BETA-TYPE SIZE DISTRIBUTIONS TO HEALTHCARE COST REGRESSIONS , 2014 .

[43]  Katherine E Henson,et al.  Risk of Suicide After Cancer Diagnosis in England , 2018, JAMA psychiatry.

[44]  Marcel Bilger,et al.  Measuring overfitting in nonlinear models: a new method and an application to health expenditures. , 2015, Health economics.

[45]  D.,et al.  Regression Models and Life-Tables , 2022 .