A robust approach for skewed and heavy-tailed outcomes in the analysis of health care expenditures.

In this paper robust statistical procedures are presented for the analysis of skewed and heavy-tailed outcomes as they typically occur in health care data. The new estimators and test statistics are extensions of classical maximum likelihood techniques for generalized linear models. In contrast to their classical counterparts, the new robust techniques show lower variability and excellent efficiency properties in the presence of small deviations from the assumed model, i.e. when the underlying distribution of the data lies in a neighborhood of the model. A simulation study, an analysis on real data, and a sensitivity analysis confirm the good theoretical statistical properties of the new techniques.

[1]  E. Ronchetti,et al.  Robust Estimators for Simultaneous Equations Models , 1997 .

[2]  F. Hampel The Influence Curve and Its Role in Robust Estimation , 1974 .

[3]  Han Bleichrodt,et al.  Handbook of Health Economics , 2001 .

[4]  E. Ronchetti,et al.  Robust Inference for Generalized Linear Models , 2001 .

[5]  C. Morris,et al.  A Comparison of Alternative Models for the Demand for Medical Care , 1983 .

[6]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[7]  Donna B. Gilleskie,et al.  Estimating the Effects of Covariates on Health Expenditures , 2000 .

[8]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[9]  John Mullahy,et al.  HETEROGENEITY, EXCESS ZEROS, AND THE STRUCTURE OF COUNT DATA MODELS , 1997 .

[10]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[11]  Dana P. Goldman,et al.  Cost-Containment and Adverse Selection in Medicaid HMOs , 1998 .

[12]  N. Duan Smearing Estimate: A Nonparametric Retransformation Method , 1983 .

[13]  Elvezio Ronchetti,et al.  Variable Selection for Marginal Longitudinal Generalized Linear Models , 2003, Biometrics.

[14]  Alfio Marazzi,et al.  Adaptively truncated maximum likelihood regression with asymmetric errors , 2004 .

[15]  Donna B. Gilleskie,et al.  A flexible approach for estimating the effects of covariates on health expenditures. , 2000, Journal of health economics.

[16]  E. Keeler,et al.  Health insurance and the demand for medical care: evidence from a randomized experiment. , 1987, The American economic review.

[17]  E. Ronchetti,et al.  Bias‐calibrated estimation from sample surveys containing outliers , 1998 .

[18]  W. Manning,et al.  Estimating Log Models: To Transform or Not to Transform? , 1999, Journal of health economics.

[19]  J. Mullahy Much Ado About Two: Reconsidering Retransformation and the Two-Part Model in Health Economics , 1998, Journal of health economics.

[20]  J. Mullahy Specification and testing of some modified count data models , 1986 .

[21]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[22]  W. Manning,et al.  The logged dependent variable, heteroscedasticity, and the retransformation problem. , 1998, Journal of health economics.

[23]  M C Hornbrook,et al.  Modeling risk using generalized linear models. , 1999, Journal of health economics.

[24]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[25]  D. Ruppert Robust Statistics: The Approach Based on Influence Functions , 1987 .