A flexible two-part random effects model for correlated medical costs.

In this paper, we propose a flexible "two-part" random effects model (Olsen and Schafer, 2001; Tooze et al., 2002) for correlated medical cost data. Typically, medical cost data are right-skewed, involve a substantial proportion of zero values, and may exhibit heteroscedasticity. In many cases, such data are also obtained in hierarchical form, e.g., on patients served by the same physician. The proposed model specification therefore consists of two generalized linear mixed models (GLMM), linked together by correlated random effects. Respectively, and conditionally on the random effects and covariates, we model the odds of cost being positive (Part I) using a GLMM with a logistic link and the mean cost (Part II) given that costs were actually incurred using a generalized gamma regression model with random effects and a scale parameter that is allowed to depend on covariates (cf., Manning et al., 2005). The class of generalized gamma distributions is very flexible and includes the lognormal, gamma, inverse gamma and Weibull distributions as special cases. We demonstrate how to carry out estimation using the Gaussian quadrature techniques conveniently implemented in SAS Proc NLMIXED. The proposed model is used to analyze pharmacy cost data on 56,245 adult patients clustered within 239 physicians in a mid-western U.S. managed care organization.

[1]  J. Aitchison On the Distribution of a Positive Random Variable Having a Discrete Probability Mass at the Origin , 1955 .

[2]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[3]  S. Raudenbush,et al.  Maximum Likelihood for Generalized Linear Models with Nested Random Effects via High-Order, Multivariate Laplace Approximation , 2000 .

[4]  D. Bates,et al.  Approximations to the Log-Likelihood Function in the Nonlinear Mixed-Effects Model , 1995 .

[5]  Christian P. Robert,et al.  The Bayesian choice : from decision-theoretic foundations to computational implementation , 2007 .

[6]  Shihti Yu,et al.  On the choice between sample selection and two-part models , 1996 .

[7]  William M. Tierney,et al.  Regression analysis of health care charges with heteroscedasticity , 2001 .

[8]  Christian P. Robert,et al.  The Bayesian choice , 1994 .

[9]  Huazhen Lin,et al.  Non‐parametric heteroscedastic transformation regression models for skewed data with an application to health care costs , 2008 .

[10]  C. Morris,et al.  A Comparison of Alternative Models for the Demand for Medical Care , 1983 .

[11]  Nicola J Cooper,et al.  Predicting costs over time using Bayesian Markov chain Monte Carlo methods: an application to early inflammatory polyarthritis. , 2007, Health economics.

[12]  J. Heckman Sample selection bias as a specification error , 1979 .

[13]  Gary K Grunwald,et al.  Analysis of repeated measures data with clumping at zero , 2002, Statistical methods in medical research.

[14]  M. Cowen,et al.  Casemix adjustment of managed care claims data using the clinical classification for health policy research method. , 1998, Medical care.

[15]  D. Hedeker,et al.  A random-effects ordinal regression model for multilevel analysis. , 1994, Biometrics.

[16]  Lei Liu,et al.  A multi‐level two‐part random effects model, with application to an alcohol‐dependence study , 2008, Statistics in medicine.

[17]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[18]  J. Burridge,et al.  A note on nonregular likelihood functions in heteroscedastic regression models , 1994 .

[19]  William A. Knaus,et al.  A random effects four-part model, with application to correlated medical costs , 2008, Comput. Stat. Data Anal..

[20]  J. Mullahy,et al.  Tobit at Fifty: A Brief History of Tobin&Apos;S Remarkable Estimator, of Related Empirical Methods, and of Limited Dependent Variable Econometrics in Health Economics , 2008, Health economics.

[21]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[22]  Qing Liu,et al.  A note on Gauss—Hermite quadrature , 1994 .

[23]  G. Molenberghs,et al.  Models for Discrete Longitudinal Data , 2005 .

[24]  Anirban Basu,et al.  Generalized Modeling Approaches to Risk Adjustment of Skewed Outcomes Data , 2003, Journal of health economics.

[25]  Gene H. Golub,et al.  Calculation of Gauss quadrature rules , 1967, Milestones in Matrix Computation.

[26]  J. Mullahy Much Ado About Two: Reconsidering Retransformation and the Two-Part Model in Health Economics , 1998, Journal of health economics.

[27]  L. Tierney,et al.  Accurate Approximations for Posterior Moments and Marginal Densities , 1986 .

[28]  Robert L. Strawderman,et al.  Bayesian Inference for a Two-Part Hierarchical Model , 2006 .

[29]  Joseph L Schafer,et al.  A Two-Part Random-Effects Model for Semicontinuous Longitudinal Data , 2001 .

[30]  Willard G. Manning,et al.  Monte Carlo evidence on the choice between sample selection and two-part models , 1987 .

[31]  Russell D. Wolfinger,et al.  Laplace's approximation for nonlinear mixed models. , 1993 .

[32]  W. Manning,et al.  The logged dependent variable, heteroscedasticity, and the retransformation problem. , 1998, Journal of health economics.

[33]  R. Strawderman,et al.  Quantifying the Physician Contribution to Managed Care Pharmacy Expenses: A Random Effects Approach , 2002, Medical care.

[34]  Willard G. Manning,et al.  Choosing Between the Sample-Selection Model and the Multi-Part Model , 1984 .

[35]  Harvey Goldstein,et al.  Improved Approximations for Multilevel Models with Binary Responses , 1996 .

[36]  N. Duan Smearing Estimate: A Nonparametric Retransformation Method , 1983 .

[37]  Murray Aitkin,et al.  Variance Component Models with Binary Response: Interviewer Variability , 1985 .

[38]  M. Perlman,et al.  Health, Economics, and Health Economics , 1983 .

[39]  C. McCulloch Maximum Likelihood Algorithms for Generalized Linear Mixed Models , 1997 .

[40]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[41]  Jeffrey M. Wooldridge,et al.  Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data , 2003 .

[42]  J. G. Cragg Some Statistical Models for Limited Dependent Variables with Application to the Demand for Durable Goods , 1971 .

[43]  Shou-En Lu,et al.  Analyzing Excessive No Changes in Clinical Trials with Clustered Data , 2004, Biometrics.

[44]  P. Albert Comment on Lu, et. al. 2004: Analyzing excessive no changes in clinical trials with clustered data. , 2005, Biometrics.

[45]  D Y Lin,et al.  Methods for analyzing health care utilization and costs. , 1999, Annual review of public health.

[46]  A. Hald Maximum Likelihood Estimation of the Parameters of a Normal Distribution which is Truncated at a Known Point , 1949 .