Regression Estimators for Generic Health-Related Quality of Life and Quality-Adjusted Life Years

Purpose. To develop regression models for outcomes with truncated supports, such as health-related quality of life (HRQoL) data, that account for features typical of such data, including skewed distributions, spikes at 1 or 0, and heteroskedasticity.

Methods. Regression estimators are developed based on features of the Beta distribution. First, both a single-equation model and a 2-part model are presented, along with estimation algorithms based on maximum likelihood, quasi-likelihood, and Bayesian Markov chain Monte Carlo methods. A novel Bayesian quasi-likelihood estimator is proposed. Second, a simulation exercise assesses the performance of the proposed estimators against ordinary least squares (OLS) regression for a variety of HRQoL distributions encountered in practice. Finally, the proposed estimators are used to quantify the treatment effect on quality-adjusted life years (QALYs) in the EVALUATE hysterectomy trial. Overall model fit is studied using several goodness-of-fit tests, including Pearson's correlation test, link and RESET tests, and a modified Hosmer-Lemeshow test.

Results. The simulation results indicate that the proposed methods are more robust than OLS in estimating covariate effects, especially when the effects are large or the HRQoL distribution has a large spike at 1. Quasi-likelihood techniques are more robust than maximum likelihood estimators. When applied to the EVALUATE trial, all but the maximum likelihood estimators produce unbiased estimates of the treatment effect.

Conclusion. One- and 2-part Beta regression models provide flexible approaches for regressing outcomes with truncated supports, such as HRQoL, on covariates while accounting for many idiosyncratic features of the outcome distribution. This work provides applied researchers with a practical set of tools for modeling outcomes in cost-effectiveness analysis.
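As an illustration of the quasi-likelihood idea mentioned in the Methods, the following is a minimal sketch of a single-equation estimator for an outcome bounded in [0, 1]. It assumes a logit link for the mean and a Bernoulli-type quasi-likelihood (often called a fractional logit), fitted by Fisher scoring; this is one common estimator of this kind and not necessarily the specification used in the paper. The function name `fit_fractional_logit`, the simulated covariates, the coefficient values, and the precision parameter `phi` are all hypothetical choices made for the example.

```python
import numpy as np


def expit(z):
    """Inverse logit: maps the linear predictor to a mean in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))


def fit_fractional_logit(X, y, n_iter=50, tol=1e-10):
    """Bernoulli quasi-likelihood with a logit link, fitted by Fisher scoring.

    Only the conditional mean E[y | X] = expit(X @ beta) is assumed to be
    correctly specified; y may take any value in [0, 1].
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = expit(X @ beta)
        w = mu * (1.0 - mu)                 # working variance weights
        score = X.T @ (y - mu)              # quasi-score equations
        info = X.T @ (X * w[:, None])       # expected information
        step = np.linalg.solve(info, score)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Simulated data (assumption): Beta-distributed outcome whose mean depends
# on a binary treatment indicator and one standardized covariate.
rng = np.random.default_rng(0)
n = 5000
treat = rng.integers(0, 2, size=n).astype(float)
age = rng.standard_normal(n)
X = np.column_stack([np.ones(n), treat, age])
true_beta = np.array([0.5, 0.8, -0.3])      # hypothetical coefficients
phi = 5.0                                   # hypothetical Beta precision
mu_true = expit(X @ true_beta)
y = rng.beta(mu_true * phi, (1.0 - mu_true) * phi)

beta_hat = fit_fractional_logit(X, y)

# Average incremental effect of treatment on the mean (outcome scale):
# predict for everyone with treat = 1 and with treat = 0, then average.
X1, X0 = X.copy(), X.copy()
X1[:, 1], X0[:, 1] = 1.0, 0.0
effect_hat = np.mean(expit(X1 @ beta_hat) - expit(X0 @ beta_hat))
effect_true = np.mean(expit(X1 @ true_beta) - expit(X0 @ true_beta))

print("estimated coefficients:", np.round(beta_hat, 3))
print("estimated incremental effect:", round(float(effect_hat), 4))
```

Because the estimating equations involve only the mean, the same function can be applied when the outcome has mass at exactly 0 or 1, which is where a full Beta likelihood is undefined. A 2-part variant would instead model the probability of the boundary value separately and fit the interior values with a Beta-based model.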
