Multiple Imputation Methods for Handling Missing Data in Cost-effectiveness Analyses That Use Data from Hierarchical Studies

Purpose. Multiple imputation (MI) has been proposed for handling missing data in cost-effectiveness analyses (CEAs). In CEAs that use cluster randomized trials (CRTs), the imputation model, like the analysis model, should recognize the hierarchical structure of the data. This paper contrasts a multilevel MI approach that recognizes clustering, with single-level MI and complete case analysis (CCA) in CEAs that use CRTs. Methods. We consider a multilevel MI approach compatible with multilevel analytical models for CEAs that use CRTs. We took fully observed data from a CEA that evaluated an intervention to improve diagnosis of active labor in primiparous women using a CRT (2078 patients, 14 clusters). We generated scenarios with missing costs and outcomes that differed, for example, according to the proportion with missing data (10%–50%), the covariates that predicted missing data (individual, cluster-level), and the missingness mechanism: missing completely at random (MCAR), missing at random (MAR), or missing not at random (MNAR). We estimated incremental net benefits (INBs) for each approach and compared them with the estimates from the fully observed data, the “true” INBs. Results. When costs and outcomes were assumed to be MCAR, the INBs for each approach were similar to the true estimates. When data were MAR, the point estimates from the CCA differed from the true estimates. Multilevel MI provided point estimates and standard errors closer to the true values than did single-level MI across all settings, including those in which a high proportion of observations had cost and outcome data MAR and when data were MNAR. Conclusions. Multilevel MI accommodates the multilevel structure of the data in CEAs that use cluster trials and provides accurate cost-effectiveness estimates across the range of circumstances considered.

[1]  M. Kenward,et al.  Small sample inference for fixed effects from restricted maximum likelihood. , 1997, Biometrics.

[2]  Harvey Goldstein,et al.  MULTILEVEL MODELLING NEWSLETTER , 2002 .

[3]  John Cairns,et al.  Improving costing methods in multicentre economic evaluation: the use of multiple imputation for unit costs. , 2009, Health economics.

[4]  Richard Grieve,et al.  Statistical Methods for Cost-Effectiveness Analyses That Use Data from Cluster Randomized Trials , 2012, Medical decision making : an international journal of the Society for Medical Decision Making.

[5]  Gladys McPherson,et al.  Simple imputation methods were inadequate for missing not at random (MNAR) quality of life data , 2008, Health and quality of life outcomes.

[6]  Nigel Rice,et al.  Cost-Effectiveness Analysis Using Data from Multinational Trials: The Use of Bivariate Hierarchical Modeling , 2007, Medical decision making : an international journal of the Society for Medical Decision Making.

[7]  John Cairns,et al.  Multilevel models for estimating incremental net benefits in multinational studies. , 2007, Health economics.

[8]  Marie Davidian,et al.  Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates , 2008, Biometrics.

[9]  James R Carpenter,et al.  Sensitivity analysis after multiple imputation under missing at random: a weighting approach , 2007, Statistical methods in medical research.

[10]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[11]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[12]  Harvey Goldstein,et al.  Multilevel models with multivariate mixed response types , 2009 .

[13]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[14]  Sean D Sullivan,et al.  The impact of using different imputation methods for missing quality of life scores on the estimation of the cost-effectiveness of lung-volume-reduction surgery. , 2009, Health economics.

[15]  Richard Grieve,et al.  Developing Appropriate Methods for Cost-Effectiveness Analysis of Cluster Randomized Trials , 2012, Medical decision making : an international journal of the Society for Medical Decision Making.

[16]  Patrick Royston,et al.  Multiple imputation using chained equations: Issues and guidance for practice , 2011, Statistics in medicine.

[17]  W. Manning,et al.  Estimating Log Models: To Transform or Not to Transform? , 1999, Journal of health economics.

[18]  Harvey Goldstein,et al.  REALCOM-IMPUTE Software for Multilevel Multiple Imputation with Mixed Response Types , 2011 .

[19]  S. Richardson,et al.  Strategy for Modelling Nonrandom Missing Data Mechanisms in Observational Studies Using Bayesian Methods , 2012 .

[20]  Andrew Briggs,et al.  Missing... presumed at random: cost-analysis of incomplete data. , 2003, Health economics.

[21]  Michael G. Kenward,et al.  Multiple Imputation and its Application: Carpenter/Multiple Imputation and its Application , 2013 .

[22]  Carol L. Armour,et al.  Consumer Preference for Dinoprostone Vaginal Gel Using Stated Preference Discrete Choice Modelling , 2012, PharmacoEconomics.

[23]  J. Schafer,et al.  Computational Strategies for Multivariate Linear Mixed-Effects Models With Missing Values , 2002 .

[24]  Michael G. Kenward,et al.  Multiple Imputation and its Application , 2013 .

[25]  A. Gelman,et al.  Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box , 2011 .

[26]  Jan B Oostenbrink,et al.  The analysis of incomplete cost data due to dropout. , 2005, Health economics.

[27]  Richard Grieve,et al.  Bayesian Hierarchical Models for Cost-Effectiveness Analyses that Use Data from Cluster Randomized Trials , 2010, Medical decision making : an international journal of the Society for Medical Decision Making.

[28]  Oliver Rivero-Arias,et al.  Evaluation of software for multiple imputation of semi-continuous data , 2007, Statistical methods in medical research.

[29]  Linda M. Collins,et al.  New methods for the analysis of change , 2001 .

[30]  Joseph L. Schafer,et al.  Multiple imputation with PAN. , 2001 .

[31]  Ofer Harel,et al.  Missing data techniques for multilevel data: implications of model misspecification , 2011 .

[32]  Michael G. Kenward,et al.  Handling missing values in cost effectiveness analyses that use data from cluster randomized trials , 2012, 1206.6070.

[33]  S. Noble,et al.  Missing data in trial-based cost-effectiveness analysis: the current state of play. , 2012, Health economics.

[34]  Paul McNamee,et al.  Women's preferences for aspects of labor management: results from a discrete choice experiment. , 2011, Birth.

[35]  A. Briggs,et al.  Statistical Analysis of Cost-effectiveness Data: Willan/Statistical Analysis of Cost-effectiveness Data , 2006 .

[36]  Graham Scotland Elicitation and application of preference values in economic evaluation : case studies in reproductive health , 2011 .

[37]  Simon G Thompson,et al.  Methods for incorporating covariate adjustment, subgroup analysis and between-centre differences into cost-effectiveness evaluations. , 2005, Health economics.

[38]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.

[39]  Rebecca R Andridge,et al.  Quantifying the impact of fixed effects modeling of clusters in multiple imputation for cluster randomized trials , 2011, Biometrical journal. Biometrische Zeitschrift.

[40]  Paul McNamee,et al.  Effects of algorithm for diagnosis of active labour: cluster randomised trial , 2008, BMJ : British Medical Journal.

[41]  Allan Donner,et al.  Imputation Strategies for Missing Continuous Outcomes in Cluster Randomized Trials , 2008, Biometrical journal. Biometrische Zeitschrift.

[42]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[43]  Richard Grieve,et al.  Methods for covariate adjustment in cost-effectiveness analysis that use cluster randomised trials. , 2012, Health economics.

[44]  Nicola J Cooper,et al.  Estimating the cost-effectiveness of an intervention in a clinical trial when partial cost information is available: a Bayesian approach. , 2008, Health economics.

[45]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[46]  Andrew Briggs,et al.  Statistical Analysis of Cost-Effectiveness Data , 2006 .