Using multiple imputation to estimate missing data in meta‐regression

Summary There is a growing need for scientific synthesis in ecology and evolution. In many cases, meta-analytic techniques can be used to complement such synthesis. However, missing data are a serious problem for any synthetic efforts and can compromise the integrity of meta-analyses in these and other disciplines. Currently, the prevalence of missing data in meta-analytic data sets in ecology and the efficacy of different remedies for this problem have not been adequately quantified. We generated meta-analytic data sets based on literature reviews of experimental and observational data and found that missing data were prevalent in meta-analytic ecological data sets. We then tested the performance of complete case removal (a widely used method when data are missing) and multiple imputation (an alternative method for data recovery) and assessed model bias, precision and multimodel rankings under a variety of simulated conditions using published meta-regression data sets. We found that complete case removal led to biased and imprecise coefficient estimates and yielded poorly specified models. In contrast, multiple imputation provided unbiased parameter estimates with only a small loss in precision. The performance of multiple imputation, however, was dependent on the type of data missing. It performed best when missing values were weighting variables, but performance was mixed when missing values were predictor variables. Multiple imputation performed poorly when imputing raw data which were then used to calculate effect size and the weighting variable. We conclude that complete case removal should not be used in meta-regression and that multiple imputation has the potential to be an indispensable tool for meta-regression in ecology and evolution. However, we recommend that users assess the performance of multiple imputation by simulating missing data on a subset of their data before implementing it to recover actual missing data.

[1]  Wolfgang Viechtbauer,et al.  Conducting Meta-Analyses in R with the metafor Package , 2010 .

[2]  A. Hrõbjartsson,et al.  Empirical evidence for selective reporting of outcomes in randomized trials: comparison of protocols to published articles. , 2004, JAMA.

[3]  C. Lortie,et al.  Publication and Related Biases , 2013 .

[4]  Dennis L. Murray,et al.  Tropics, trophics and taxonomy: the determinants of parasite‐associated host mortality , 2010 .

[5]  P. Boyle,et al.  Beyond classical meta-analysis: can inadequately reported studies be included? , 2004, Drug discovery today.

[6]  Trivellore E Raghunathan,et al.  What do we do with missing data? Some options for analysis of incomplete data. , 2004, Annual review of public health.

[7]  S. Thompson,et al.  How should meta‐regression analyses be undertaken and interpreted? , 2002, Statistics in medicine.

[8]  Simon P Blomberg,et al.  Extrinsic versus intrinsic factors in the decline and extinction of Australian marsupials , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[9]  M. Forbes,et al.  Variable reporting and quantitative reviews: a comparison of three meta‐analytical techniques , 2003 .

[10]  Duncan N. L. Menge,et al.  Gauging the impact of meta-analysis on ecology , 2012, Evolutionary Ecology.

[11]  Shinichi Nakagawa,et al.  Methodological issues and advances in biological meta-analysis , 2012, Evolutionary Ecology.

[12]  Ben Vandermeer,et al.  A systematic review identifies a lack of standardization in methods for handling missing variance data. , 2006, Journal of clinical epidemiology.

[13]  Shinichi Nakagawa,et al.  Missing inaction: the dangers of ignoring missing data. , 2008, Trends in ecology & evolution.

[14]  Nik Ruzni Nik Idris,et al.  The Effects of Imputing the Missing Standard Deviations on the Standard Error of Meta Analysis Estimates , 2009, Commun. Stat. Simul. Comput..

[15]  Jessica Gurevitch,et al.  STATISTICAL ISSUES IN ECOLOGICAL META‐ANALYSES , 1999 .

[16]  Marc Mangel,et al.  Accelerate Synthesis in Ecology and Environmental Sciences , 2009 .

[17]  Shinichi Nakagawa,et al.  The influence of male age on within‐pair and extra‐pair paternity in passerines , 2012 .

[18]  Kate E. Jones,et al.  PanTHERIA: a species‐level database of life history, ecology, and geography of extant and recently extinct mammals , 2009 .

[19]  D. Moher,et al.  The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials. , 2001, Journal of the American Podiatric Medical Association.

[20]  P. Boyle,et al.  A meta‐analysis of trials of transurethral needle ablation for treating symptomatic benign prostatic hyperplasia , 2004, BJU international.

[21]  Douglas G Altman,et al.  Comparison of imputation methods for handling missing covariate data when fitting a Cox proportional hazards model: a resampling study , 2010, BMC medical research methodology.

[22]  M. Lajeunesse 13. Recovering Missing or Partial Data from Studies: A Survey of Conversions and Imputations for Meta-analysis , 2013 .

[23]  R. Freckleton,et al.  Model averaging, missing data and multiple imputation: a case study for behavioural ecology , 2010, Behavioral Ecology and Sociobiology.

[24]  V. Bala Chaudhary,et al.  Advancing Synthetic Ecology: A Database System to Facilitate Complex Ecological Meta-Analyses , 2010 .

[25]  Jessica Gurevitch,et al.  A Meta-Analysis of Competition in Field Experiments , 1992, The American Naturalist.

[26]  G. Smith,et al.  Bias in meta-analysis detected by a simple, graphical test , 1997, BMJ.

[27]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[28]  Dennis Murray,et al.  Raccoon ecology database: A resource for population dynamics modelling and meta-analysis , 2008, Ecol. Informatics.

[29]  S. Higgins,et al.  TRY – a global database of plant traits , 2011, Global Change Biology.

[30]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[31]  Wim Van den Noortgate,et al.  Estimation of the predictive power of the model in mixed-effects meta-regression: A simulation study. , 2014, The British journal of mathematical and statistical psychology.

[32]  G. Arnqvist,et al.  Meta-analysis: synthesizing research findings in ecology and evolution. , 1995, Trends in ecology & evolution.

[33]  P C Lambert,et al.  Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future , 2003, British Journal of Cancer.

[34]  A. Garg,et al.  Imputing variance estimates do not alter the conclusions of a meta-analysis with continuous outcomes: a case study of changes in renal function after living kidney donation. , 2007, Journal of clinical epidemiology.

[35]  A. Gelman,et al.  Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box , 2011 .