Power analysis for generalized linear mixed models in ecology and evolution

‘Will my study answer my research question?’ is the most fundamental question a researcher can ask when designing a study, yet when phrased in statistical terms – ‘What is the power of my study?’ or ‘How precise will my parameter estimate be?’ – few researchers in ecology and evolution (EE) try to answer it, despite the detrimental consequences of performing under- or over-powered research. We suggest that this reluctance is due in large part to the unsuitability of simple methods of power analysis (broadly defined as any attempt to quantify prospectively the ‘informativeness’ of a study) for the complex models commonly used in EE research. With the aim of encouraging the use of power analysis, we present simulation from generalized linear mixed models (GLMMs) as a flexible and accessible approach to power analysis that can account for random effects, overdispersion and diverse response distributions. We illustrate the benefits of simulation-based power analysis in two research scenarios: estimating the precision of a survey to estimate tick burdens on grouse chicks and estimating the power of a trial to compare the efficacy of insecticide-treated nets in malaria mosquito control. We provide a freely available R function, sim.glmm, for simulating from GLMMs. Analysis of simulated data revealed that the effects of accounting for realistic levels of random effects and overdispersion on power and precision estimates were substantial, with correspondingly severe implications for study design in the form of up to fivefold increases in sampling effort. We also show the utility of simulations for identifying scenarios where GLMM-fitting methods can perform poorly. These results illustrate the inadequacy of standard analytical power analysis methods and the flexibility of simulation-based power analysis for GLMMs. The wider use of these methods should contribute to improving the quality of study design in EE.

[1]  G. Cumming Understanding the New Statistics: Effect Sizes, Confidence Intervals, and Meta-Analysis , 2011 .

[2]  C Arrowsmith,et al.  Analysis of aggregation, a worked example: numbers of ticks on red grouse chicks , 2001, Parasitology.

[3]  Roel Bosker,et al.  Standard Errors and Sample Sizes for Two-Level Research , 1993 .

[4]  E. Williams Experimental Designs Balanced for the Estimation of Residual Effects of Treatments , 1949 .

[5]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[6]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[7]  M. Taborsky Sample Size in the Study of Behaviour , 2010 .

[8]  H. Schielzeth,et al.  Conclusions beyond support: overconfident estimates in mixed models , 2008, Behavioral ecology : official journal of the International Society for Behavioral Ecology.

[9]  Denis Réale,et al.  Measuring individual differences in reaction norms in field and experimental studies: a power analysis of random regression models , 2011 .

[10]  A. Zuur,et al.  A Beginner’s Guide to GLM and GLMM with R: A Frequentist and Bayesian Perspective for Ecologists , 2013 .

[11]  Mirko S. Winkler,et al.  Efficacy of ICON® Maxx in the laboratory and against insecticide-resistant Anopheles gambiae in central Côte d'Ivoire , 2012, Malaria Journal.

[12]  David R. Anderson,et al.  Kullback-Leibler information as a basis for strong inference in ecological studies , 2001 .

[13]  C. I. Bliss,et al.  FITTING THE NEGATIVE BINOMIAL DISTRIBUTION TO BIOLOGICAL DATA AND NOTE ON THE EFFICIENT FITTING OF THE NEGATIVE BINOMIAL , 1953 .

[14]  Robert B. O'Hara,et al.  Do not log‐transform count data , 2010 .

[15]  F. Juanes,et al.  The importance of statistical power analysis: an example from Animal Behaviour , 1996, Animal Behaviour.

[16]  Mark Rowland,et al.  An experimental hut evaluation of Olyset® nets against anopheline mosquitoes after seven years use in Tanzanian villages , 2008, Malaria Journal.

[17]  Mollie E. Brooks,et al.  Generalized linear mixed models: a practical guide for ecology and evolution. , 2009, Trends in ecology & evolution.

[18]  Daniel R. Smith,et al.  Power rangers: no improvement in the statistical power of analyses published in Animal Behaviour , 2011, Animal Behaviour.

[19]  D. Bates,et al.  Linear Mixed-Effects Models using 'Eigen' and S4 , 2015 .

[20]  D. Barr,et al.  Random effects structure for confirmatory hypothesis testing: Keep it maximal. , 2013, Journal of memory and language.

[21]  A. Møller,et al.  A survey of the statistical power of research in behavioral ecology and animal behavior , 2003 .

[22]  Francis K C Hui,et al.  The arcsine is asinine: the analysis of proportions in ecology. , 2011, Ecology.

[23]  S. Hurlbert Pseudoreplication and the Design of Ecological Field Experiments , 1984 .

[24]  Martin Crowder,et al.  Beta-binomial Anova for Proportions , 1978 .

[25]  C. Gallagher Extending the Linear Model With R: Generalized Linear, Mixed Effects and Nonparametric Regression Models , 2007 .

[26]  Paul Johnson,et al.  Combining Organophosphate Treated Wall Linings and Long-lasting Insecticidal Nets for Improved Control of Pyrethroid Resistant Anopheles gambiae , 2014, PloS one.

[27]  D. Nussey,et al.  The evolutionary ecology of individual phenotypic plasticity in wild populations , 2007, Journal of evolutionary biology.

[28]  Richard Hooper,et al.  Versatile Sample-Size Calculation using Simulation , 2013 .

[29]  Alain F. Zuur,et al.  Zero inflated models and generalized linear mixed models with R , 2012 .

[30]  Tom A. B. Snijders Power and Sample Size in Multilevel Linear Models , 2005 .

[31]  C. Lengeler,et al.  Insecticide-treated bed nets and curtains for preventing malaria. , 2004, The Cochrane database of systematic reviews.

[32]  L. L. Eberhardt,et al.  Appraising Variability in Population Studies , 1978 .

[33]  Julian Di Stefano,et al.  How much power is enough? Against the development of an arbitrary convention for statistical power calculations , 2003 .

[34]  J. Hox,et al.  Sufficient Sample Sizes for Multilevel Modeling , 2005 .

[35]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[36]  Benjamin M. Bolker,et al.  Ecological Models and Data in R , 2008 .