Experimental design matters for statistical analysis: how to handle blocking.

BACKGROUND Nowadays, evaluation of the effects of pesticides often relies on experimental designs that involve multiple concentrations of the pesticide of interest or multiple pesticides at specific comparable concentrations and, possibly, secondary factors of interest. Unfortunately, the experimental design is often more or less neglected when analysing data. Two data examples were analysed using different modelling strategies. First, in a randomized complete block design, mean heights of maize treated with a herbicide and one of several adjuvants were compared. Second, translocation of an insecticide applied to maize as a seed treatment was evaluated using incomplete data from an unbalanced design with several layers of hierarchical sampling. Extensive simulations were carried out to further substantiate the effects of different modelling strategies. RESULTS It was shown that results from suboptimal approaches (two-sample t-tests and ordinary ANOVA assuming independent observations) may be both quantitatively and qualitatively different from the results obtained using an appropriate linear mixed model. The simulations demonstrated that the different approaches may lead to differences in coverage percentages of confidence intervals and type 1 error rates, confirming that misleading conclusions can easily happen when an inappropriate statistical approach is chosen. CONCLUSION To ensure that experimental data are summarized appropriately, avoiding misleading conclusions, the experimental design should duly be reflected in the choice of statistical approaches and models. We recommend that author guidelines should explicitly point out that authors need to indicate how the statistical analysis reflects the experimental design. © 2017 Society of Chemical Industry.

[1]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[2]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[3]  T. Hothorn,et al.  Simultaneous Inference in General Parametric Models , 2008, Biometrical journal. Biometrische Zeitschrift.

[4]  P. Brockhoff,et al.  Tests in Linear Mixed Effects Models , 2015 .

[5]  Steven G. Luke,et al.  Evaluating significance in linear mixed-effects models in R , 2016, Behavior Research Methods.

[6]  C. Ritz,et al.  Simultaneous comparisons of treatments at multiple time points: Combined marginal models versus joint modeling , 2017, Statistical methods in medical research.

[7]  Russell V. Lenth,et al.  Least-Squares Means: The R Package lsmeans , 2016 .

[8]  W. Stroup Rethinking the Analysis of Non-Normal Data in Plant and Soil Science , 2015 .

[9]  Anthony R. Ives,et al.  Three points to consider when choosing a LM or GLM test for count data , 2016 .

[10]  G. Oehlert A first course in design and analysis of experiments , 2000 .

[11]  Per B. Brockhoff,et al.  lmerTest Package: Tests in Linear Mixed Effects Models , 2017 .

[12]  Stanley H. Cohen,et al.  Design and Analysis , 2010 .

[13]  C. Krupke,et al.  Translocation of the neonicotinoid seed treatment clothianidin in maize , 2017, PloS one.

[14]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[15]  R. Schäfer,et al.  Ecotoxicology is not normal , 2015, Environmental Science and Pollution Research.

[16]  Hans-Peter Piepho,et al.  A Hitchhiker's guide to mixed models for randomized experiments , 2003 .

[17]  Hans-Peter Piepho,et al.  A simulation study on tests of hypotheses and confidence intervals for fixed effects in mixed models for blocked experiments with missing data , 2005 .

[18]  O. Kempthorne,et al.  Introduction to experimental design , 1994 .

[19]  Ramon C. Littell,et al.  Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS , 2002 .

[20]  Piepho Analysing disease incidence data from designed experiments by generalized linear mixed models , 1999 .

[21]  L. Madden,et al.  Evaluation of Generalized Linear Mixed Models for Analyzing Disease Incidence Data Obtained in Designed Experiments. , 2002, Plant disease.