Current statistical issues in Weed Research

Onofri A, Carbonell EA, Piepho H-P, Mortimer AM & Cousens RD (2010). Current statistical issues in Weed Research. Weed Research 50, 5–24.

Summary

The correct design of experimental studies, the selection of the appropriate statistical analysis of data and the efficient presentation of results are key to the good conduct and communication of science. The last Guidance for the use and presentation of statistics in Weed Research was published in 1988. Since then, there have been developments in both the scope of research covered by the journal and in the statistical techniques available. This paper addresses the changes in statistics and provides a reference work that will aid researchers in the design and analysis of their work. It will also provide guidance for editors and reviewers. The paper is organised into sections, which will aid the selection of relevant paragraphs, as we recognise that particular approaches require particular statistical analysis. It also uses examples, questions and checklists, so that non-specialists can work towards the correct approach. Statistics can be complex, so knowing when to seek specialist advice is important. The structure and layout of this contribution should help weed scientists, but it cannot provide a comprehensive guide to every technique. Therefore, we provide references to further reading. We would like to reinforce the idea that statistical methods are not a set of recipes whose mindless application is required by convention; each experiment or study may involve subtleties that these guidelines cannot cover. Nevertheless, we anticipate that this paper will help weed scientists in their initial designs for research, in the analysis of data and in the presentation of results for publication.
