Optimizing sampling approaches along ecological gradients

Summary Natural scientists and especially ecologists use manipulative experiments or field observations along gradients to differentiate patterns driven by processes from those caused by random noise. A well-conceived sampling design is essential for identifying, analysing and reporting underlying patterns in a statistically solid and reproducible manner, given the normal restrictions in labour, time and money. However, a technical guideline about an adequate sampling design to maximize prediction success under restricted resources is lacking. This study aims at developing such a solid and reproducible guideline for sampling along gradients in all fields of ecology and science in general. We conducted simulations with artificial data for five common response types known in ecology, each represented by a simple function (no response, linear, exponential, symmetric unimodal and asymmetric unimodal). In the simulations, we accounted for different levels of random and systematic error, the two sources of noise in ecological data. We quantified prediction success for varying total sample size, number of locations sampled along a spatial/temporal gradient and number of replicates per sampled location. The number of replicates becomes more important with increasing random error, whereas replicates become less relevant for a systematic error bigger than 20% of total variation. Thus, if high levels of systematic error are indicated or expected (e.g. in field studies with spatial autocorrelation, unaccountable additional environmental drivers or population clustering), continuous sampling with little to no replication is recommended. In contrast, sampling designs with replications are recommended in studies that can control for systematic errors. In a setting that is characteristic for ecological experiments and field studies strictly controlling for undeterminable systematic error (random error ≥10% and systematic error ≤10% of total variation), prediction success was best for an intermediate number of sampled locations along the gradient (10–15) and a low number of replicates per location (3). Our findings from reproducible, statistical simulations will help design appropriate and efficient sampling approaches and avoid erroneous conclusions based on studies with flawed sampling design, which is currently one of the main targets of public criticism against science.

[1]  R. Whittaker Evolution and measurement of species diversity , 1972 .

[2]  J. T. Curtis,et al.  An Upland Forest Continuum in the Prairie‐Forest Border Region of Wisconsin , 1951 .

[3]  Marie-Josée Fortin,et al.  Spatial autocorrelation and sampling design in plant ecology , 1989, Vegetatio.

[4]  André I. Khuri,et al.  Response surface methodology: 1966–1988 , 1989 .

[5]  Leif D. Nelson,et al.  False-Positive Psychology , 2011, Psychological science.

[6]  James E. Bartlett,et al.  Organizational research: Determining appropriate sample size in survey research , 2001 .

[7]  R. Whittaker,et al.  GRADIENT ANALYSIS OF VEGETATION* , 1967, Biological reviews of the Cambridge Philosophical Society.

[8]  N. C. Kenkel,et al.  On sampling procedures in population and community ecology , 1989, Vegetatio.

[9]  J. Dengler,et al.  Which function describes the species–area relationship best? A review and empirical evaluation , 2009 .

[10]  S. Hurlbert Pseudoreplication and the Design of Ecological Field Experiments , 1984 .

[11]  J. Connell Diversity in tropical rain forests and coral reefs. , 1978, Science.

[12]  H. Olff,et al.  A hierarchical set of models for species response analysis , 1993 .

[13]  James S. Clark Uncertainty in Ecological Inference and Forecasting1 , 2003 .

[14]  Elli Tzirkalli,et al.  Thresholds and the species–area relationship: a synthetic analysis of habitat island datasets , 2014 .

[15]  Marie-Josée Fortin,et al.  Design for simultaneous sampling of ecological variables: from concepts to numerical solutions , 1989 .

[16]  P. White,et al.  On the existence of ecological communities , 1994 .

[17]  Michael C. Frank,et al.  Estimating the reproducibility of psychological science , 2015, Science.

[18]  J. Grinnell The Niche-Relationships of the California Thrasher , 1917 .

[19]  V. Adriaenssens,et al.  Uncertainty in ecological status assessments of lakes and rivers using diatoms , 2009, Hydrobiologia.

[20]  J. T. Curtis,et al.  An Ordination of the Upland Forest Communities of Southern Wisconsin , 1957 .

[21]  Carsten Rahbek,et al.  The patterns and causes of elevational diversity gradients , 2012 .

[22]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[23]  R. P. Benedict,et al.  ASME Measurement Uncertainty , 1985 .

[24]  B. Newell,et al.  Priming Intelligent Behavior: An Elusive Phenomenon , 2013, PloS one.

[25]  J. Wicherts,et al.  The Rules of the Game Called Psychological Science , 2012, Perspectives on psychological science : a journal of the Association for Psychological Science.

[26]  G. Box,et al.  On the Experimental Attainment of Optimum Conditions , 1951 .

[27]  C. Begley,et al.  Drug development: Raise standards for preclinical cancer research , 2012, Nature.

[28]  F. Prinz,et al.  Believe it or not: how much can we rely on published data on potential drug targets? , 2011, Nature Reviews Drug Discovery.

[29]  Klara Dolos,et al.  Current measures for distance decay in similarity of species composition are influenced by study extent and grain size , 2012 .

[30]  M. Bertness,et al.  Positive interactions in communities. , 1994, Trends in ecology & evolution.

[31]  Jay T. Lennon,et al.  Knowing when to draw the line: designing more informative ecological experiments , 2005 .

[32]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[33]  Douglas W. Nychka,et al.  Tools for Spatial Data , 2016 .

[34]  J. Eckblad How many samples should be taken , 1991 .

[35]  C.J.F. ter Braak,et al.  A Theory of Gradient Analysis , 2004 .

[36]  Thomas Wohlgemuth,et al.  Precipitation manipulation experiments--challenges and recommendations for the future. , 2012, Ecology letters.

[37]  R. M. Nally Regression and model-building in conservation biology, biogeography and ecology: The distinction between – and reconciliation of – ‘predictive’ and ‘explanatory’ models , 2000, Biodiversity & Conservation.

[38]  H. Gleason,et al.  The individualistic concept of the plant association , 1939 .

[39]  Brian A. Nosek,et al.  Power failure: why small sample size undermines the reliability of neuroscience , 2013, Nature Reviews Neuroscience.

[40]  Juergen Kreyling,et al.  Global Change Experiments: Challenges and Opportunities , 2015 .

[41]  Mike P. Austin,et al.  Models for the analysis of species’ response to environmental gradients , 1987 .

[42]  James F. Reynolds,et al.  Is the change of plant–plant interactions with abiotic stress predictable? A meta‐analysis of field results in arid environments , 2005 .

[43]  Alfred Stein,et al.  An overview of spatial sampling procedures and experimental design of spatial studies for ecosystem comparisons , 2003 .