The priority of prediction in ecological understanding

The objective of science is to understand the natural world; we argue that prediction is the only way to demonstrate scientific understanding, implying that prediction should be a fundamental aspect of all scientific disciplines. Reproducibility is an essential requirement of good science and arises from the ability to develop models that make accurate predictions on new data. Ecology, however, with a few exceptions, has abandoned prediction as a central focus and faces its own crisis of reproducibility. Models are where ecological understanding is stored and they are the source of all predictions – no prediction is possible without a model of the world. Models can be improved in three ways: model variables, functional relationships among dependent and independent variables, and in parameter estimates. Ecologists rarely test to assess whether new models have made advances by identifying new and important variables, elucidating functional relationships, or improving parameter estimates. Without these tests it is difficult to know if we understand more today than we did yesterday. A new commitment to prediction in ecology would lead to, among other things, more mature (i.e. quantitative) hypotheses, prioritization of modeling techniques that are more appropriate for prediction (e.g. using continuous independent variables rather than categorical) and, ultimately, advancement towards a more general understanding of the natural world. Synthesis Ecology, with a few exceptions, has abandoned prediction and therefore the ability to demonstrate understanding. Here we address how this has inhibited progress in ecology and explore how a renewed focus on prediction would benefit ecologists. The lack of emphasis on prediction has resulted in a discipline that tests qualitative, imprecise hypotheses with little concern for whether the results are generalizable beyond where and when the data were collected. A renewed commitment to prediction would allow ecologists to address critical questions about the generalizability of our results and the progress we are making towards understanding the natural world.

[1]  Michael C. Frank,et al.  Estimating the reproducibility of psychological science , 2015, Science.

[2]  Ming Zhang,et al.  Artificial Higher Order Neural Networks for Economics and Business , 2008 .

[3]  Jorma Rissanen,et al.  Minimum Description Length Principle , 2010, Encyclopedia of Machine Learning.

[4]  P. Petraitis,et al.  Regression versus ANOVA (Peer-Reviewed Letter) , 2005 .

[5]  Owen L. Petchey,et al.  The ecological forecast horizon, and examples of its uses and determinants , 2015, bioRxiv.

[6]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[7]  Tim G Benton,et al.  Predictive ecology: systems approaches , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[8]  Katherine L. Gross,et al.  WHAT IS THE OBSERVED RELATIONSHIP BETWEEN SPECIES RICHNESS AND PRODUCTIVITY , 2001 .

[9]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[10]  M. Plank,et al.  Information on Biotic Interactions Improves Transferability of Distribution Models , 2014, The American Naturalist.

[11]  Julian D. Olden,et al.  Assessing transferability of ecological models: an underappreciated aspect of statistical validation , 2012 .

[12]  Steven F Railsback,et al.  Pattern-oriented modelling: a ‘multi-scope’ for predictive systems ecology , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[13]  R. Whittaker,et al.  Meta-analyses and mega-mistakes: calling time on meta-analysis of the species richness-productivity relationship. , 2010, Ecology.

[14]  Mevin B. Hooten,et al.  A guide to Bayesian model selection for ecologists , 2015 .

[15]  Peter B Adler,et al.  Productivity Is a Poor Predictor of Plant Species Richness , 2011, Science.

[16]  J. Downing,et al.  Biodiversity and stability in grasslands , 1996, Nature.

[17]  B. McGill,et al.  Testing the predictive performance of distribution models , 2013 .

[18]  S. Koreisha,et al.  The impact of measurement errors on ARMA prediction , 1999 .

[19]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[20]  P. Dillon,et al.  A Simple Method for Predicting the Capacity of a Lake for Development Based on Lake Trophic Status , 1975 .

[21]  J. Pearl Causal inference in statistics: An overview , 2009 .

[22]  Lawrence B. Slobodkin,et al.  A Critique for Ecology , 1991 .

[23]  J. Connell The Influence of Interspecific Competition and Other Factors on the Distribution of the Barnacle Chthamalus Stellatus , 1961 .

[24]  D. Murray,et al.  Regression versus Anova , 2022 .

[25]  Robert P Freckleton,et al.  Making predictive ecology more relevant to policy makers and practitioners , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[26]  David R. Anderson,et al.  Null Hypothesis Testing: Problems, Prevalence, and an Alternative , 2000 .

[27]  B. McGill,et al.  Mechanisms in macroecology: AWOL or purloined letter? Towards a pragmatic view of mechanism , 2010 .

[28]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[29]  Jay T. Lennon,et al.  Knowing when to draw the line: designing more informative ecological experiments , 2005 .

[30]  M. Sakamoto,et al.  Primary production by phytoplankton community in some Japanese lakes and its dependence on lake depth , 1966 .

[31]  R. Freckleton The problems of prediction and scale in applied ecology: the example of fire as a management tool , 2004 .

[32]  D. Schindler,et al.  Eutrophication and Recovery in Experimental Lakes: Implications for Lake Management , 1974, Science.

[33]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[34]  Annette Ostling,et al.  On Theory in Ecology , 2014 .

[35]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[36]  R. Dennis Cook,et al.  Cross-Validation of Regression Models , 1984 .

[37]  P. Inchausti,et al.  Investigating long-term ecological variability using the Global Population Dynamics Database. , 2001, Science.

[38]  Judea Pearl,et al.  ON THE CONNECTION BETWEEN THE COMPLEXITY AND CREDIBILITY OF INFERRED MODELS , 1978 .

[39]  James B. Grace,et al.  Structural Equation Modeling and Natural Systems , 2006 .

[40]  M. R. Evans,et al.  Modelling ecological systems in a changing world , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.