Validation and calibration of probabilistic predictions in ecology

Summary Predictive models in ecology are important for guiding policy and management. However, they are necessarily abstractions of natural systems, making predictive validation imperative. Models, which make predictions about binary outcomes (e.g. Species distribution models, population viability analysis, disease/invasion models), are widespread in the ecological literature. When supporting probability-based management decisions, these predictions need to be assessed with respect to the degree to which predicted probabilities agree with future outcomes. Many predictive models are not validated using external data and are often only assessed in terms of their ability to discriminate between outcomes rather than the degree to which they predicted the correct probabilities. We develop a novel Validation Metric Applied to Probabilistic Predictions (VMAPP), which provides a goodness-of-fit test of calibration for probabilistic prediction models using binary data (e.g. presences and absences in models of species distributions). We analyse the theoretic properties of this test and compare its performance against existing methods, and apply it to a published model in invasion biology, which forecasts the establishment probability of the zooplanktivorous spiny water flea (Bythotrephes longimanus). We selected 102 additional sites to sample four years after the training data were collected and use this independently collected data to assess predictive reliability using VMAPP. Theoretic simulation analysis shows that VMAPP outperforms existing metrics (Cox's regression technique and Hosmer & Lemeshow's χ2 test) in terms of statistical power to identify model miscalibration. Further, we find that under realistic conditions where model parameters are estimated (and have associated uncertainty) that VMAPP is more robust, retaining the appropriate type-I error rates (5%) where previous metrics fail (≤17%). Application of VMAPP to a published invasion model using empirical validation data shows that in addition to having high discriminative power, the model's probabilistic predictions agree with the observed outcomes as measured by VMAPP. We argue that quantifying ecological predictions as probabilities with associated uncertainty provides the most useful information to support management decisions. Ecological predictions, while uncertain, should still be rigorously validated. Identifying the circumstances in which our predictions deviate from observation can further inform the next generation of the model, bringing prediction and reality ever closer.

[1]  Michael A. McCarthy,et al.  A Bayesian model of metapopulation viability, with application to an endangered amphibian , 2013 .

[2]  John M. Drake,et al.  Allee Effects, Propagule Pressure and the Probability of Establishment: Risk Analysis for Biological Invasions , 2006, Biological Invasions.

[3]  Phil A. Graniero,et al.  BACKCASTING AND FORECASTING BIOLOGICAL INVASIONS OF INLAND LAKES , 2004 .

[4]  Jialiang Li,et al.  Variable and threshold selection to control predictive accuracy in logistic regression , 2014 .

[5]  Brian Dennis,et al.  ESTIMATING POPULATION TREND AND PROCESS VARIATION FOR PVA IN THE PRESENCE OF SAMPLING ERROR , 2004 .

[6]  Volker Grimm,et al.  Ecological models supporting environmental decision making: a strategy for the future. , 2010, Trends in ecology & evolution.

[7]  Antoine Guisan,et al.  Predictive habitat distribution models in ecology , 2000 .

[8]  M. Fortin,et al.  Measuring ecological niche overlap from occurrence and spatial environmental data , 2012 .

[9]  T. Hastie,et al.  Presence‐Only Data and the EM Algorithm , 2009, Biometrics.

[10]  Russell Steele,et al.  The value of a datum – how little data do we need for a quantitative risk analysis? , 2013 .

[11]  Simon Ferrier,et al.  Evaluating the predictive performance of habitat models developed using logistic regression , 2000 .

[12]  Trevor Hastie,et al.  A statistical explanation of MaxEnt for ecologists , 2011 .

[13]  W. Jetz,et al.  Unravelling the structure of species extinction risk for predictive conservation science , 2011, Proceedings of the Royal Society B: Biological Sciences.

[14]  B. Leung,et al.  Importing risk: quantifying the propagule pressure–establishment relationship at the pathway level , 2013 .

[15]  H. Resit Akçakaya,et al.  Predictive accuracy of population viability analysis in conservation biology , 2000, Nature.

[16]  R. Ohlemüller,et al.  Rapid Range Shifts of Species Associated with High Levels of Climate Warming , 2011, Science.

[17]  W. Thuiller,et al.  Predicting species distribution: offering more than simple habitat models. , 2005, Ecology letters.

[18]  A. H. Murphy,et al.  What Is a Good Forecast? An Essay on the Nature of Goodness in Weather Forecasting , 1993 .

[19]  Gretchen G. Moisen,et al.  A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and Kappa , 2008 .

[20]  Michael T. Gastner,et al.  The risk of marine bioinvasion caused by global shipping. , 2013, Ecology letters.

[21]  Jane Elith,et al.  POC plots: calibrating species distribution models with presence-only data. , 2010, Ecology.

[22]  S. Higgins,et al.  Validation of a spatial simulation model of a spreading alien plant population , 2001 .

[23]  R. Meentemeyer,et al.  Equilibrium or not? Modelling potential distribution of invasive species in different stages of invasion , 2012 .

[24]  Joseph S. Meyer,et al.  Estimating Uncertainty in Population Growth Rates: Jackknife vs. Bootstrap Techniques , 1986 .

[25]  S. Richards,et al.  Prevalence, thresholds and the performance of presence–absence models , 2014 .

[26]  M. Burgman,et al.  Risks, decisions and biological conservation , 2013 .

[27]  A. Townsend Peterson,et al.  Rethinking receiver operating characteristic analysis applications in ecological niche modeling , 2008 .

[28]  Simon J. Mason,et al.  Understanding forecast verification statistics , 2008 .

[29]  S. Carpenter,et al.  Ecological forecasts: an emerging imperative. , 2001, Science.

[30]  N. Yan,et al.  Bythotrephes cederstroemi (Schoedler) in Muskoka Lakes: First Records of the European Invader in Inland Lakes in Canada , 1992 .

[31]  S L Hui,et al.  Validation techniques for logistic regression models. , 1991, Statistics in medicine.

[32]  B. Leung,et al.  Predicting the spread of invasive species in an uncertain world: accommodating multiple vectors and gaps in temporal and spatial data for Bythotrephes longimanus , 2011, Biological Invasions.

[33]  Antoine Guisan,et al.  Are niche-based species distribution models transferable in space? , 2006 .

[34]  Catherine A Calder,et al.  Accounting for uncertainty in ecological analysis: the strengths and limitations of hierarchical statistical modeling. , 2009, Ecological applications : a publication of the Ecological Society of America.

[35]  K. Fausch,et al.  Probabilistic accounting of uncertainty in forecasts of species distributions under climate change , 2013, Global change biology.

[36]  I. Jolliffe Uncertainty and Inference for Verification Measures , 2007 .

[37]  Jennifer Brown,et al.  New measures for assessing model equilibrium and prediction mismatch in species distribution models , 2013 .

[38]  M. Araújo,et al.  Validation of species–climate impact models under climate change , 2005 .

[39]  B. Leung,et al.  Propagule pressure, allee effects and the probability of establishment of an invasive species (Bythotrephes longimanus). , 2011 .

[40]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .