Prevalence affects the evaluation of discrimination capacity in presence-absence species distribution models

The aim of this study is to understand how prevalence—the ratio of instances of presence to total sample size—affects the estimation of three discrimination indexes commonly used in distribution modelling: the area under the receiver operating characteristic curve (AUC), the value of sensitivity at the threshold where sensitivity equals specificity (Se*), and the maximum value of the Youden index or true skill statistic (Y). For four sample size levels, samples of suitability scores for the instances of presence and absence with varying prevalences were simulated from known distributions so that the true values of the discrimination indexes were known, and the three indexes were empirically estimated (AUCest, Se*est, Yest). AUCest and Se*est are unbiased estimators, and the greatest precision is achieved with a balanced prevalence. As sample size increases, there is a larger prevalence interval around 0.5 in which precision is more or less stable. As a rule of thumb, in the case of n ≤ 100, at least ten observations of the rare state (either instances of presence or absence) should be considered, whereas the safety prevalence interval [0.01, 0.99] should be used for higher sample sizes. The lower the true discriminative power of the models, the higher the negative effect of prevalence. Yest is positively biased, and bias and precision become worse towards low and high prevalences. Highly unbalanced prevalences increase the imprecision in estimating the discrimination capacity of the models. Y is not recommended as a discrimination measure since it provides overoptimistic results.

[1]  N. Zimmermann,et al.  Habitat Suitability and Distribution Models: With Applications in R , 2017 .

[2]  Robert P. Anderson,et al.  Ecological Niches and Geographic Distributions , 2011 .

[3]  Enrique F Schisterman,et al.  The Youden Index and the Optimal Cut‐Point Corrected for Measurement Error , 2005, Biometrical journal. Biometrische Zeitschrift.

[4]  A. Jiménez‐Valverde Threshold-dependence as a desirable attribute for discrimination assessment: implications for the evaluation of species distribution models , 2014, Biodiversity and Conservation.

[5]  J. Hilden The Area under the ROC Curve and Its Competitors , 1991, Medical decision making : an international journal of the Society for Medical Decision Making.

[6]  Geoffrey I. Webb,et al.  On the Application of ROC Analysis to Predict Classification Performance Under Varying Class Distributions , 2005, Machine Learning.

[7]  R. Koenker Quantile Regression: Name Index , 2005 .

[8]  A. Jiménez‐Valverde,et al.  The ghost of unbalanced species distribution data in geographical model predictions , 2006 .

[9]  J. Lobo,et al.  Threshold criteria for conversion of probability of species presence to either–or presence–absence , 2007 .

[10]  V. Muggeo Estimating regression models with unknown break‐points , 2003, Statistics in medicine.

[11]  F. Jiguet,et al.  Selecting pseudo‐absences for species distribution models: how, where and how many? , 2012 .

[12]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[13]  B. Reiser,et al.  Estimation of the Youden Index and its Associated Cutoff Point , 2005, Biometrical journal. Biometrische Zeitschrift.

[14]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[15]  H Brenner,et al.  Variation of sensitivity, specificity, likelihood ratios and predictive values with disease prevalence. , 1997, Statistics in medicine.

[16]  J Hilden,et al.  Regret graphs, diagnostic uncertainty and Youden's Index. , 1996, Statistics in medicine.

[17]  I. Somodi,et al.  Prevalence dependence in model goodness measures with special emphasis on true skill statistics , 2017, Ecology and evolution.

[18]  Adam B. Smith On evaluating species distribution models with random background sites in place of absences when test presences disproportionately sample suitable habitat , 2013 .

[19]  Mauro Fois,et al.  Using species distribution models at local scale to guide the search of poorly known species: Review, methodological issues and future directions , 2018, Ecological Modelling.

[20]  M. Leeflang,et al.  Bias in sensitivity and specificity caused by data-driven selection of optimal cutoff values: mechanisms, magnitude, and solutions. , 2008, Clinical chemistry.

[21]  A. Márcia Barbosa,et al.  Discrimination capacity in species distribution models depends on the representativeness of the environmental domain , 2013 .

[22]  Stanley Lemeshow,et al.  Applied Logistic Regression, Second Edition , 1989 .

[23]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[24]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[25]  A. Jiménez‐Valverde Sample size for the evaluation of presence-absence models , 2020 .

[26]  Céline Bellard,et al.  Without quality presence–absence data, discrimination metrics such as TSS can be misleading measures of model performance , 2018, Journal of Biogeography.

[27]  A. Jiménez‐Valverde Insights into the area under the receiver operating characteristic curve (AUC) as a discrimination measure in species distribution modelling , 2012 .

[28]  B. Reiser,et al.  Estimation of the area under the ROC curve , 2002, Statistics in medicine.

[29]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .

[30]  D. Rogers,et al.  The effects of species’ range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact? , 2004 .

[31]  Truly Santika Assessing the effect of prevalence on the predictive performance of species distribution models using simulated data , 2011 .

[32]  David J. Hand,et al.  ROC Curves for Continuous Data , 2009 .

[33]  J. L. McCune Species distribution models predict rare species occurrences despite significant effects of landscape context , 2016 .

[34]  M. Muggeo,et al.  segmented: An R package to Fit Regression Models with Broken-Line Relationships , 2008 .

[35]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[36]  R. Real,et al.  AUC: a misleading measure of the performance of predictive distribution models , 2008 .

[37]  Ben Ewald,et al.  Post hoc choice of cut points introduced bias to diagnostic research. , 2006, Journal of clinical epidemiology.

[38]  Elisa M. Molanes-López,et al.  Confidence intervals for the symmetry point: an optimal cutpoint in continuous diagnostic tests , 2016, Pharmaceutical statistics.

[39]  Jennifer A. Miller,et al.  Mapping Species Distributions: Spatial Inference and Prediction , 2010 .

[40]  G. Foody Impacts of imperfect reference data on the apparent accuracy of species presence–absence models and their predictions , 2011 .

[41]  Omri Allouche,et al.  Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS) , 2006 .

[42]  N. Perkins,et al.  Optimal Cut-point and Its Corresponding Youden Index to Discriminate Individuals Using Pooled Blood Samples , 2005, Epidemiology.

[43]  Matthew J. Smith,et al.  Using species distribution models to inform IUCN Red List assessments , 2014 .

[44]  Alberto Jiménez-Valverde,et al.  The uncertain nature of absences and their importance in species distribution modelling , 2010 .