Recommendations for using the relative operating characteristic (ROC)

The relative operating characteristic (ROC) is a widely-used method to measure diagnostic signals including predictions of land changes, species distributions, and ecological niches. The ROC measures the degree to which presence for a Boolean variable is associated with high ranks of an index. The ROC curve plots the rate of true positives versus the rate of false positives obtained from the comparison between the Boolean variable and multiple diagnoses derived from thresholds applied to the index. The area under the ROC curve (AUC) is a summary metric, which is commonly reported and frequently criticized. Our manuscript recommends four improvements in the use and interpretation of the ROC curve and its AUC by: (1) highlighting important threshold points on the ROC curve, (2) interpreting the shape of the ROC curve, (3) defining lower and upper bounds for the AUC, and (4) mapping the density of the presence within each bin of the ROC curve. These recommendations encourage scientists to interpret the rich information that the ROC curve can reveal, in a manner that goes far beyond the potentially misleading AUC. We illustrate the benefit of our recommendations by assessing the prediction of land change in a suburban landscape.

[1]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[2]  S. Rosset,et al.  Roc Confidence Bands: An Empirical Study , 2005 .

[3]  R. Pontius,et al.  Death to Kappa: birth of quantity disagreement and allocation disagreement for accuracy assessment , 2011 .

[4]  J. Metzger,et al.  Is bird incidence in Atlantic forest fragments influenced by landscape patterns at multiple scales? , 2009, Landscape Ecology.

[5]  H. Possingham,et al.  Effects of landscape pattern on bird species distribution in the Mt. Lofty Ranges, South Australia , 2003, Landscape Ecology.

[6]  Kor de Jong,et al.  A method to analyse neighbourhood characteristics of land use patterns , 2004, Comput. Environ. Urban Syst..

[7]  R. Real,et al.  AUC: a misleading measure of the performance of predictive distribution models , 2008 .

[8]  Wisdom M. Dlamini,et al.  A Bayesian belief network analysis of factors influencing wildfire occurrence in Swaziland , 2010, Environ. Model. Softw..

[9]  R. Swihart,et al.  Modeling patch occupancy: Relative performance of ecologically scaled landscape indices , 2008, Landscape Ecology.

[10]  Harris David,et al.  A statistical explanation of MaxEnt for ecologists , 2013 .

[11]  Joanne M. Oldland,et al.  Edge geometry influences patch-level habitat use by an edge specialist in south-eastern Australia , 2008, Landscape Ecology.

[12]  Sassan Saatchi,et al.  Modeling distribution of Amazonian tree species and diversity using remote sensing measurements , 2008 .

[13]  Shota Mochizuki,et al.  Change in habitat selection by Japanese macaques (Macaca fuscata) and habitat fragmentation analysis using temporal remotely sensed data in Niigata Prefecture, Japan , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[14]  Robert P. Anderson,et al.  Maximum entropy modeling of species geographic distributions , 2006 .

[15]  J. Evans,et al.  Gradient modeling of conifer species using random forests , 2009, Landscape Ecology.

[16]  Peter H. Verburg,et al.  Statistical methods for analysing the spatial dimension of changes in land use and farming systems , 2005 .

[17]  Robert Gilmore Pontius,et al.  A Suite of Tools for ROC Analysis of Spatial Models , 2013, ISPRS Int. J. Geo Inf..

[18]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[19]  Robert Gilmore Pontius,et al.  The total operating characteristic to measure diagnostic ability for multiple thresholds , 2014, Int. J. Geogr. Inf. Sci..

[20]  I. Jolliffe,et al.  Forecast verification : a practitioner's guide in atmospheric science , 2011 .

[21]  A. Townsend Peterson,et al.  Rethinking receiver operating characteristic analysis applications in ecological niche modeling , 2008 .

[22]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[23]  Pablo Pacheco,et al.  Calibration and validation of a model of forest disturbance in the Western Ghats, India 1920–1990 , 2004 .

[24]  Mehryar Mohri,et al.  Confidence Intervals for the Area Under the ROC Curve , 2004, NIPS.

[25]  Alan B. Anderson,et al.  Prediction of multinomial probability of land use change using a bisection decomposition and logistic regression , 2007, Landscape Ecology.

[26]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[27]  Jesús Muñoz,et al.  Comparison of statistical methods commonly used in predictive modelling , 2004 .

[28]  Peter H. Verburg,et al.  Predictive ability of logistic regression, auto-logistic regression and neural network models in empirical land-use change modeling – a case study , 2011, Int. J. Geogr. Inf. Sci..