Predictive Models of Fish Species Distributions: A Note on Proper Validation and Chance Predictions

Abstract The prediction of species distributions is a primary goal in the study, conservation, and management of fisheries resources. Statistical models relating patterns of species presence or absence to multiscale habitat variables play an important role in this regard. Researchers, however, have paid little attention to how improper model validation and chance predictions can result in unfounded confidence in the performance and utility of such models. Using simulated and empirical data for 40 lake and stream fish species, we demonstrate that the commonly employed resubstitution approach to model validation (in which the same data are used for both model construction and prediction) produces highly biased estimates of correct classification rates and consequently an inaccurate perception of true model performance. In contrast, a jackknife approach to validation resulted in relatively unbiased estimates of model performance. The estimated rates of model correct classification are also shown to be substa...

[1]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .

[2]  Donald A. Jackson,et al.  Fish–Habitat Relationships in Lakes: Gaining Predictive and Explanatory Insight by Using Artificial Neural Networks , 2001 .

[3]  Heinz G. Stefan,et al.  Simulated Fish Habitat Changes in North American Lakes in Response to Projected Climate Warming , 2001 .

[4]  A. Peterson,et al.  Predicting Species Invasions Using Ecological Niche Modeling: New Approaches from Bioinformatics Attack a Pressing Problem , 2001 .

[5]  D. Pont,et al.  A probabilistic model characterizing fish assemblages of French rivers: a framework for environmental assessment , 2001 .

[6]  Donald A. Jackson,et al.  What controls who is where in freshwater fish communities the roles of biotic, abiotic, and spatial factors , 2001 .

[7]  Antoine Guisan,et al.  Predictive habitat distribution models in ecology , 2000 .

[8]  Paul H. Williams,et al.  Using probability of persistence to identify important areas for biodiversity conservation , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[9]  Richard H. Norris,et al.  DEVELOPMENT AND EVALUATION OF PREDICTIVE MODELS FOR MEASURING THE BIOLOGICAL INTEGRITY OF STREAMS , 2000 .

[10]  J. Rosenfeld,et al.  Predictive Models of Fish Species Distribution in the Blackwater Drainage, British Columbia , 2000 .

[11]  Julian D. Olden,et al.  Torturing data for the sake of generality: How valid are our regression models? , 2000 .

[12]  J. Olden Predictive models for freshwater fish community composition , 2000 .

[13]  S. Manel,et al.  Alternative methods for predicting species distribution: an illustration with Himalayan river birds , 1999 .

[14]  Amy L. Harig,et al.  DEFINING AND RESTORING BIOLOGICAL INTEGRITYIN WILDERNESS LAKES , 1998 .

[15]  Brian D. Richter,et al.  Threats to Imperiled Freshwater Fauna , 1997 .

[16]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[17]  C. Chatfield Model uncertainty, data mining and statistical inference , 1995 .

[18]  David O. Evans,et al.  Introduction of Lake Trout (Salvelinus namaycush) to Inland Lakes of Ontario, Canada: Factors Contributing to Successful Colonization , 1995 .

[19]  Robert M. Hughes,et al.  Biological Diversity and Biological Integrity: Current Concerns for Lakes and Streams , 1992 .

[20]  Andrew R. Solow,et al.  A Randomization Test for Misclassification Probability in Discriminant Analysis , 1990 .

[21]  David L. Verbyla,et al.  Resampling methods for evaluating classification accuracy of wildlife habitat models , 1989 .

[22]  B. Efron How Biased is the Apparent Error Rate of a Prediction Rule , 1986 .

[23]  Don E. McAllister,et al.  Fishes of North America Endangered, Threatened, or of Special Concern: 1989 , 1979 .