“Collection Bias” and the Importance of Natural History Collections in Species Habitat Modeling: A Case Study Using Thoracophorus costalis Erichson (Coleoptera: Staphylinidae: Osoriinae), with a Critique of GBIF.org

Abstract When attempting to understand a species' distribution, knowing how many collections should be surveyed to achieve an adequate sample (exhaustiveness) is important. A test for exhaustiveness using species distribution models created with Diva-GIS was performed on county level locality information recorded from more than 4,900 specimens of Thoracophorus costalis Erichson (Staphylinidae: Osoriinae) borrowed from 38 collections. Size and location of distribution models based on specimens from single collections varied greatly, indicating “collection bias.” At least 15 collections needed to be combined before the resultant model averaged 90% of the area of a reference model created from all available specimens. By themselves, alternative distribution data from literature, Bugguide.net, and GBIF.org performed poorly, resulting in models with less than 15% the area of the reference model. Comments on the use of online data, the importance of maintaining and growing regional collections, and the future of natural history collections are included.

[1]  J. Elith,et al.  Species Distribution Models: Ecological Explanation and Prediction Across Space and Time , 2009 .

[2]  D. Pearson,et al.  The effects of scale and sample size on the accuracy of spatial predictions of tiger beetle (Cicindelidae) species richness , 1998 .

[3]  Vincent S. Smith,et al.  No specimen left behind: industrial scale digitization of natural history collections , 2012, ZooKeys.

[4]  Alberto Jiménez-Valverde,et al.  Limitations of Biodiversity Databases: Case Study on Seed‐Plant Diversity in Tenerife, Canary Islands , 2007, Conservation biology : the journal of the Society for Conservation Biology.

[5]  D. R. Robertson,et al.  Specimen collection: an essential tool. , 2014, Science.

[6]  Tim Newbold,et al.  Applications and limitations of museum data for conservation and ecology, with particular attention to species distribution models , 2010 .

[7]  David R. B. Stockwell,et al.  Effects of sample size on accuracy of species distribution models , 2002 .

[8]  J. L. Parra,et al.  Very high resolution interpolated climate surfaces for global land areas , 2005 .

[9]  S. Chatzimanolis Darwin’s legacy to rove beetles (Coleoptera, Staphylinidae): A new genus and a new species, including materials collected on the Beagle’s voyage , 2014, ZooKeys.

[10]  M. White,et al.  Measuring and comparing the accuracy of species distribution models with presence–absence data , 2011 .

[11]  M. Ferro Review of the Genus Thoracophorus (Coleoptera: Staphylinidae: Osoriinae) in North America North of Mexico, with a Key to Species , 2015 .

[12]  I. Kitching,et al.  Online solutions and the ‘Wallacean shortfall’: what does GBIF contribute to our knowledge of species' ranges? , 2013 .

[13]  Tim Sutton,et al.  How Global Is the Global Biodiversity Information Facility? , 2007, PloS one.

[14]  A. Suarez,et al.  The Value of Museum Collections for Research and Society , 2004 .

[15]  G. Cumming Using between‐model comparisons to fine‐tune linear models of species ranges , 2000 .

[16]  P. Hernandez,et al.  The effect of sample size and species characteristics on performance of different species distribution modeling methods , 2006 .

[17]  C. Nilsson,et al.  Future Climate Change Will Favour Non-Specialist Mammals in the (Sub)Arctics , 2012, PloS one.

[18]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[19]  O. Rojas-Soto,et al.  The importance of defining the geographic distribution of species for conservation: The case of the Bearded Wood-Partridge , 2012 .

[20]  R. Kadmon,et al.  EFFECT OF ROADSIDE BIAS ON THE ACCURACY OF PREDICTIVE MAPS PRODUCED BY BIOCLIMATIC MODELS , 2004 .

[21]  A. Peterson,et al.  Biodiversity informatics: managing and applying primary biodiversity data. , 2004, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[22]  G. Hardin,et al.  The Tragedy of the Commons , 1968, Green Planet Blues.

[23]  A. Peterson,et al.  New developments in museum-based informatics and applications in biodiversity analysis. , 2004, Trends in ecology & evolution.