How Global Is the Global Biodiversity Information Facility?

There is a concerted global effort to digitize biodiversity occurrence data from herbarium and museum collections, which together offer an unparalleled archive of life on Earth over the past few centuries. The Global Biodiversity Information Facility (GBIF) provides the largest single gateway to these data. Since 2004 it has provided a single point of access to specimen data from databases of biological surveys and collections. Biologists now have rapid access to more than 120 million observations for use in many biological analyses. We investigate the quality and coverage of the digitally available data from the perspective of a biologist seeking distribution data for spatial analysis on a global scale. We present an example of automatic verification of geographic data, using distributions from the International Legume Database and Information Service to test empirically the geographic coverage and accuracy of the records. There are over half a million records covering 31% of all legume species, and 84% of these records pass geographic validation. These data are not yet a global biodiversity resource for all species or all countries. A user will encounter many biases and gaps in these data, which should be understood before the data are used or analyzed. The data are notably deficient in many of the world's biodiversity hotspots. These deficiencies in coverage can be resolved by applying more resources to digitizing and publishing data throughout these most diverse regions. But in the push to provide ever more data online, we should not forget that consistent data quality is of paramount importance if the data are to be useful in capturing a meaningful picture of life on Earth.
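
The automatic geographic verification mentioned in the abstract can be pictured as a lookup of each occurrence record against a reference distribution for its species. The sketch below is a minimal illustration of that idea, not the authors' implementation. It assumes each record has already been assigned a region code from its coordinates and that the reference source lists the regions where each species is known to occur. The names Occurrence, validate and species_coverage, and the placeholder species and region codes, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Occurrence:
    species: str
    region: str  # hypothetical region code, assumed to be derived from coordinates


def validate(records, known_ranges):
    """Split records into (passed, failed, unchecked).

    known_ranges maps a species name to the set of region codes in which
    the reference source says it occurs (an assumed data layout).
    """
    passed, failed, unchecked = [], [], []
    for rec in records:
        regions = known_ranges.get(rec.species)
        if regions is None:
            # No reference distribution for this species: cannot be checked.
            unchecked.append(rec)
        elif rec.region in regions:
            passed.append(rec)
        else:
            failed.append(rec)
    return passed, failed, unchecked


def species_coverage(records, known_ranges):
    """Fraction of reference species that have at least one record."""
    if not known_ranges:
        return 0.0
    seen = {rec.species for rec in records if rec.species in known_ranges}
    return len(seen) / len(known_ranges)


# Placeholder data for illustration only; not real distributions.
known_ranges = {"sp_a": {"R1", "R2"}, "sp_b": {"R3"}, "sp_c": {"R4"}}
records = [
    Occurrence("sp_a", "R1"),
    Occurrence("sp_a", "R9"),
    Occurrence("sp_b", "R3"),
    Occurrence("sp_x", "R1"),
]
passed, failed, unchecked = validate(records, known_ranges)
pass_rate = len(passed) / (len(passed) + len(failed))
coverage = species_coverage(records, known_ranges)
```

Here pass_rate plays the role of the share of records passing geographic validation and coverage the share of species represented. Records for species missing from the reference source are kept apart so that they do not count as failures.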
