The World Bacterial Biogeography and Biodiversity through Databases: A Case Study of NCBI Nucleotide Database and GBIF Database

Databases are an essential tool and resource within the field of bioinformatics. The primary aim of this study was to generate an overview of global bacterial biodiversity and biogeography using available data from the two largest public online databases, NCBI Nucleotide and GBIF. The secondary aim was to highlight the contribution each geographic area has to each database. The basis for data analysis of this study was the metadata provided by both databases, mainly, the taxonomy and the geographical area origin of isolation of the microorganism (record). These were directly obtained from GBIF through the online interface, while E-utilities and Python were used in combination with a programmatic web service access to obtain data from the NCBI Nucleotide Database. Results indicate that the American continent, and more specifically the USA, is the top contributor, while Africa and Antarctica are less well represented. This highlights the imbalance of exploration within these areas rather than any reduction in biodiversity. This study describes a novel approach to generating global scale patterns of bacterial biodiversity and biogeography and indicates that the Proteobacteria are the most abundant and widely distributed phylum within both databases.

[1]  Philip Hugenholtz,et al.  Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity , 1998, Journal of bacteriology.

[2]  E. Sayers A General Introduction to the E-utilities , 2010 .

[3]  J. Tiedje,et al.  Biogeography and Degree of Endemicity of Fluorescent Pseudomonas Strains in Soil , 2000, Applied and Environmental Microbiology.

[4]  J. Hollibaugh,et al.  Phylogenetic Composition of Arctic Ocean Archaeal Assemblages and Comparison with Antarctic Assemblages , 2004, Applied and Environmental Microbiology.

[5]  B. Jones,et al.  Microbial Biogeography of Six Salt Lakes in Inner Mongolia, China, and a Salt Lake in Argentina , 2009, Applied and Environmental Microbiology.

[6]  J. Prosser,et al.  Molecular Analysis of Bacterial Community Structure and Diversity in Unimproved and Improved Upland Grass Pastures , 1999, Applied and Environmental Microbiology.

[7]  Laura E. Green,et al.  The role of ecological theory in microbial ecology , 2007, Nature Reviews Microbiology.

[8]  Rob Knight,et al.  Soil bacterial diversity in the Arctic is not fundamentally different from that found in other biomes. , 2010, Environmental microbiology.

[9]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[10]  James H. Brown,et al.  Microbial biogeography: putting microorganisms on the map , 2006, Nature Reviews Microbiology.

[11]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[12]  Thomas Bell,et al.  The bacterial biogeography of British soils. , 2011, Environmental microbiology.

[13]  J. Hughes,et al.  A taxa–area relationship for bacteria , 2004, Nature.

[14]  W. Marsden I and J , 2012 .

[15]  O. White,et al.  Environmental Genome Shotgun Sequencing of the Sargasso Sea , 2004, Science.

[16]  Andy South,et al.  rworldmap : a new R package for mapping global data , 2011, R J..

[17]  Arthur Chapman,et al.  © 2005, Global Biodiversity Information Facility Material in this publication is free to use, with proper attribution. Recommended citation format: Chapman, A. D. 2005. Principles of Data Quality, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen. , 2005 .

[18]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[19]  R. B. Jackson,et al.  The diversity and biogeography of soil bacterial communities. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[20]  R. Knight,et al.  Global patterns in the biogeography of bacterial taxa. , 2011, Environmental microbiology.

[21]  B. Bohannan,et al.  Spatial scaling of microbial biodiversity. , 2006, Trends in ecology & evolution.

[22]  J. Saunders,et al.  Microbial Evolution, Diversity, and Ecology: A Decade of Ribosomal RNA Analysis of Uncultivated Microorganisms , 1998, Microbial Ecology.

[23]  E. Dooley,et al.  Global Biodiversity Information Facility , 2002, Environmental Health Perspectives.

[24]  Gerard Muyzer,et al.  A comparison of taxon co-occurrence patterns for macro- and microorganisms. , 2007, Ecology.

[25]  W. Whitman,et al.  Prokaryotes: the unseen majority. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Wim Vyverman,et al.  The power of species sorting: Local factors drive bacterial community composition over a wide range of spatial scales , 2007, Proceedings of the National Academy of Sciences.

[27]  James N. Burch,et al.  Biogeography. An Ecological and Evolutionary Approach , 1994, Economic Botany.

[28]  John W. Taylor,et al.  Geographic Barriers Isolate Endemic Populations of Hyperthermophilic Archaea , 2003, Science.

[29]  Roy Haines-Young,et al.  Biogeography , 1992, Vegetatio.