Open resource metagenomics: a model for sharing metagenomic libraries

Both sequence-based and activity-based exploitation of environmental DNA have provided unprecedented access to the genomic content of cultivated and uncultivated microorganisms. Although researchers deposit microbial strains in culture collections and DNA sequences in databases, activity-based metagenomic studies typically only publish sequences from the hits retrieved from specific screens. Physical metagenomic libraries, conceptually similar to entire sequence datasets, are usually not straightforward to obtain by interested parties subsequent to publication. In order to facilitate unrestricted distribution of metagenomic libraries, we propose the adoption of open resource metagenomics, in line with the trend towards open access publishing, and similar to culture- and mutant-strain collections that have been the backbone of traditional microbiology and microbial genetics. The concept of open resource metagenomics includes preparation of physical DNA libraries, preferably in versatile vectors that facilitate screening in a diversity of host organisms, and pooling of clones so that single aliquots containing complete libraries can be easily distributed upon request. Database deposition of associated metadata and sequence data for each library provides researchers with information to select the most appropriate libraries for further research projects. As a starting point, we have established the Canadian MetaMicroBiome Library (CM2BL [1]). The CM2BL is a publicly accessible collection of cosmid libraries containing environmental DNA from soils collected from across Canada, spanning multiple biomes. The libraries were constructed such that the cloned DNA can be easily transferred to Gateway® compliant vectors, facilitating functional screening in virtually any surrogate microbial host for which there are available plasmid vectors. The libraries, which we are placing in the public domain, will be distributed upon request without restriction to members of both the academic research community and industry. This article invites the scientific community to adopt this philosophy of open resource metagenomics to extend the utility of functional metagenomics beyond initial publication, circumventing the need to start from scratch with each new research project.

[1]  Trevor C. Charles,et al.  Harvesting of novel polyhydroxyalkanaote (PHA) synthase encoding genes from a soil metagenome library using phenotypic screening. , 2011, FEMS microbiology letters.

[2]  Emily S. Charlson,et al.  Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications , 2011, Nature Biotechnology.

[3]  Paula Y. Calle,et al.  Cloning large natural product gene clusters from the environment: Piecing environmental DNA gene clusters back together with TAR , 2010, Biopolymers.

[4]  Trevor C. Charles,et al.  Identification and characterization of new LuxR/LuxI-type quorum sensing systems from metagenomic libraries. , 2010, Environmental microbiology.

[5]  W. Ludwig,et al.  Notes on the characterization of prokaryote strains for taxonomic purposes. , 2010, International journal of systematic and evolutionary microbiology.

[6]  Adi Rolider Isolation and characterization of bacterial phosphorous metabolism genes from complex microbial communities , 2009 .

[7]  Norman Paskin,et al.  Studies on Monitoring and Tracking Genetic Resources: An Executive Summary , 2009, Standards in genomic sciences.

[8]  Andreas Wilke,et al.  phylogenetic and functional analysis of metagenomes , 2022 .

[9]  Frank Oliver Glöckner,et al.  Toward a standards-compliant genomic and metagenomic publication record. , 2008, Omics : a journal of integrative biology.

[10]  Chris F. Taylor,et al.  The minimum information about a genome sequence (MIGS) specification , 2008, Nature Biotechnology.

[11]  F. Katzen Gateway® recombinational cloning: a biological operating system , 2007, Expert opinion on drug discovery.

[12]  M. Thurston,et al.  Handlebar: a flexible, web-based inventory manager for handling barcoded samples. , 2007, BioTechniques.

[13]  P. Bork,et al.  Prediction of effective genome size in metagenomic samples , 2007, Genome Biology.

[14]  Trevor C. Charles,et al.  Isolation of Poly-3-Hydroxybutyrate Metabolism Genes from Complex Microbial Communities by Phenotypic Complementation of Bacterial Mutants , 2006, Applied and Environmental Microbiology.

[15]  J. Handelsman,et al.  Uncultured soil bacteria are a reservoir of new antibiotic resistance genes. , 2004, Environmental microbiology.

[16]  M. Kahn,et al.  New Recombination Methods for Sinorhizobium meliloti Genetics , 2004, Applied and Environmental Microbiology.

[17]  Jo Handelsman,et al.  A Census of rRNA Genes and Linked Genomic Sequences within a Soil Metagenomic Library , 2003, Applied and Environmental Microbiology.

[18]  Jo Handelsman,et al.  Isolation of Antibiotics Turbomycin A and B from a Metagenomic Library of Soil Microbial DNA , 2002, Applied and Environmental Microbiology.

[19]  C. Fraser,et al.  Sequenced strains must be saved from extinction , 2001, Nature.

[20]  J. Hartley,et al.  DNA cloning using in vitro site-specific recombination. , 2000, Genome research.

[21]  J. Handelsman,et al.  Cloning the Soil Metagenome: a Strategy for Accessing the Genetic and Functional Diversity of Uncultured Microorganisms , 2000, Applied and Environmental Microbiology.

[22]  E. Delong,et al.  Characterization of uncultivated prokaryotes: isolation and analysis of a 40-kilobase-pair genome fragment from a planktonic marine archaeon , 1996, Journal of bacteriology.

[23]  D. Fritze World Federation for Culture Collections: Minutes of the General Assembly, 14 October 1992, Beijing , 1994 .

[24]  E. Delong,et al.  Analysis of a marine picoplankton community by 16S rRNA gene cloning and sequencing , 1991, Journal of bacteriology.

[25]  Jonathan D. G. Jones,et al.  An efficient mobilizable cosmid vector, pRK7813, and its use in a rapid method for marker exchange in Pseudomonas fluorescens strain HV37a. , 1987, Gene.