Assembling the Marine Metagenome, One Cell at a Time

The difficulty associated with the cultivation of most microorganisms and the complexity of natural microbial assemblages, such as marine plankton or human microbiome, hinder genome reconstruction of representative taxa using cultivation or metagenomic approaches. Here we used an alternative, single cell sequencing approach to obtain high-quality genome assemblies of two uncultured, numerically significant marine microorganisms. We employed fluorescence-activated cell sorting and multiple displacement amplification to obtain hundreds of micrograms of genomic DNA from individual, uncultured cells of two marine flavobacteria from the Gulf of Maine that were phylogenetically distant from existing cultured strains. Shotgun sequencing and genome finishing yielded 1.9 Mbp in 17 contigs and 1.5 Mbp in 21 contigs for the two flavobacteria, with estimated genome recoveries of about 91% and 78%, respectively. Only 0.24% of the assembling sequences were contaminants and were removed from further analysis using rigorous quality control. In contrast to all cultured strains of marine flavobacteria, the two single cell genomes were excellent Global Ocean Sampling (GOS) metagenome fragment recruiters, demonstrating their numerical significance in the ocean. The geographic distribution of GOS recruits along the Northwest Atlantic coast coincided with ocean surface currents. Metabolic reconstruction indicated diverse potential energy sources, including biopolymer degradation, proteorhodopsin photometabolism, and hydrogen oxidation. Compared to cultured relatives, the two uncultured flavobacteria have small genome sizes, few non-coding nucleotides, and few paralogous genes, suggesting adaptations to narrow ecological niches. These features may have contributed to the abundance of the two taxa in specific regions of the ocean, and may have hindered their cultivation. We demonstrate the power of single cell DNA sequencing to generate reference genomes of uncultured taxa from a complex microbial community of marine bacterioplankton. A combination of single cell genomics and metagenomics enabled us to analyze the genome content, metabolic adaptations, and biogeography of these taxa.

[1]  Itai Sharon,et al.  Widespread distribution of proteorhodopsins in freshwater and brackish ecosystems , 2010, The ISME Journal.

[2]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[3]  I. Paulsen,et al.  Genome analysis of the proteorhodopsin-containing marine bacterium Polaribacter sp. MED152 (Flavobacteria) , 2008, Proceedings of the National Academy of Sciences.

[4]  E. Delong,et al.  The Microbial Engines That Drive Earth's Biogeochemical Cycles , 2008, Science.

[5]  Itai Sharon,et al.  Widespread distribution of proteorhodopsins in freshwater and brackish ecosystems , 2008, The ISME Journal.

[6]  Photochemical production of molecular hydrogen in lake water and coastal seawater , 2008 .

[7]  Zhaojun Bai,et al.  CompostBin: A DNA Composition-Based Algorithm for Binning Environmental Shotgun Reads , 2007, RECOMB.

[8]  R. Knight,et al.  The Human Microbiome Project , 2007, Nature.

[9]  M. Moran,et al.  Resourceful heterotrophs make the most of light in the coastal ocean , 2007, Nature Reviews Microbiology.

[10]  Timothy B. Stockwell,et al.  Nanoliter Reactors Improve Multiple Displacement Amplification of Genomes from Single Cells , 2007, PLoS genetics.

[11]  S. Giovannoni,et al.  Improvements of high-throughput culturing yielded novel SAR11 strains and other abundant marine bacteria from the Oregon coast and the Bermuda Atlantic Time Series study site , 2007, The ISME Journal.

[12]  S. Quake,et al.  Dissecting biological “dark matter” with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth , 2007, Proceedings of the National Academy of Sciences.

[13]  R. Stepanauskas,et al.  Matching phylogeny and metabolism in the uncultured marine bacteria, one cell at a time , 2007, Proceedings of the National Academy of Sciences.

[14]  Roger S Lasken,et al.  Mechanism of chimera formation during the Multiple Displacement Amplification reaction , 2007 .

[15]  A. Halpern,et al.  The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific , 2007, PLoS biology.

[16]  Alexander F. Auch,et al.  MEGAN analysis of metagenomic data. , 2007, Genome research.

[17]  S. Giovannoni,et al.  The SAR92 Clade: an Abundant Coastal Clade of Culturable Marine Bacteria Possessing Proteorhodopsin , 2007, Applied and Environmental Microbiology.

[18]  R. Neutze,et al.  Light stimulates growth of proteorhodopsin-containing marine Flavobacteria , 2007, Nature.

[19]  P. Vandamme,et al.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. , 2007, International journal of systematic and evolutionary microbiology.

[20]  E. Delong,et al.  Genomic analysis of the uncultivated marine crenarchaeote Cenarchaeum symbiosum , 2006, Proceedings of the National Academy of Sciences.

[21]  Natalia N. Ivanova,et al.  Symbiosis insights through metagenomic analysis of a microbial consortium. , 2006, Nature Reviews Microbiology.

[22]  G. Church,et al.  Sequencing genomes from single cells by polymerase cloning , 2006, Nature Biotechnology.

[23]  Lynne A. Goodwin,et al.  Pathogenomic Sequence Analysis of Bacillus cereus and Bacillus thuringiensis Isolates Closely Related to Bacillus anthracis , 2006, Journal of bacteriology.

[24]  Inna Dubchak,et al.  The integrated microbial genomes (IMG) system , 2005, Nucleic Acids Res..

[25]  Patricia Siguier,et al.  ISfinder: the reference centre for bacterial insertion sequences , 2005, Nucleic Acids Res..

[26]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[27]  M. Noordewier,et al.  Genome Streamlining in a Cosmopolitan Oceanic Bacterium , 2005, Science.

[28]  Søren Brunak,et al.  Prediction of twin-arginine signal peptides , 2005, BMC Bioinformatics.

[29]  A. C. Munk,et al.  Closing bacterial genomic sequence gaps with adaptor-PCR. , 2005, BioTechniques.

[30]  Martin Ester,et al.  Sequence analysis PSORTb v . 2 . 0 : Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis , 2004 .

[31]  S. Tringe,et al.  Comparative Metagenomics of Microbial Communities , 2004, Science.

[32]  Peter D. Karp,et al.  MetaCyc: a multiorganism database of metabolic pathways and enzymes , 2005, Nucleic Acids Res..

[33]  M. Moran,et al.  Use of Microautoradiography Combined with Fluorescence In Situ Hybridization To Determine Dimethylsulfoniopropionate Incorporation by Marine Bacterioplankton Taxa , 2004, Applied and Environmental Microbiology.

[34]  S. Brunak,et al.  Improved prediction of signal peptides: SignalP 3.0. , 2004, Journal of molecular biology.

[35]  O. White,et al.  Environmental Genome Shotgun Sequencing of the Sargasso Sea , 2004, Science.

[36]  J. Banfield,et al.  Community structure and metabolism through reconstruction of microbial genomes from the environment , 2004, Nature.

[37]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[38]  Burkhard Haefner,et al.  Drugs from the deep: marine natural products as drug candidates. , 2003, Drug discovery today.

[39]  K. Zengler,et al.  Cultivating the uncultured , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[40]  S. Giovannoni,et al.  Cultivation of the ubiquitous SAR11 marine bacterioplankton clade , 2002, Nature.

[41]  Sean R. Eddy,et al.  A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure , 2002, BMC Bioinformatics.

[42]  S. Salzberg,et al.  Fast algorithms for large-scale genome alignment and comparison. , 2002, Nucleic acids research.

[43]  D. Kirchman The ecology of Cytophaga-Flavobacteria in aquatic environments. , 2002, FEMS microbiology ecology.

[44]  J. Hoheisel,et al.  Global Analysis of the General Stress Response ofBacillus subtilis , 2001, Journal of bacteriology.

[45]  Marion Leclerc,et al.  Proteorhodopsin phototrophy in the ocean , 2001, Nature.

[46]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[47]  Lu Zhang,et al.  Sustained photobiological hydrogen gas production upon reversible inactivation of oxygen evolution in the green alga Chlamydomonas reinhardtii. , 2000, Plant physiology.

[48]  S. Salzberg,et al.  Improved microbial gene identification with GLIMMER. , 1999, Nucleic acids research.

[49]  G. Olsen,et al.  CRITICA: coding region identification tool invoking comparative analysis. , 1999, Molecular biology and evolution.

[50]  L. Proctor,et al.  Molecular evidence for zooplankton-associated nitrogen-fixing anaerobes based on amplification of the nifH gene , 1999 .

[51]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[52]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[53]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[54]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[55]  D. Lane 16S/23S rRNA sequencing , 1991 .

[56]  E. Stackebrandt,et al.  Nucleic acid techniques in bacterial systematics , 1991 .

[57]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[58]  N. Pace,et al.  The Analysis of Natural Microbial Populations by Ribosomal RNA Sequences , 1986 .