Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures

BackgroundMetagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses.ResultsFrom both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes.ConclusionThat BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis.

[1]  G. Fournous,et al.  Phage as agents of lateral gene transfer. , 2003, Current opinion in microbiology.

[2]  M. Blaser,et al.  Identification of Horizontally Acquired Genetic Elements in Helicobacter pylori and Other Prokaryotes Using Oligonucleotide Difference Analysis , 2002 .

[3]  Ellen Jo Baron,et al.  Manual of clinical microbiology , 1975 .

[4]  G. Singer,et al.  Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. , 2003, Gene.

[5]  Martin J Blaser,et al.  Evidence of host-virus co-evolution in tetranucleotide usage patterns of bacteriophages and eukaryotic viruses , 2006, BMC Genomics.

[6]  Dhritiman Ghosh,et al.  Metagenomic Characterization of Chesapeake Bay Virioplankton , 2007, Applied and Environmental Microbiology.

[7]  H. Ackermann,et al.  Isolation and characterization of Thermus bacteriophages , 2006, Archives of Virology.

[8]  N. Pace,et al.  Microbial Composition of Near-Boiling Silica-Depositing Thermal Springs throughout Yellowstone National Park , 2002, Applied and Environmental Microbiology.

[9]  E. Koonin,et al.  Evolutionary genomics of archaeal viruses: unique viral genomes in the third domain of life. , 2006, Virus research.

[10]  K. Eric Wommack,et al.  Assembly of Viral Metagenomes from Yellowstone Hot Springs , 2008, Applied and Environmental Microbiology.

[11]  K. Wommack,et al.  Virioplankton: Viruses in Aquatic Ecosystems , 2000, Microbiology and Molecular Biology Reviews.

[12]  M. Breitbart,et al.  Phage Community Dynamics in Hot Springs , 2004, Applied and Environmental Microbiology.

[13]  R. Edwards,et al.  The Phage Proteomic Tree: a Genome-Based Taxonomy for Phage , 2002, Journal of bacteriology.

[14]  N. Pace,et al.  Phylogenetic analysis of the hyperthermophilic pink filament community in Octopus Spring, Yellowstone National Park , 1994, Applied and environmental microbiology.

[15]  J. Drake,et al.  Genome-Wide Patterns of Nucleotide Substitution Reveal Stringent Functional Constraints on the Protein Sequences of Thermophiles , 2004, Genetics.

[16]  Henry D. Isenberg,et al.  Manual of Clinical Microbiology , 1991 .

[17]  J. Gordon,et al.  Genomic and Metabolic Studies of the Impact of Probiotics on a Model Gut Symbiont and Host , 2006, PLoS biology.

[18]  S. Karlin,et al.  Over- and under-representation of short oligonucleotides in DNA sequences. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[19]  I. Rigoutsos,et al.  Accurate phylogenetic classification of variable-length DNA fragments , 2007, Nature Methods.

[20]  Micah Acinapura,et al.  Computational DNA Sequence Analysis , 2003 .

[21]  R. Huber,et al.  Signature Lipids and Stable Carbon Isotope Analyses of Octopus Spring Hyperthermophilic Communities Compared with Those ofAquificales Representatives , 2001, Applied and Environmental Microbiology.

[22]  M. Blaser,et al.  Evolutionary implications of microbial genome tetranucleotide frequency biases. , 2003, Genome research.

[23]  James T. Staley,et al.  Biodiversity of microbial life , 2002 .

[24]  Oleg N. Reva,et al.  Global features of sequences of bacterial chromosomes, plasmids and phages revealed by analysis of oligonucleotide usage patterns , 2004, BMC Bioinformatics.

[25]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[26]  P. Forterre,et al.  The role played by viruses in the evolution of their hosts: a view based on informational protein phylogenies. , 2003, Research in microbiology.

[27]  T. D. Brock Thermophilic Microorganisms and Life at High Temperatures , 1978, Springer Series in Microbiology.

[28]  Y. Sakaki,et al.  Isolation and characterization of a bacteriophage infectious to an extreme thermophile, Thermus thermophilus HB8 , 1975, Journal of virology.

[29]  Roger W. M. Hatfull,et al.  Approaches Exchange and Failings of Phenetic Imbroglios of Viral Taxonomy: Genetic , 2002 .

[30]  B. Andresen,et al.  Genomic analysis of uncultured marine viral communities , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[31]  H Almagor,et al.  A Markov analysis of DNA sequences. , 1983, Journal of theoretical biology.

[32]  R. Garrett,et al.  Morphology and genome organization of the virus PSV of the hyperthermophilic archaeal genera Pyrobaculum and Thermoproteus: a novel virus family, the Globuloviridae. , 2004, Virology.

[33]  P. Salamon,et al.  Metagenomic Analyses of an Uncultured Viral Community from Human Feces , 2003, Journal of bacteriology.

[34]  Florent E. Angly,et al.  The Marine Viromes of Four Oceanic Regions , 2006, PLoS biology.

[35]  M. Pop,et al.  Metagenomic Analysis of the Human Distal Gut Microbiome , 2006, Science.

[36]  C. Suttle Marine viruses — major players in the global ecosystem , 2007, Nature Reviews Microbiology.

[37]  R. Sharp,et al.  Isolation and preliminary taxonomic studies of Thermus strains isolated from Yellowstone National Park, USA. , 1986, Journal of general microbiology.

[38]  V. Kunin,et al.  A bacterial metapopulation adapts locally to phage predation despite global dispersal. , 2008, Genome research.

[39]  D. Swofford PAUP*: Phylogenetic analysis using parsimony (*and other methods), Version 4.0b10 , 2002 .

[40]  M. Weinbauer,et al.  Are viruses driving microbial diversification and diversity? , 2003, Environmental microbiology.

[41]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[42]  N. Pace,et al.  Characterization of a Yellowstone hot spring microbial community by 5S rRNA sequences , 1985, Applied and environmental microbiology.

[43]  B. Wiedenheft,et al.  Effects of Culturing on the Population Structure of a Hyperthermophilic Virus , 2004, Microbial Ecology.