Thousands of previously unknown phages discovered in whole-community human gut metagenomes

Background
Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionarily informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut.

Results
A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides, and Prevotella. The third proposed family, “Gratiaviridae”, consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae, and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species.

Conclusions
Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse, and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.
