Prophage genomics reveals patterns in phage genome organization and replication

Temperate phage genomes are highly variable mosaic collections of genes that infect a bacterial host, integrate into the host’s genome or replicate as low copy number plasmids, and are regulated to switch from the lysogenic to lytic cycles to generate new virions and escape their host. Genomes from most Bacterial phyla contain at least one or more prophages. We updated our PhiSpy algorithm to improve detection of prophages and to provide a web-based framework for PhiSpy. We have used this algorithm to identify 36,488 prophage regions from 11,941 bacterial genomes, including almost 600 prophages with no known homology to any proteins. Transfer RNA genes were abundant in the prophages, many of which alleviate the limits of translation efficiency due to host codon bias and presumably enable phages to surpass the normal capacity of the hosts’ translation machinery. We identified integrase genes in 15,765 prophages (43% of the prophages). The integrase was routinely located at either end of the integrated phage genome, and was used to orient and align prophage genomes to reveal their underlying organization. The conserved genome alignments of phages recapitulate early, middle, and late gene order in transcriptional control of phage genes, and demonstrate that gene order, presumably selected by transcription timing and/or coordination among functional modules has been stably conserved throughout phage evolution.

[1]  Rick L. Stevens,et al.  The RAST Server: Rapid Annotations using Subsystems Technology , 2008, BMC Genomics.

[2]  Stevens Dl,et al.  Streptococcus pyogenes: Basic Biology to Clinical Manifestations , 2016 .

[3]  F. Rohwer,et al.  Genome Sequences of Two Closely Related Vibrio parahaemolyticus Phages, VP16T and VP16C , 2003, Journal of bacteriology.

[4]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[5]  M. Breitbart,et al.  The complete genomic sequence of the marine phage Roseophage SIO1 shares homology with nonmarine phages , 2000 .

[6]  Robert A. Edwards,et al.  PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies , 2012, Nucleic acids research.

[7]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[8]  F. Bushman Lateral DNA transfer : mechanisms and consequences , 2002 .

[9]  Graham F Hatfull,et al.  Bacteriophage genomics. , 2008, Current opinion in microbiology.

[10]  N. Scherberg,et al.  Transfer RNA coded by the T4 bacteriophage genome. , 1968, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M. Haruki,et al.  Site-specific recombinases as tools for heterologous gene integration , 2011, Applied Microbiology and Biotechnology.

[12]  D. Fouts Phage_Finder: Automated identification and classification of prophage regions in complete bacterial genome sequences , 2006, Nucleic acids research.

[13]  R. Tirumalai,et al.  Similarities and differences among 105 members of the Int family of site-specific recombinases. , 1998, Nucleic acids research.

[14]  C. Marrs,et al.  Kinetics and regulation of transcription of bacteriophage Mu. , 1990, Virology.

[15]  M. Chandler,et al.  Replication of the prophage P1 during the cell cycle of Escherichia coli , 1977, Molecular and General Genetics MGG.

[16]  J. Collins,et al.  Antibiotic Treatment Expands the Resistance Reservoir and Ecological Network of the Phage Metagenome , 2013, Nature.

[17]  T. Kunisawa,et al.  Synonymous codon preferences in bacteriophage T4: a distinctive use of transfer RNAs from T4 and from its host Escherichia coli. , 1992, Journal of theoretical biology.

[18]  J. H. Wilson Function of the bacteriophage T4 transfer RNA's. , 1973, Journal of molecular biology.

[19]  R. Hendrix,et al.  Evolutionary relationships among diverse bacteriophages and prophages: all the world's a phage. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[20]  V. Stewart,et al.  Genetic Analysis of Pathogenic Bacteria: A Laboratory Manual , 1995 .

[21]  David S. Wishart,et al.  PHAST: A Fast Phage Search Tool , 2011, Nucleic Acids Res..

[22]  G. Fournous,et al.  Phage as agents of lateral gene transfer. , 2003, Current opinion in microbiology.

[23]  E. Holmes,et al.  Viral evolution and the emergence of SARS coronavirus. , 2004, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[24]  Massimo Vergassola,et al.  Causes for the intriguing presence of tRNAs in phages. , 2007, Genome research.

[25]  N. Hannett,et al.  Restriction cleavage map of SP01 DNA: general location of early, middle, and late genes , 1979, Journal of Virology.

[26]  Rodrigo Lopez,et al.  A new bioinformatics analysis tools framework at EMBL–EBI , 2010, Nucleic Acids Res..

[27]  Jacques van Helden,et al.  Prophinder: a computational tool for prophage prediction in prokaryotic genomes , 2008, Bioinform..

[28]  S. Salzberg,et al.  DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae , 2000, Nature.

[29]  P. Forterre,et al.  Single‐stranded DNA viruses employ a variety of mechanisms for integration into host genomes , 2015, Annals of the New York Academy of Sciences.

[30]  Antoni Luque,et al.  Theoretical studies on assembly, physical stability and dynamics of viruses. , 2013, Sub-cellular biochemistry.

[31]  Scott V. Nguyen,et al.  The Bacteriophages of Streptococcus pyogenes , 2019, Microbiology spectrum.

[32]  John Vu,et al.  Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity , 2015, eLife.

[33]  M. Ventura,et al.  Temporal Transcription Map of the Virulent Streptococcus thermophilus Bacteriophage Sfi19 , 2004, Applied and Environmental Microbiology.

[34]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[35]  L. Rajeev,et al.  Challenging a Paradigm: the Role of DNA Homology in Tyrosine Recombinase Reactions , 2009, Microbiology and Molecular Biology Reviews.

[36]  G. Hatfull,et al.  The orientation of mycobacteriophage Bxb1 integration is solely dependent on the central dinucleotide of attP and attB. , 2003, Molecular cell.

[37]  B. Das,et al.  Integrative mobile elements exploiting Xer recombination. , 2013, Trends in microbiology.

[38]  A. Campbell Chromosomal insertion sites for phages and plasmids , 1992, Journal of bacteriology.

[39]  Barbara A. Bailey,et al.  Lytic to temperate switching of viral communities , 2016, Nature.

[40]  D. Esposito,et al.  The integrase family of tyrosine recombinases: evolution of a conserved active site domain. , 1997, Nucleic acids research.

[41]  H. Karch,et al.  Characterization of a Shiga Toxin 2e-Converting Bacteriophage from an Escherichia coli Strain of Human Origin , 2000, Infection and Immunity.

[42]  Peter Salamon,et al.  Phage Phenomics: Physiological Approaches to Characterize Novel Viral Proteins , 2015, Journal of visualized experiments : JoVE.

[43]  P Argos,et al.  The integrase family of site‐specific recombinases: regional similarities and global diversity. , 1986, The EMBO journal.

[44]  Fangfang Xia,et al.  The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST) , 2013, Nucleic Acids Res..

[45]  Andrew R McEwan,et al.  Site-specific recombination by phiC31 integrase and other large serine recombinases. , 2010, Biochemical Society transactions.

[46]  Tanja Woyke,et al.  Viral dark matter and virus–host interactions resolved from publicly available microbial genomes , 2015, eLife.

[47]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[48]  Wolf-Dietrich Hardt,et al.  Phages and the Evolution of Bacterial Pathogens: from Genomic Rearrangements to Lysogenic Conversion , 2004, Microbiology and Molecular Biology Reviews.

[49]  Robert Barber,et al.  Prophage Finder: A Prophage Loci Prediction Tool for Prokaryotic Genome Sequences , 2006, Silico Biol..

[50]  Itai Sharon,et al.  Comparative metagenomics of microbial traits within oceanic viral communities , 2011, The ISME Journal.

[51]  A. Campbell Phage integration and chromosome structure. A personal history. , 2007, Annual review of genetics.

[52]  Matthew B. Sullivan,et al.  VirSorter: mining viral signal from microbial genomic data , 2015, PeerJ.

[53]  Ying Gao,et al.  Bioinformatics Applications Note Sequence Analysis Cd-hit Suite: a Web Server for Clustering and Comparing Biological Sequences , 2022 .

[54]  Robert A Edwards,et al.  Structure and function of a cyanophage-encoded peptide deformylase , 2013, The ISME Journal.

[55]  Ghislain Fournous,et al.  The impact of prophages on bacterial chromosomes , 2004, Molecular microbiology.

[56]  Bas E. Dutilh,et al.  Computational approaches to predict bacteriophage–host relationships , 2015, FEMS microbiology reviews.

[57]  James J. Davis,et al.  Modal Codon Usage: Assessing the Typical Codon Usage of a Genome , 2009, Molecular biology and evolution.

[58]  M. Jayaram,et al.  An Overview of Tyrosine Site-specific Recombination: From an Flp Perspective. , 2015, Microbiology spectrum.

[59]  Fumio Arisaka,et al.  Bacteriophage T4 Genome , 2003, Microbiology and Molecular Biology Reviews.

[60]  M. E. Abdel-Haliem,et al.  Site-specific recombination systems in filamentous phages , 2012, Molecular Genetics and Genomics.

[61]  S. Casjens,et al.  Prophages and bacterial genomics: what have we learned so far? , 2003, Molecular microbiology.