Prophage Hunter: an integrative hunting tool for active prophages

Abstract Identifying active prophages is critical for studying coevolution of phage and bacteria, investigating phage physiology and biochemistry, and engineering designer phages for diverse applications. We present Prophage Hunter, a tool aimed at hunting for active prophages from whole genome assembly of bacteria. Combining sequence similarity-based matching and genetic features-based machine learning classification, we developed a novel scoring system that exhibits higher accuracy than current tools in predicting active prophages on the validation datasets. The option of skipping similarity matching is also available so that there's higher chance for novel phages to be discovered. Prophage Hunter provides a one-stop web service to extract prophage genomes from bacterial genomes, evaluate the activity of the prophages, identify phylogenetically related phages, and annotate the function of phage proteins. Prophage Hunter is freely available at https://pro-hunter.bgi.com/.

[1]  David S. Wishart,et al.  PHASTER: a better, faster version of the PHAST phage search tool , 2016, Nucleic Acids Res..

[2]  Courtney J. Robinson,et al.  Prophage-mediated defence against viral attack and viral counter-defence , 2017, Nature Microbiology.

[3]  L. Leibovici,et al.  The significance of Acinetobacter baumannii bacteraemia compared with Klebsiella pneumoniae bacteraemia: risk factors and outcomes. , 2006, The Journal of hospital infection.

[4]  Yang Young Lu,et al.  VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data , 2017, Microbiome.

[5]  S. Abedon,et al.  Lysogeny in nature: mechanisms, impact and ecology of temperate phages , 2017, The ISME Journal.

[6]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[7]  M. Borodovsky,et al.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. , 2001, Nucleic acids research.

[8]  O. Lund,et al.  MetaPhinder—Identifying Bacteriophage Sequences in Metagenomic Data Sets , 2016, PloS one.

[9]  M. Touchon,et al.  Genetic and life-history traits associated with the distribution of prophages in bacteria , 2016, The ISME Journal.

[10]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[11]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[12]  D. Fouts,et al.  Genetic modifications to temperate Enterococcus faecalis phage Ef11 that abolish the establishment of lysogeny and sensitivity to repressor, and increase host range and productivity of lytic infection. , 2013, Microbiology.

[13]  M. Clokie,et al.  Clostridium difficile phages: still difficult? , 2014, Front. Microbiol..

[14]  Robert A. Edwards,et al.  PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies , 2012, Nucleic acids research.

[15]  Matthew Fraser,et al.  InterProScan 5: genome-scale protein function classification , 2014, Bioinform..

[16]  R. Alaghehbandan,et al.  Inhibitory-based method for detection of Klebsiella pneumoniae carbapenemase Acinetobacter baumannii isolated from burn patients. , 2015, Indian journal of pathology & microbiology.

[17]  M. Loessner,et al.  Cross-genus rebooting of custom-made, synthetic bacteriophage genomes in L-form bacteria , 2018, Proceedings of the National Academy of Sciences.

[18]  Antibiotic trends of Klebsiella pneumoniae and Acinetobacter baumannii resistance indicators in an intensive care unit of Southern Italy, 2008–2013 , 2015, Antimicrobial Resistance and Infection Control.

[19]  S. Salzberg,et al.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads , 2015, Nature Biotechnology.

[20]  Matthew B. Sullivan,et al.  VirSorter: mining viral signal from microbial genomic data , 2015, PeerJ.

[21]  João C. Setubal,et al.  MARVEL, a Tool for Prediction of Bacteriophage Sequences in Metagenomic Bins , 2018, Front. Genet..

[22]  I. Borovok,et al.  Temperate bacteriophages as regulators of host behavior. , 2017, Current opinion in microbiology.

[23]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[24]  David G Hendrickson,et al.  Differential analysis of gene regulation at transcript resolution with RNA-seq , 2012, Nature Biotechnology.

[25]  U. Qimron,et al.  Temperate and lytic bacteriophages programmed to sensitize and kill antibiotic-resistant bacteria , 2015, Proceedings of the National Academy of Sciences.

[26]  M. Adams,et al.  Carbapenem-resistant Acinetobacter baumannii and Klebsiella pneumoniae across a hospital system: impact of post-acute care facilities on dissemination. , 2010, The Journal of antimicrobial chemotherapy.

[27]  M. Loessner,et al.  Engineering Bacteriophages as Versatile Biologics. , 2019, Trends in microbiology.

[28]  R. Edwards,et al.  A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes , 2014, Nature Communications.

[29]  G. Fournous,et al.  Prophage Genomics , 2003, Microbiology and Molecular Biology Reviews.

[30]  David S. Wishart,et al.  PHAST: A Fast Phage Search Tool , 2011, Nucleic Acids Res..

[31]  A. R. Costa,et al.  Phage Therapy: Going Temperate? , 2019, Trends in microbiology.

[32]  Juw Won Park,et al.  Genetic engineering of a temperate phage-based delivery system for CRISPR/Cas9 antimicrobials against Staphylococcus aureus , 2017, Scientific Reports.

[33]  Sascha Dietrich,et al.  Genome-Based Identification of Active Prophage Regions by Next Generation Sequencing in Bacillus licheniformis DSM13 , 2015, PloS one.

[34]  R. Sorek,et al.  Contemporary Phage Biology: From Classic Models to New Insights , 2018, Cell.