MultiPhATE2: Code for Functional Annotation and Comparison of Bacteriophage Genomes

To address the need for improved tools for annotation and comparative genomics of bacteriophage genomes, we developed multiPhATE2. As an extension of the multiPhATE code, multiPhATE2 performs gene finding and functional annotation of predicted gene and protein sequences; additional search algorithms and databases extend the search space of the original functional annotation subsystem. MultiPhATE2 also includes comparative genomics codes for gene matching among sets of input bacteriophage genomes, and it scales well to large input data sets through the incorporation of multiprocessing in the functional annotation and comparative genomics subsystems. MultiPhATE2 was implemented in Python 3.7 and runs as a command-line code under Linux or macOS. MultiPhATE2 is freely available under an open-source GPL-3 license at https://github.com/carolzhou/multiPhATE2. Instructions for acquiring the databases and third-party codes used by multiPhATE2 are provided in the README file included with the distribution. Users may report bugs by submitting issues to the project's GitHub repository. Contact: zhou4@llnl.gov or multiphate@gmail.com. Supplementary materials, which demonstrate the outputs of multiPhATE2, are available in a GitHub repository at https://github.com/carolzhou/multiPhATE2_supplementaryData/.
