VIGOR extended to annotate genomes for additional 12 different viruses

VIGOR (Viral Genome ORF Reader), a gene prediction program, was developed at the J. Craig Venter Institute in 2010 and has been used successfully for gene calling in coronavirus, influenza virus, rhinovirus and rotavirus for projects at the Genome Sequencing Center for Infectious Diseases. VIGOR uses sequence similarity searches against custom protein databases to identify protein-coding regions, start and stop codons and other gene features. RNA editing and other features are identified accurately based on sequence similarity and signature residues. VIGOR takes a single input: viral genomic sequences in FASTA format. It produces four output files: a gene prediction file, a complementary DNA file, an alignment file and a gene feature table file. The gene feature table can be used to create a GenBank submission. VIGOR has now been extended to predict genes for 12 additional viruses: measles virus, mumps virus, rubella virus, respiratory syncytial virus, alphavirus and Venezuelan equine encephalitis virus, norovirus, metapneumovirus, yellow fever virus, Japanese encephalitis virus, parainfluenza virus and Sendai virus. VIGOR accurately detects complex gene features such as RNA editing, stop codon leakage and ribosomal shunting. Precise identification of mat_peptide cleavage sites for some viruses is a built-in feature of VIGOR. The gene predictions for these viruses have been evaluated by testing on 27 to 240 genomes from GenBank.
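To make the notion of stop codon leakage concrete, the following is a minimal Python sketch. It is not VIGOR's implementation, which relies on similarity searches against custom protein databases; it only illustrates how an open reading frame taken from a FASTA-format genomic sequence can be extended through one leaky stop codon. The function names `read_fasta` and `find_orf`, the `leaky_stops` parameter and the toy sequence are all hypothetical and chosen for illustration.

```python
from typing import Dict, Tuple

# Standard stop codons in the DNA alphabet (genomic FASTA input uses T, not U).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def read_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into a {sequence_id: sequence} mapping."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the id.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def find_orf(seq: str, start: int, leaky_stops: int = 0) -> Tuple[int, int]:
    """Scan codons in frame from `start` and return (start, end).

    `end` is exclusive and includes the terminating stop codon. Up to
    `leaky_stops` in-frame stop codons are read through before the scan
    terminates (hypothetical parameter modelling stop codon leakage).
    If no terminating stop codon is found, `end` is the last complete
    codon boundary.
    """
    remaining = leaky_stops
    pos = start
    while pos + 3 <= len(seq):
        codon = seq[pos:pos + 3]
        if codon in STOP_CODONS:
            if remaining == 0:
                return start, pos + 3
            remaining -= 1
        pos += 3
    return start, pos


# Toy genome (invented for illustration): ATG AAA TGA CTG GGG TAA
genome = read_fasta(">toy\nATGAAATGA\nCTGGGGTAA\n")["toy"]
strict = find_orf(genome, 0)                 # stops at the first TGA
leaky = find_orf(genome, 0, leaky_stops=1)   # reads through TGA to TAA
```

In this toy case the strict scan ends at the first in-frame TGA, whereas the leaky scan continues to the downstream TAA, giving the longer readthrough product. A real annotation pipeline would decide whether readthrough applies from similarity to reference proteins rather than from a fixed parameter.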
