Mutation Master: profiles of substitutions in hepatitis C virus RNA of the core, alternate reading frame, and NS2 coding regions.

The RNA genome of the hepatitis C virus (HCV) undergoes rapid evolutionary change. Efforts to control this virus would benefit from the advent of facile methods to identify characteristic features of HCV RNA and proteins, and to condense the vast amount of mutational data into a readily interpretable form. Many HCV sequences are available in GenBank. To facilitate analysis, consensus sequences were constructed to eliminate the overrepresentation of certain genotypes, such as genotype 1, and a novel package of sequence analysis tools was developed. Mutation Master generates profiles of point mutations in a population of sequences and produces a set of visual displays and tables indicating the number, frequency, and character of substitutions. It can be used to analyze hundreds of sequences at a time. When applied to 255 HCV core protein sequences, Mutation Master identified variable domains and a series of mutations meriting further investigation. It flagged position 4, for example, where 90% or more of all sequences in genotypes 1, 2, 4, and 5, have N4, whereas those in genotypes 3, 6, 7, 8, 9, and 10 have L4. This pattern is noteworthy: L (hydrophobic) to N (polar) substitutions are generally rare, and genotypes 1, 2, 4, and 5 do not form a recognized super family of sequences. Thus, the L4N substitution probably arose independently several times. Moreover, not one member of genotypes 1, 2, 4, or 5 has L4 and not one member of genotypes 3, 6, 7, 8, 9, or 10 has N4. This nonoverlapping pattern suggests that coordinated changes at position 4 and a second site are required to yield a viable virus. The package generated a table of genotype-specific substitutions whose future analysis may help to identify interacting amino acids. Three substitutions were present in 100% of genotype 2 members and absent from all others: A68D, R74K, and R114H. Finally, this study revealed thatARFP, a novel protein encoded in an overlapping reading frame, is as conserved as conventional HCV proteins, a result supporting a role for ARFP in the viral life cycle. Whereas most conventional programs for phylogenetic analysis of sequences provide information about overall relatedness of genes or genomes, this program highlights and profiles point mutations. This is important because determinants of pathogenicity and drug susceptibility are likely to result from changes at only one or two key nucleotides or amino acid sites, and would not be detected by the type of pairwise comparisons that have usually been performed on HCV to date. This study is the first application of Mutation Master, which is now available upon request (http://tandem.biomath.mssm.edu/mutationmaster.html).

[1]  J. McLauchlan,et al.  The Domains Required to Direct Core Proteins of Hepatitis C Virus and GB Virus-B to Lipid Droplets Share Common Features with Plant Oleosin Proteins* , 2002, The Journal of Biological Chemistry.

[2]  M. Manns,et al.  Peginterferon alfa-2b plus ribavirin for chronic hepatitis , 2002, The Lancet.

[3]  A. Musacchio,et al.  In vitro self-assembled HCV core virus-like particles induce a strong antibody immune response in sheep. , 2002, Biochemical and biophysical research communications.

[4]  J. Ou,et al.  Post-translational Modification of the Hepatitis C Virus Core Protein by Tissue Transglutaminase* , 2001, The Journal of Biological Chemistry.

[5]  Zhenming Xu,et al.  Synthesis of a novel hepatitis C virus protein by ribosomal frameshift , 2001, The EMBO journal.

[6]  D. Stump,et al.  Evidence for a new hepatitis C virus antigen encoded in an overlapping reading frame. , 2001, RNA.

[7]  E. Wimmer,et al.  Genetic Analysis of a Poliovirus/Hepatitis C Virus Chimera: New Structure for Domain II of the Internal Ribosomal Entry Site of Hepatitis C Virus , 2001, Journal of Virology.

[8]  S. Lemon,et al.  The influence of downstream protein-coding sequence on internal ribosome entry on hepatitis C virus and other flavivirus RNAs. , 2001, RNA.

[9]  S. Watowich,et al.  Self-Assembly of Nucleocapsid-Like Particles from Recombinant Hepatitis C Virus Core Protein , 2001, Journal of Virology.

[10]  S. Lemon,et al.  Core Protein-Coding Sequence, but Not Core Protein, Modulates the Efficiency of Cap-Independent Translation Directed by the Internal Ribosome Entry Site of Hepatitis C Virus , 2000, Journal of Virology.

[11]  S. Lemon,et al.  Cell Type-Specific Enhancement of Hepatitis C Virus Internal Ribosome Entry Site-Directed Translation due to 5′ Nontranslated Region Substitutions Selected during Passage of Virus in Lymphoblastoid Cells , 2000, Journal of Virology.

[12]  C. Chu,et al.  Amino acid substitutions in codons 9–11 of hepatitis C virus core protein lead to the synthesis of a short core protein product , 2000, Journal of gastroenterology and hepatology.

[13]  J. McLauchlan,et al.  Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes , 2000, Journal of viral hepatitis.

[14]  Y. Matsuura,et al.  Interaction of Hepatitis C Virus Core Protein with Viral Sense RNA and Suppression of Its Translation , 1999, Journal of Virology.

[15]  T. Liang,et al.  Hepatitis C virus-like particles synthesized in insect cells as a potential vaccine candidate. , 1999, Gastroenterology.

[16]  A. Sherker,et al.  Specific in vitro association between the hepatitis C viral genome and core protein , 1999, Journal of medical virology.

[17]  M. Yanagi,et al.  Toward a surrogate model for hepatitis C virus: An infectious molecular clone of the GB virus-B hepatitis agent. , 1999, Virology.

[18]  M. Lai,et al.  An internal polypyrimidine-tract-binding protein-binding site in the hepatitis C virus RNA attenuates translation, which is relieved by the 3'-untranslated sequence. , 1999, Virology.

[19]  W. Syu,et al.  Self-association of the C-terminal domain of the hepatitis-C virus core protein. , 1998, European Journal of Biochemistry.

[20]  F Tsuda,et al.  The entire nucleotide sequences of three hepatitis C virus isolates in genetic groups 7-9 and comparison with those in the other eight genetic groups. , 1998, The Journal of general virology.

[21]  M. Ichikawa,et al.  The Native Form and Maturation Process of Hepatitis C Virus Core Protein , 1998, Journal of Virology.

[22]  D. Wong,et al.  Hepatitis C Virus Structural Proteins Assemble into Viruslike Particles in Insect Cells , 1998, Journal of Virology.

[23]  P. Simmonds,et al.  Characteristics of Nucleotide Substitution in the Hepatitis C Virus Genome: Constraints on Sequence Change in Coding Regions at Both Ends of the Genome , 1997, Journal of Molecular Evolution.

[24]  M. Yanagi,et al.  Transcripts from a single full-length cDNA clone of hepatitis C virus are infectious when directly transfected into the liver of a chimpanzee. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[25]  H. Kräusslich,et al.  Analysis of hepatitis C virus core protein interaction domains. , 1997, The Journal of general virology.

[26]  P. Simmonds,et al.  The origin of hepatitis C virus genotypes. , 1997, The Journal of general virology.

[27]  R. Bhat,et al.  Regulated processing of hepatitis C virus core protein is linked to subcellular localization , 1997, Journal of virology.

[28]  E. Holmes,et al.  Evolutionary analysis of variants of hepatitis C virus found in South-East Asia: comparison with classifications based upon sequence similarity. , 1996, The Journal of general virology.

[29]  M. Honda,et al.  Stability of a stem-loop involving the initiator AUG controls the efficiency of internal initiation of translation on hepatitis C virus RNA. , 1996, RNA.

[30]  M. Selby,et al.  Interaction between hepatitis C virus core protein and E1 envelope protein , 1996, Journal of virology.

[31]  M. Lai,et al.  Homotypic interaction and multimerization of hepatitis C virus core protein. , 1996, Virology.

[32]  M. Lai,et al.  Differential subcellular localization of hepatitis C virus core gene products. , 1995, Virology.

[33]  I. Brierley,et al.  Ribosomal frameshifting viral RNAs. , 1995, The Journal of general virology.

[34]  S. Chen,et al.  Modulation of the trans-suppression activity of hepatitis C virus core protein by phosphorylation , 1995, Journal of virology.

[35]  P. Simmonds Variability of hepatitis C virus , 1995, Hepatology.

[36]  R. Purcell,et al.  Genetic Heterogeneity of Hepatitis C Virus: Quasispecies and Genotypes , 1995, Seminars in liver disease.

[37]  J. Yen,et al.  Nuclear localization signals in the core protein of hepatitis C virus. , 1994, Biochemical and Biophysical Research Communications - BBRC.

[38]  R. Purcell,et al.  Sequence analysis of the core gene of 14 hepatitis C virus genotypes. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[39]  M. Sakamoto,et al.  Entire nucleotide sequence and characterization of a hepatitis C virus of genotype V/3a. , 1994, The Journal of general virology.

[40]  M. Selby,et al.  Comparative studies of the core gene products of two different hepatitis C virus isolates: two alternative forms determined by a single amino acid substitution. , 1994, Virology.

[41]  P. Simmonds,et al.  Hepatitis C serotype and response to interferon therapy. , 1994, The New England journal of medicine.

[42]  R. Lanford,et al.  Analysis of hepatitis C virus capsid, E1, and E2/NS1 proteins expressed in insect cells. , 1993, Virology.

[43]  A. Siddiqui,et al.  Translation of human hepatitis C virus RNA in cultured cells is mediated by an internal ribosome-binding mechanism , 1993, Journal of virology.

[44]  M. Houghton,et al.  Expression, identification and subcellular localization of the proteins encoded by the hepatitis C viral genome. , 1993, The Journal of general virology.

[45]  C. Rice,et al.  Expression and identification of hepatitis C virus polyprotein cleavage products , 1993, Journal of virology.

[46]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[47]  L. Ping,et al.  Secondary structure of the 5' nontranslated regions of hepatitis C virus and pestivirus genomic RNAs. , 1992, Nucleic acids research.

[48]  S. Mishiro,et al.  Two distinct subtypes of hepatitis C virus defined by antibodies directed to the putative core protein , 1992, Hepatology.

[49]  A. Weiner,et al.  Hepatitis C virus (HCV) circulates as a population of different but closely related genomes: quasispecies nature of HCV genome distribution , 1992, Journal of virology.

[50]  A. Nomoto,et al.  Internal ribosome entry site within hepatitis C virus RNA , 1992, Journal of virology.

[51]  P. Barr,et al.  Genetic organization and diversity of the hepatitis C virus. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[52]  M. Houghton,et al.  Isolation of a cDNA clone derived from a blood-borne non-A, non-B viral hepatitis genome. , 1989, Science.

[53]  M. Record Effects of Na+ and Mg++ ions on the helix–coil transition of DNA , 1975 .

[54]  T. Gojobori,et al.  Reduction of synonymous substitutions in the core protein gene of hepatitis C virus , 2004, Journal of Molecular Evolution.

[55]  M. Lai,et al.  Hepatitis C virus core protein: possible roles in viral pathogenesis. , 2000, Current topics in microbiology and immunology.

[56]  R H Purcell,et al.  Clinical significance of hepatitis C virus genotypes and quasispecies. , 2000, Seminars in liver disease.

[57]  Y. Matsuura,et al.  Nuclear localization of the truncated hepatitis C virus core protein with its hydrophobic C terminus deleted. , 1995, The Journal of general virology.