Molecular evolution of the hepatitis B virus genome

The hepatitis B virus (HBV) has a circular DNA genome of about 3,200 base pairs. Economical use of the genome with overlapping reading frames may have led to severe constraints on nucleotide substitutions along the genome and to highly variable rates of substitution among nucleotide sites. Nucleotide sequences from 13 complete HBV genomes were compared to examine such variability of substitution rates among sites and to examine the phylogenetic relationships among the HBV variants. The maximum likelihood method was employed to fit models of DNA sequence evolution that can account for the complexity of the pattern of nucleotide substitution. Comparison of the models suggests that the rates of substitution are different in different genes and codon positions; for example, the third codon position changes at a rate over ten times higher than the second position. Furthermore, substantial variation of substitution rates was detected even after the effects of genes and codon positions were corrected; that is, rates are different at different sites of the same gene or at the same codon position. Such rates after the correction were also found to be positively correlated at adjacent sites, which indicated the existence of conserved and variable domains in the proteins encoded by the viral genome. A multiparameter model validates the earlier finding that the variation in nucleotide conservation is not random around the HBV genome. The test for the existence of a molecular clock suggests that substitution rates are more or less constant among lineages. The phylogenetic relationships among the viral variants were examined. Although the data do not seem to contain sufficient information to resolve the details of the phylogeny, it appears quite certain that the serotypes of the viral variants do not reflect their genetic relatedness.

[1]  Y. Ono,et al.  Tbe complete nudeotide sequences of the cloned hepatitis B vims DNA; subtype adr and adw , 1983 .

[2]  T. Gojobori,et al.  Host-independent evolution and a genetic classification of the hepadnavirus family based on nucleotide sequences. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[3]  J. Wakeley Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA , 1993, Journal of Molecular Evolution.

[4]  N. Goldman,et al.  Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation. , 1994, Molecular biology and evolution.

[5]  Wen-Hsiung Li Unbiased estimation of the rates of synonymous and nonsynonymous substitution , 2006, Journal of Molecular Evolution.

[6]  V. Bichko,et al.  Subtype ayw variant of hepatitis B virus , 1985, FEBS letters.

[7]  N. Goldman Simple diagnostic statistical tests of models for DNA substitution , 1993, Journal of Molecular Evolution.

[8]  Takashi Miyata,et al.  Molecular evolution of mRNA: A method for estimating evolutionary rates of synonymous and amino acid substitutions from homologous nucleotide sequences and its application , 1980, Journal of Molecular Evolution.

[9]  Philip E. Gill,et al.  Practical optimization , 1981 .

[10]  Ziheng Yang Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods , 1994, Journal of Molecular Evolution.

[11]  N. Goldman,et al.  A codon-based model of nucleotide substitution for protein-coding DNA sequences. , 1994, Molecular biology and evolution.

[12]  I. Lauder,et al.  The variability of the hepatitis B virus genome: statistical analysis and biological implications. , 1993, Molecular biology and evolution.

[13]  Masami Hasegawa,et al.  CONFIDENCE LIMITS ON THE MAXIMUM‐LIKELIHOOD ESTIMATE OF THE HOMINOID TREE FROM MITOCHONDRIAL‐DNA SEQUENCES , 1989, Evolution; international journal of organic evolution.

[14]  S. Muse,et al.  A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. , 1994, Molecular biology and evolution.

[15]  Liu Zp,et al.  The complete nucleotide sequence of the cloned DNA of hepatitis B virus subtype adr in pADR-1. , 1987 .

[16]  Z. Yang,et al.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. , 1993, Molecular biology and evolution.

[17]  H. Rho,et al.  The nucleotide sequence and reading frames of a mutant hepatitis B virus subtype adr. , 1989, Nucleic acids research.

[18]  George Tauchen,et al.  EVALUATION OF MAXIMUM LIKELIHOOD , 2001 .

[19]  Ziheng Yang Estimating the pattern of nucleotide substitution , 1994, Journal of Molecular Evolution.

[20]  M. G. Kendall,et al.  The advanced theory of statistics. Vols. 2. , 1969 .

[21]  J. Pugh,et al.  Expression of the X Gene of Hepatitis B Virus , 1986, Journal of medical virology.

[22]  H. Okamoto,et al.  Nucleotide sequence of a hepatitis B virus genome of subtype adw isolated from a Philippine: Comparison with the reported three genomes of the same subtype , 1988 .

[23]  C. Luo,et al.  A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. , 1985, Molecular biology and evolution.

[24]  H. Okamoto,et al.  Nucleotide sequence of a cloned hepatitis B virus genome, subtype ayr: comparison with genomes of the other three subtypes. , 1986, The Journal of general virology.

[25]  J. R. Fresco,et al.  Nucleotide Sequence , 2020, Definitions.

[26]  H. Okamoto,et al.  Genomic heterogeneity of hepatitis B virus in a 54-year-old woman who contracted the infection through materno-fetal transmission. , 1987, The Japanese journal of experimental medicine.

[27]  N. Bianchi,et al.  Evolution of the Zfx and Zfy genes: rates and interdependence between the genes. , 1993, Molecular biology and evolution.

[28]  F. Galibert,et al.  Nucleotide sequence of the hepatitis B virus genome (subtype ayw) cloned in E. coli , 1979, Nature.

[29]  M. Nei,et al.  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. , 1986, Molecular biology and evolution.

[30]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[31]  R. A. Doney,et al.  4. Probability and Random Processes , 1993 .

[32]  W. Rutter,et al.  THE NUCLEOTIDE SEQUENCE OF THE HEPATITIS B VIRAL GENOME AND THE IDENTIFICATION OF THE MAJOR VIRAL GENES , 1980 .

[33]  G. Grimmett,et al.  Probability and random processes , 2002 .

[34]  Nick Goldman,et al.  Statistical tests of models of DNA substitution , 1993, Journal of Molecular Evolution.

[35]  Nick Goldman,et al.  MAXIMUM LIKELIHOOD TREES FROM DNA SEQUENCES: A PECULIAR STATISTICAL ESTIMATION PROBLEM , 1995 .

[36]  Z. Yang,et al.  Mixed model analysis of DNA sequence evolution. , 1995, Biometrics.

[37]  L. Jin,et al.  Limitations of the evolutionary parsimony method of phylogenetic analysis. , 1990, Molecular biology and evolution.

[38]  K. Koike,et al.  Complete nucleotide sequence of hepatitis B virus DNA of subtype adr and its conserved gene organization. , 1984, Gene.

[39]  H. J. Lin,et al.  Biochemical detection of hepatitis B virus constituents. , 1989, Advances in clinical chemistry.

[40]  E. Cohen,et al.  Estimation of the number of nucleotide sequences in mouse DNA complementary to messenger RNAs specifying a complete mouse immunoglobulin. , 1976, Biochemistry.

[41]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[42]  Asao Fujiyama,et al.  Cloning and structural analyses of hepatitis B virus DNAs, subtype adr , 1983, Nucleic Acids Res..

[43]  H. Kishino,et al.  Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea , 1989, Journal of Molecular Evolution.

[44]  M. Nei,et al.  Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site. , 1994, Molecular biology and evolution.