Contrasts in codon usage of latent versus productive genes of Epstein-Barr virus: data and hypotheses

Epstein-Barr virus (EBV) has two different modes of existence: latent and productive. There are eight known genes expressed during latency (and hardly at all during the productive phase) and about 70 other ("productive") genes. It is shown that the EBV genes known to be expressed during latency display codon usage strikingly different from that of genes that are expressed during lytic growth. In particular, the percentage of S3 (G or C in codon site 3) is persistently lower (about 20%) in all latent genes than in nonlatent genes. Moreover, S3 is lower in each multicodon amino acid form. Also, the percentage of S in silent codon sites 1 of leucine and arginine is lower in latent than in nonlatent genes. The largest absolute differences in amino acid usage between latent and nonlatent genes emphasize codon types SSN and WWN (W means nucleotide A or T and N is any nucleotide). Two principal explanations to account for the EBV latent versus productive gene codon disparity are proposed. Latent genes have codon usage substantially different from that of host cell genes to minimize the deleterious consequences to the host of viral gene expression during latency. (Productive genes are not so constrained.) It is also proposed that the latency genes of EBV were acquired recently by the viral genome. Evidence and arguments for these proposals are presented.

[1]  L. J. Perry,et al.  The complete DNA sequence of the long unique region in the genome of herpes simplex virus type 1. , 1988, The Journal of general virology.

[2]  S Karlin,et al.  Comparative statistics for DNA and protein sequences: single sequence analysis. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[3]  E. Kieff,et al.  Epstein-Barr virus nuclear antigen 2 specifically induces expression of the B-cell activation antigen CD23. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[4]  M. Perricaudet,et al.  An Epstein-Barr virus transcription unit is at least 84 kilobases long. , 1986, Nucleic acids research.

[5]  E. Kieff,et al.  A sixth Epstein-Barr virus nuclear protein (EBNA3B) is expressed in latently infected growth-transformed lymphocytes , 1988, Journal of virology.

[6]  W. W. Ralph,et al.  Codon usage in the vertebrate hemoglobins and its implications. , 1985, Molecular biology and evolution.

[7]  H. Tabak,et al.  A fused chimeric protein made in human cells , 1989, Cell.

[8]  P. L. Deininger,et al.  DNA sequence and expression of the B95-8 Epstein—Barr virus genome , 1984, Nature.

[9]  D J Lipman,et al.  Contextual constraints on synonymous codon choice. , 1983, Journal of molecular biology.

[10]  J. Bennetzen,et al.  Codon selection in yeast. , 1982, The Journal of biological chemistry.

[11]  E. G. Shpaer Constraints on codon context in Escherichia coli genes. Their possible role in modulating the efficiency of translation. , 1986, Journal of molecular biology.

[12]  I. Ernberg,et al.  An Epstein-Barr virus (EBV)-determined nuclear antigen (EBNA5) partly encoded by the transformation-associated Bam WYH region of EBV DNA: preferential expression in lymphoblastoid cell lines. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[13]  R. Tjian,et al.  Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins. , 1989, Science.

[14]  J. Strominger,et al.  Analysis of the transcript encoding the latent Epstein-Barr virus nuclear antigen I: a potentially polycistronic message generated by long-range splicing of several exons. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Perricaudet,et al.  Spliced RNA from the IR1‐U2 region of Epstein‐Barr virus: presence of an open reading frame for a repetitive polypeptide. , 1984, The EMBO journal.

[16]  E. Kieff,et al.  A fifth Epstein-Barr virus nuclear protein (EBNA3C) is expressed in latently infected growth-transformed lymphocytes , 1988, Journal of virology.

[17]  E. Kieff,et al.  Repeat arrays in cellular DNA related to the Epstein-Barr virus IR3 repeat , 1985, Molecular and cellular biology.

[18]  J. Filipski,et al.  Correlation between molecular clock ticking, codon usage, fidelity of DNA repair, chromosome banding and chromatin compactness in germline cells , 1987, FEBS letters.

[19]  A. Suyama,et al.  Local stability of DNA and RNA secondary structure and its relation to biological functions. , 1986, Progress in biophysics and molecular biology.

[20]  C. Cantor,et al.  Approaches to physical mapping of the human genome. , 1986, Cold Spring Harbor symposia on quantitative biology.

[21]  G Bernardi,et al.  The mosaic genome of warm-blooded vertebrates. , 1985, Science.

[22]  G. Hayward,et al.  Sequence-specific DNA binding of the Epstein-Barr virus nuclear antigen (EBNA-1) to clustered sites in the plasmid maintenance region , 1985, Cell.

[23]  E. Kieff,et al.  One of two Epstein-Barr virus nuclear antigens contains a glycine-alanine copolymer domain. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[24]  G. W. Hatfield,et al.  Nonrandom utilization of codon pairs in Escherichia coli. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[25]  W. Hammerschmidt,et al.  Identification and characterization of oriLyt, a lytic origin of DNA replication of Epstein-Barr virus , 1988, Cell.

[26]  A. Davison,et al.  Genetic relations between varicella-zoster virus and Epstein-Barr virus. , 1987, The Journal of general virology.

[27]  Manolo Gouy,et al.  Codon catalog usage is a genome strategy modulated for gene expressivity , 1981, Nucleic Acids Res..

[28]  R. L. Baldwin,et al.  HELIX--RANDOM COIL TRANSITIONS IN DNA HOMOPOLYMER PAIRS. , 1964, Journal of molecular biology.

[29]  R. Nussinov,et al.  Preferential codon usage in genes. , 1981, Gene.

[30]  E. Kieff,et al.  Identification and characterization of a cellular protein that cross-reacts with the Epstein-Barr virus nuclear antigen , 1984, Journal of virology.

[31]  E. Mocarski,et al.  Herpes simplex virus latent RNA (LAT) is not required for latent infection in the mouse. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[32]  D C Shields,et al.  Synonymous codon usage in Bacillus subtilis reflects both translational selection and mutational biases. , 1987, Nucleic acids research.

[33]  G. Klein,et al.  Purification of the Epstein-Barr virus-determined nuclear antigen from Epstein-Barr virus-transformed human lymphoid cell lines , 1978, Journal of virology.

[34]  A J Davison,et al.  The complete DNA sequence of varicella-zoster virus. , 1986, The Journal of general virology.

[35]  W. Fiers,et al.  Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. , 1982, Gene.

[36]  U. Nater,et al.  Epstein-Barr virus. , 1991, The Journal of family practice.

[37]  B. Sugden An intricate route to immortality , 1989, Cell.

[38]  N. DeLuca,et al.  Physical and functional domains of the herpes simplex virus transcriptional regulatory protein ICP4 , 1988, Journal of virology.

[39]  P. Highton,et al.  Similarities between the DNA molecules of bacteriophages 424, lambda, and 21, determined by denaturation and electron microscopy. , 1975, Virology.

[40]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[41]  E. Kieff,et al.  Nucleotide sequences of mRNAs encoding Epstein-Barr virus nuclear proteins: a probable transcriptional initiation site. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[42]  G. Klein Viral latency and transformation: The strategy of Epstein-Barr virus , 1989, Cell.

[43]  P. Chavrier,et al.  Both Epstein‐Barr virus (EBV)‐encoded trans‐acting factors, EB1 and EB2, are required to activate transcription from an EBV early promoter. , 1986, The EMBO journal.

[44]  S Karlin,et al.  A method to identify distinctive charge configurations in protein sequences, with application to human herpesvirus polypeptides. , 1989, Journal of molecular biology.

[45]  D. Reisman,et al.  A putative origin of replication of plasmids derived from Epstein-Barr virus is composed of two cis-acting components , 1985, Molecular and cellular biology.

[46]  N. Sueoka Directional mutation pressure and neutral molecular evolution. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[47]  R. Blake,et al.  Analysis of the codon bias in E. coli sequences. , 1984, Journal of biomolecular structure & dynamics.

[48]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. , 1981, Journal of molecular biology.

[49]  T Gojobori,et al.  Codon usage tabulated from the GenBank genetic sequence data. , 1991, Nucleic acids research.

[50]  F. Rodier,et al.  Two distinct compositional classes of vertebrate gene-bearing DNA stretches, their structures and possible evolutionary origin. , 1987, DNA.

[51]  M. Perricaudet,et al.  A promoter for the highly spliced EBNA family of RNAs of Epstein-Barr virus , 1987, Journal of virology.

[52]  J. Miller,et al.  Analysis of mutation in human cells by using an Epstein-Barr virus shuttle system , 1987, Molecular and cellular biology.

[53]  M. Gouy,et al.  Codon catalog usage and the genome hypothesis. , 1980, Nucleic acids research.

[54]  E. Kieff,et al.  Two related Epstein-Barr virus membrane proteins are encoded by separate genes , 1989, Journal of virology.

[55]  A. Bird CpG-rich islands and the function of DNA methylation , 1986, Nature.

[56]  J. Dillner,et al.  BamHI E region of the Epstein-Barr virus genome encodes three transformation-associated nuclear proteins. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[57]  M Yarus,et al.  Sense codons are found in specific contexts. , 1985, Journal of molecular biology.

[58]  B. Sugden,et al.  A promoter of Epstein-Barr virus that can function during latent infection can be transactivated by EBNA-1, a viral protein required for viral DNA replication during latent infection , 1989, Journal of virology.