Comparison of synonymous codon distribution patterns of bacteriophage and host genomes.

Synonymous codon usage patterns of bacteriophage and host genomes were compared. Two indexes, G + C base composition of a gene (fgc) and fraction of translationally optimal codons of the gene (fop), were used in the comparison. Synonymous codon usage data of all the coding sequences on a genome are represented as a cloud of points in the plane of fop vs. fgc. The Escherichia coli coding sequences appear to exhibit two phases, "rising" and "flat" phases. Genes that are essential for survival and are thought to be native are located in the flat phase, while foreign-type genes from prophages and transposons are found in the rising phase with a slope of nearly unity in the fgc vs. fop plot. Synonymous codon distribution patterns of genes from temperate phages P4, P2, N15 and lambda are similar to the pattern of E. coli rising phase genes. In contrast, genes from the virulent phage T7 or T4, for which a phage-encoded DNA polymerase is identified, fall in a linear curve with a slope of nearly zero in the fop vs. fgc plane. These results may suggest that the G + C contents for T7, T4 and E. coli flat phase genes are subject to the directional mutation pressure and are determined by the DNA polymerase used in the replication. There is significant variation in the fop values of the phage genes, suggesting an adjustment to gene expression level. Similar analyses of codon distribution patterns were carried out for Haemophilus influenzae, Bacillus subtilis, Mycobacterium tuberculosis and their phages with complete genomic sequences available.

[1]  P. Sharp,et al.  Codon usage in the Mycobacterium tuberculosis complex. , 1996, Microbiology.

[2]  W. Fiers,et al.  Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. , 1982, Gene.

[3]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes. , 1981, Journal of molecular biology.

[4]  T. Ohama,et al.  Role of GC-biased mutation pressure on synonymous codon choice in Micrococcus luteus, a bacterium with a high genomic GC-content. , 1990, Nucleic acids research.

[5]  K. Umesono,et al.  Directional mutation pressure and transfer RNA in choice of the third nucleotide of synonymous two-codon sets. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[6]  S. Finkel,et al.  DNA sequence of satellite bacteriophage P4. , 1990, Nucleic acids research.

[7]  Toshimichi Ikemura,et al.  Detection of genes in Escherichia coli sequences determined by genome projects and prediction of protein production levels, based on multivariate diversity in codon usage , 1996, Comput. Appl. Biosci..

[8]  A. Goffeau,et al.  The complete genome sequence of the Gram-positive bacterium Bacillus subtilis , 1997, Nature.

[9]  D. Hatfield,et al.  Transfer RNA in Protein Synthesis , 1992 .

[10]  D. Esposito,et al.  The complete nucleotide sequence of bacteriophage HP1 DNA. , 1996, Nucleic acids research.

[11]  G. Sarkis,et al.  Genome structure of mycobacteriophage D29: implications for phage evolution. , 1998, Journal of molecular biology.

[12]  H. Ochman,et al.  Amelioration of Bacterial Genomes: Rates of Change and Exchange , 1997, Journal of Molecular Evolution.

[13]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[14]  N. Sueoka On the genetic basis of variation and heterogeneity of DNA base composition. , 1962, Proceedings of the National Academy of Sciences of the United States of America.

[15]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[16]  T. Ikemura Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes. Differences in synonymous codon choice patterns of yeast and Escherichia coli with reference to the abundance of isoaccepting transfer RNAs. , 1982, Journal of molecular biology.

[17]  B. Barrell,et al.  Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence , 1998, Nature.

[18]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.

[19]  T. Trautner,et al.  The complete nucleotide sequence and functional organization of Bacillus subtilis bacteriophage SPP1. , 1997, Gene.

[20]  P. Urbánek,et al.  Nucleotide sequence of the right early region of Bacillus subtilis phage PZA completes the 19366-bp sequence of PZA genome. Comparison with the homologous sequence of phage phi 29. , 1986, Gene.

[21]  G Osterburg,et al.  Nucleotide sequence of bacteriophage fd DNA. , 1978, Nucleic acids research.

[22]  A. Danchin,et al.  Evidence for horizontal gene transfer in Escherichia coli speciation. , 1991, Journal of molecular biology.

[23]  Manolo Gouy,et al.  Codon catalog usage is a genome strategy modulated for gene expressivity , 1981, Nucleic Acids Res..

[24]  H. Ochman,et al.  Molecular archaeology of the Escherichia coli genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Gouy,et al.  Codon usage in bacteria: correlation with gene expressivity. , 1982, Nucleic acids research.

[26]  F. Sanger,et al.  Nucleotide sequence of bacteriophage φX174 DNA , 1977, Nature.

[27]  T. Kunisawa,et al.  Synonymous codon preferences in bacteriophage T4: a distinctive use of transfer RNAs from T4 and from its host Escherichia coli. , 1992, Journal of theoretical biology.

[28]  G. Sarkis,et al.  DNA sequence, structure and gene expression of mycobacteriophage L5: a phage system for mycobacterial genetics , 1993, Molecular microbiology.

[29]  N. Sueoka Directional mutation pressure and neutral molecular evolution. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[30]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. , 1981, Journal of molecular biology.

[31]  A. Pavesi,et al.  Transfer RNA gene redundancy and translational selection in Saccharomyces cerevisiae. , 1997, Journal of molecular biology.

[32]  F. Studier,et al.  Complete nucleotide sequence of bacteriophage T7 DNA and the locations of T7 genetic elements. , 1983, Journal of molecular biology.

[33]  M. Gouy,et al.  Codon catalog usage and the genome hypothesis. , 1980, Nucleic acids research.