Codon usage between genomes is constrained by genome-wide mutational processes.

Analysis of genome-wide codon bias shows that only two parameters effectively differentiate the genome-wide codon bias of 100 eubacterial and archaeal organisms. The first parameter correlates with genome GC content, and the second parameter correlates with context-dependent nucleotide bias. Both of these parameters may be calculated from intergenic sequences. Therefore, genome-wide codon bias in eubacteria and archaea may be predicted from intergenic sequences that are not translated. When these two parameters are calculated for genes from nonmammalian eukaryotic organisms, genes from the same organism again have similar values, and genome-wide codon bias may also be predicted from intergenic sequences. In mammals, genes from the same organism are similar only in the second parameter, because GC content varies widely among isochores. Our results suggest that, in general, genome-wide codon bias is determined primarily by mutational processes that act throughout the genome, and only secondarily by selective forces acting on translated sequences.

[1]  D. Chessel,et al.  Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria. , 2003, Journal of applied genetics.

[2]  M. Gouy,et al.  Codon usage in bacteria: correlation with gene expressivity. , 1982, Nucleic acids research.

[3]  Hugo Naya,et al.  Aerobiosis Increases the Genomic Guanine Plus Cytosine Content (GC%) in Prokaryotes , 2002, Journal of Molecular Evolution.

[4]  Sanford Weisberg Wiley Series in Probability and Statistics , 2005 .

[5]  Wen-Hsiung Li,et al.  The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias. , 1987, Molecular biology and evolution.

[6]  William Lee,et al.  Genome-tools: a flexible package for genome sequence analysis. , 2002, BioTechniques.

[7]  C. Vieille,et al.  Hyperthermophilic Enzymes: Sources, Uses, and Molecular Mechanisms for Thermostability , 2001, Microbiology and Molecular Biology Reviews.

[8]  Malik Beshir Malik,et al.  Applied Linear Regression , 2005, Technometrics.

[9]  S. Osawa,et al.  Recent evidence for evolution of the genetic code , 1992, Microbiological reviews.

[10]  C. Woese On the evolution of cells , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  G. Bernardi,et al.  The compositional distribution of coding sequences and DNA molecules in humans and murids , 2005, Journal of Molecular Evolution.

[12]  S. Kanaya,et al.  Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome. , 2001, Gene.

[13]  Etsuko N. Moriyama,et al.  Codon Usage Bias and tRNA Abundance in Drosophila , 1997, Journal of Molecular Evolution.

[14]  T. Miyata,et al.  Secondary structure of MS2 phage RNA and bias in code word usage. , 1979, Nucleic acids research.

[15]  T. Hassard,et al.  Applied Linear Regression , 2005 .

[16]  N. Sueoka Directional mutation pressure and neutral molecular evolution. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[17]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[18]  D C Shields,et al.  "Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons. , 1988, Molecular biology and evolution.

[19]  T. Lindahl Instability and decay of the primary structure of DNA , 1993, Nature.

[20]  M. Gouy,et al.  Codon catalog usage and the genome hypothesis. , 1980, Nucleic acids research.

[21]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[22]  D. Lynn,et al.  Synonymous codon usage is subject to selection in thermophilic bacteria. , 2002, Nucleic acids research.

[23]  D. Forsdyke,et al.  Thermophilic bacteria strictly obey Szybalski's transcription direction rule and politely purine-load RNAs with both adenine and guanine. , 2000, Genome research.

[24]  T. Jukes,et al.  Relationship between G + C in silent sites of codons and amino acid composition of human proteins , 1993, Journal of Molecular Evolution.

[25]  L. Duret,et al.  Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[26]  C. Glover,et al.  Gene expression profiling for hematopoietic cell culture , 2006 .

[27]  B. Bainbridge,et al.  Genetics , 1981, Experientia.

[28]  S. Karlin,et al.  Comparative DNA analysis across diverse genomes. , 1998, Annual review of genetics.

[29]  D. Hartl,et al.  Selection intensity for codon bias. , 1994, Genetics.

[30]  M. Bulmer The selection-mutation-drift theory of synonymous codon usage. , 1991, Genetics.

[31]  S. Osawa,et al.  The guanine and cytosine content of genomic DNA and bacterial evolution. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[32]  W. Doolittle You are what you eat: a gene transfer ratchet could account for bacterial genes in eukaryotic nuclear genomes. , 1998, Trends in genetics : TIG.

[33]  R. Gambari,et al.  Codon usage and secondary structure of the rabbit alpha-globin mRNA: a hypothesis. , 1990, Biomedica biochimica acta.

[34]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[35]  Giorgio Bernardi,et al.  Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins , 1991, Journal of Molecular Evolution.

[36]  C. Gautier,et al.  Hydrophobicity, expressivity and aromaticity are the major trends of amino-acid usage in 999 Escherichia coli chromosome-encoded genes. , 1994, Nucleic acids research.

[37]  M. Bulmer,et al.  Coevolution of codon usage and transfer RNA abundance , 1987, Nature.

[38]  A. Eyre-Walker,et al.  Synonymous codon bias is related to gene length in Escherichia coli: selection for translational accuracy? , 1996, Molecular biology and evolution.

[39]  Mads Kærn,et al.  Noise in eukaryotic gene expression , 2003, Nature.

[40]  G Bernardi,et al.  The correlation of protein hydropathy with the base composition of coding sequences. , 1999, Gene.

[41]  K Watanabe,et al.  Archaeal adaptation to higher temperatures revealed by genomic sequence of Thermoplasma volcanium. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Manolo Gouy,et al.  Codon catalog usage is a genome strategy modulated for gene expressivity , 1981, Nucleic Acids Res..

[43]  M. Huynen,et al.  Equal G and C contents in histone genes indicate selection pressures on mRNA secondary structure , 1992, Journal of Molecular Evolution.

[44]  P. Sharp,et al.  Codon usage: mutational bias, translational selection, or both? , 1993, Biochemical Society transactions.

[45]  T. Ikemura Codon usage and tRNA content in unicellular and multicellular organisms. , 1985, Molecular biology and evolution.