Amino acid cost and codon-usage biases in 6 prokaryotic genomes: a whole-genome analysis.

For most prokaryotic organisms, amino acid biosynthesis represents a significant portion of their overall energy budget. The difference in the cost of synthesis between amino acids can be striking, differing by as much as 7-fold. Two prokaryotic organisms, Escherichia coli and Bacillus subtilis, have been shown to preferentially utilize less costly amino acids in highly expressed genes, indicating that parsimony in amino acid selection may confer a selective advantage for prokaryotes. This study confirms those findings and extends them to 4 additional prokaryotic organisms: Chlamydia trachomatis, Chlamydophila pneumoniae AR39, Synechocystis sp. PCC 6803, and Thermus thermophilus HB27. Adherence to codon-usage biases for each of these 6 organisms is inversely correlated with a coding region's average amino acid biosynthetic cost in a fashion that is independent of chemoheterotrophic, photoautotrophic, or thermophilic lifestyle. The obligate parasites C. trachomatis and C. pneumoniae AR39 are incapable of synthesizing many of the 20 common amino acids. Removing auxotrophic amino acids from consideration in these organisms does not alter the overall trend of preferential use of energetically inexpensive amino acids in highly expressed genes.

[1]  Wm. R. Wright General Intelligence, Objectively Determined and Measured. , 1905 .

[2]  S. Golden,et al.  Stability of the Synechococcus elongatus PCC 7942 circadian clock under directed anti-phase expression of the kai genes. , 2005, Microbiology.

[3]  D. Axe,et al.  Extreme functional sensitivity to conservative amino acid changes on enzyme exteriors. , 2000, Journal of molecular biology.

[4]  Trinad Chakraborty,et al.  GenomeViz: visualizing microbial genomes , 2004, BMC Bioinformatics.

[5]  G Humphreys,et al.  Codon usage can affect efficiency of translation of genes in Escherichia coli. , 1984, Nucleic acids research.

[6]  S. Kanaya,et al.  Studies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis. , 1999, Gene.

[7]  A. H. Stouthamer A theoretical study on the amount of ATP required for synthesis of microbial cell material , 2007, Antonie van Leeuwenhoek.

[8]  P. Sharp,et al.  Absence of translationally selected synonymous codon usage bias in Helicobacter pylori. , 2000, Microbiology.

[9]  A Carbone,et al.  Codon bias signatures, organization of microorganisms in codon space, and lifestyle. , 2005, Molecular biology and evolution.

[10]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[11]  M. Gouy,et al.  Codon catalog usage and the genome hypothesis. , 1980, Nucleic acids research.

[12]  S. Razin Adherence of Pathogenic Mycoplasmas to Host Cells , 1999, Bioscience reports.

[13]  Lorenz Wernisch,et al.  Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome. , 2003, Nucleic Acids Research.

[14]  Takashi Gojobori,et al.  Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Philippe Bessières,et al.  Micado - a network-oriented database for microbial genomes , 1997, Comput. Appl. Biosci..

[16]  Dmitrij Frishman,et al.  Illuminating the Evolutionary History of Chlamydiae , 2004, Science.

[17]  A. L. Koch Microbial Physiology and Ecology of Slow Growth , 1997, Microbiology and Molecular Biology Reviews.

[18]  Natalia Maltsev,et al.  WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction , 2000, Nucleic Acids Res..

[19]  Stefano Pascarella,et al.  Comparative structural analysis of psychrophilic and meso‐ and thermophilic enzymes , 2002, Proteins.

[20]  A. L. Koch,et al.  The adaptive responses of Escherichia coli to a feast and famine existence. , 1971, Advances in microbial physiology.

[21]  F. Blattner,et al.  Functional Genomics: Expression Analysis ofEscherichia coli Growing on Minimal and Rich Media , 1999, Journal of bacteriology.

[22]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[23]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[24]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. , 1981, Journal of molecular biology.

[25]  A. Berger FUNDAMENTALS OF BIOSTATISTICS , 1969 .

[26]  Eduardo P C Rocha,et al.  Base composition bias might result from competition for metabolic resources. , 2002, Trends in genetics : TIG.

[27]  A. Eyre-Walker,et al.  Synonymous codon bias is related to gene length in Escherichia coli: selection for translational accuracy? , 1996, Molecular biology and evolution.

[28]  D. E. Atkinson Cellular Energy Metabolism and its Regulation , 1977 .

[29]  D. Fell,et al.  Modelling photosynthesis and its control. , 2000, Journal of experimental botany.

[30]  T. Ikemura Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes. , 1981, Journal of molecular biology.

[31]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[32]  R. Nussinov,et al.  Factors enhancing protein thermostability. , 2000, Protein engineering.

[33]  Hugo Naya,et al.  Trends in Codon and Amino Acid Usage in Thermotoga maritima , 2002, Journal of Molecular Evolution.

[34]  J. Parker,et al.  Missense misreading of asparagine codons as a function of codon identity and context. , 1987, The Journal of biological chemistry.

[35]  Folker Meyer,et al.  Comparing expression level‐dependent features in codon usage with protein abundance: An analysis of ‘predictive proteomics’ , 2004, Proteomics.

[36]  C. Craig,et al.  Selection costs of amino acid substitutions in ColE1 and ColIa gene clusters harbored by Escherichia coli. , 1998, Molecular biology and evolution.

[37]  C. Fraser,et al.  Complete genome sequence of the Q-fever pathogen Coxiella burnetii , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Gouy,et al.  Codon frequencies in 119 individual genes confirm consistent choices of degenerate bases according to genome type. , 1980, Nucleic acids research.

[39]  Hiroshi Akashi,et al.  Translational selection and yeast proteome evolution. , 2003, Genetics.

[40]  Santiago Garcia-Vallvé,et al.  HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes , 2003, Nucleic Acids Res..

[41]  Paul M. Sharp,et al.  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes , 1986, Nucleic Acids Res..

[42]  W. Hess Genome analysis of marine photosynthetic microbes and their global role. , 2004, Current opinion in biotechnology.