Intimate Evolution of Proteins

Discerning the significant relations that exist within and among genome sequences is a major step toward the modeling of biopolymer evolution. Here we report the systematic analysis of the atomic composition of proteins encoded by organisms representative of each kingdoms. Protein atomic contents are shown to vary largely among species, the larger variations being observed for the main architectural component of proteins, the carbon atom. These variations apply to the bulk proteins as well as to subsets of ortholog proteins. A pronounced correlation between proteome carbon content and genome base composition is further evidenced, with high G+C genome content being related to low protein carbon content. The generation of random proteomes and the examination of the canonical genetic code provide arguments for the hypothesis that natural selection might have driven genome base composition.

[1]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[2]  J. Thorne,et al.  Models of protein sequence evolution and their applications. , 2000, Current opinion in genetics & development.

[3]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[4]  W. Li,et al.  Selective constraints, amino acid composition, and the rate of protein evolution. , 2000, Molecular biology and evolution.

[5]  C R Woese,et al.  Order in the genetic code. , 1965, Proceedings of the National Academy of Sciences of the United States of America.

[6]  N. Sueoka,et al.  Compositional correlation between deoxyribonucleic acid and protein. , 1961, Cold Spring Harbor symposia on quantitative biology.

[7]  E. O’Shea,et al.  Global analysis of protein expression in yeast , 2003, Nature.

[8]  P. Marlière,et al.  Molecular Evolution of Protein Atomic Composition , 2001, Science.

[9]  S. Karlin,et al.  Chance and statistical significance in protein and DNA sequence analysis. , 1992, Science.

[10]  Michel Werner,et al.  Sulfur sparing in the yeast proteome in response to sulfur demand. , 2002, Molecular cell.

[11]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .

[12]  G. Bernardi,et al.  Codon usage and genome composition , 2005, Journal of Molecular Evolution.

[13]  Takashi Gojobori,et al.  Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[14]  F. Neidhardt,et al.  Escherichia Coli and Salmonella: Typhimurium Cellular and Molecular Biology , 1987 .

[15]  G. Singer,et al.  Nucleotide bias causes a genomewide bias in the amino acid composition of proteins. , 2000, Molecular biology and evolution.

[16]  Philippe Marlière,et al.  Adaptive eradication of methionine and cysteine from cyanobacterial light-harvesting proteins , 1989, Nature.

[17]  Eduardo P C Rocha,et al.  Base composition bias might result from competition for metabolic resources. , 2002, Trends in genetics : TIG.

[18]  J. Lobry,et al.  Influence of genomic G+C content on average amino-acid composition of proteins from 59 bacterial species. , 1997, Gene.

[19]  N. Sueoka On the genetic basis of variation and heterogeneity of DNA base composition. , 1962, Proceedings of the National Academy of Sciences of the United States of America.

[20]  E. Koonin,et al.  The Impact of Comparative Genomics on Our Understanding of Evolution , 2000, Cell.

[21]  M. Nei Molecular Evolutionary Genetics , 1987 .

[22]  S. Osawa,et al.  The guanine and cytosine content of genomic DNA and bacterial evolution. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[23]  S. Lyles Biology of microorganisms , 1969 .

[24]  C. Woese,et al.  Bacterial evolution , 1987, Microbiological reviews.

[25]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[26]  Laurence D. Hurst,et al.  A quantitative measure of error minimization in the genetic code , 1991, Journal of Molecular Evolution.

[27]  S. Karlin,et al.  Quantile distributions of amino acid usage in protein classes. , 1992, Protein engineering.