Amino acid composition in endothermic vertebrates is biased in the same direction as in thermophilic prokaryotes

BackgroundAmong bacteria and archaea, amino acid usage is correlated with habitat temperatures. In particular, protein surfaces in species thriving at higher temperatures appear to be enriched in amino acids that stabilize protein structure and depleted in amino acids that decrease thermostability. Does this observation reflect a causal relationship, or could the apparent trend be caused by phylogenetic relatedness among sampled organisms living at different temperatures? And do proteins from endothermic and exothermic vertebrates show similar differences?ResultsWe find that the observed correlations between the frequencies of individual amino acids and prokaryotic habitat temperature are strongly influenced by evolutionary relatedness between the species analysed; however, a proteome-wide bias towards increased thermostability remains after controlling for phylogeny. Do eukaryotes show similar effects of thermal adaptation? A small shift of amino acid usage in the expected direction is observed in endothermic ('warm-blooded') mammals and chicken compared to ectothermic ('cold-blooded') vertebrates with lower body temperatures; this shift is not simply explained by nucleotide usage biases.ConclusionProtein homologs operating at different temperatures have different amino acid composition, both in prokaryotes and in vertebrates. Thus, during the transition from ectothermic to endothermic life styles, the ancestors of mammals and of birds may have experienced weak genome-wide positive selection to increase the thermostability of their proteins.

[1]  Jun S. Liu,et al.  Phylogenomics of nonavian reptiles and the structure of the ancestral amniote genome , 2007, Proceedings of the National Academy of Sciences.

[2]  D. Hickey,et al.  Thermal Adaptation of the Small Subunit Ribosomal RNA Gene: A Comparative Study , 2006, Journal of Molecular Evolution.

[3]  G. Olsen,et al.  Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Eugene I Shakhnovich,et al.  Physics and evolution of thermophilic adaptation. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Lobry,et al.  Relationships Between Genomic G+C Content, RNA Secondary Structures, and Optimal Growth Temperature in Prokaryotes , 1997, Journal of Molecular Evolution.

[6]  Michail Yu. Lobanov,et al.  Different packing of external residues can explain differences in the thermostability of proteins from thermophilic and mesophilic organisms , 2007, Bioinform..

[7]  Laurence D. Hurst,et al.  The evolution of isochores , 2001, Nature Reviews Genetics.

[8]  L. Duret,et al.  Recombination drives the evolution of GC-content in the human genome. , 2004, Molecular biology and evolution.

[9]  I. Jonassen,et al.  Structure-dependent relationships between growth temperature of prokaryotes and the amino acid frequency in their proteins , 2007, Extremophiles.

[10]  Hugo Naya,et al.  Trends of Amino Acid Usage in the Proteins from the Human Genome , 2007, Journal of biomolecular structure & dynamics.

[11]  M. R. Ruocco,et al.  Adaptation of model proteins from cold to hot environments involves continuous and small adjustments of average parameters related to amino acid composition. , 2008, Journal of theoretical biology.

[12]  Gerard Talavera,et al.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. , 2007, Systematic biology.

[13]  L. Branco,et al.  Physiology of temperature regulation: comparative aspects. , 2007, Comparative biochemistry and physiology. Part A, Molecular & integrative physiology.

[14]  A. R. Merchant,et al.  High guanine–cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[16]  B. Snel,et al.  Toward Automatic Reconstruction of a Highly Resolved Tree of Life , 2006, Science.

[17]  A. Suyama,et al.  Local stability of DNA and RNA secondary structure and its relation to biological functions. , 1986, Progress in biophysics and molecular biology.

[18]  M. Bonato,et al.  Preferred amino acids and thermostability. , 2003, Genetics and molecular research : GMR.

[19]  G. Bernardi,et al.  Compositional properties and thermal adaptation of 18S rRNA in vertebrates. , 2008, RNA.

[20]  Jean-Michel Claverie,et al.  Genomic Correlates of Hyperthermostability, an Update* , 2003, The Journal of Biological Chemistry.

[21]  Igor N. Berezovsky,et al.  Protein and DNA Sequence Determinants of Thermophilic Adaptation , 2006, PLoS Comput. Biol..

[22]  G. Singer,et al.  Genomic and proteomic adaptations to growth at high temperature , 2004, Genome Biology.

[23]  C. Cambillau,et al.  Structural and Genomic Correlates of Hyperthermostability* , 2000, The Journal of Biological Chemistry.

[24]  Laurent Duret,et al.  The Impact of Recombination on Nucleotide Substitutions in the Human Genome , 2008, PLoS genetics.

[25]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[26]  Campbell O. Webb,et al.  Bioinformatics Applications Note Phylocom: Software for the Analysis of Phylogenetic Community Structure and Trait Evolution , 2022 .

[27]  David P. Kreil,et al.  Identification of thermophilic species by the amino acid compositions deduced from their genomes. , 2001, Nucleic acids research.

[28]  Kenji Mizuguchi,et al.  Environment specific substitution tables for thermophilic proteins , 2007, BMC Bioinformatics.

[29]  Fredj Tekaia,et al.  Amino acid composition of genomes, lifestyles of organisms, and evolutionary trends: a global picture with correspondence analysis. , 2002, Gene.