Analysis of Nanoarchaeum equitans genome and proteome composition: indications for hyperthermophilic and parasitic adaptation

BackgroundNanoarchaeum equitans, the only known hyperthermophilic archaeon exhibiting parasitic life style, has raised some new questions about the evolution of the Archaea and provided a model of choice to study the genome landmarks correlated with thermo-parasitic adaptation. In this context, we have analyzed the genome and proteome composition of N. equitans and compared the same with those of other mesophiles, hyperthermophiles and obligatory host-associated organisms.ResultsAnalysis of nucleotide, codon and amino acid usage patterns in N. equitans indicates the presence of distinct selective constraints, probably due to its adaptation to a thermo-parasitic life-style. Among the conspicuous characteristics featuring its hyperthermophilic adaptation are overrepresentation of purine bases in protein coding sequences, higher GC-content in tRNA/rRNA sequences, distinct synonymous codon usage, enhanced usage of aromatic and positively charged residues, and decreased frequencies of polar uncharged residues, as compared to those in mesophilic organisms. Positively charged amino acid residues are relatively abundant in the encoded gene-products of N. equitans and other hyperthermophiles, which is reflected in their isoelectric point distribution. Pairwise comparison of 105 orthologous protein sequences shows a strong bias towards replacement of uncharged polar residues of mesophilic proteins by Lys/Arg, Tyr and some hydrophobic residues in their Nanoarchaeal orthologs. The traits potentially attributable to the symbiotic/parasitic life-style of the organism include the presence of apparently weak translational selection in synonymous codon usage and a marked heterogeneity in membrane-associated proteins, which may be important for N. equitans to interact with the host and hence, may help the organism to adapt to the strictly host-associated life style. Despite being strictly host-dependent, N. equitans follows cost minimization hypothesis.ConclusionThe present study reveals that the genome and proteome composition of N. equitans are marked with the signatures of dual adaptation – one to high temperature and the other to obligatory parasitism. While the analysis of nucleotide/amino acid preferences in N. equitans offers an insight into the molecular strategies taken by the archaeon for thermo-parasitic adaptation, the comparative study of the compositional characteristics of mesophiles, hyperthermophiles and obligatory host-associated organisms demonstrates the generality of such strategies in the microbial world.

[1]  David P. Kreil,et al.  Identification of thermophilic species by the amino acid compositions deduced from their genomes. , 2001, Nucleic acids research.

[2]  P. Sharp,et al.  Absence of translationally selected synonymous codon usage bias in Helicobacter pylori. , 2000, Microbiology.

[3]  D. Wall,et al.  Gene expression level influences amino acid usage, but not codon usage, in the tsetse fly endosymbiont Wigglesworthia. , 2003, Microbiology.

[4]  D. A. Dougherty,et al.  Cation-π interactions in structural biology , 1999 .

[5]  G. Singer,et al.  Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. , 2003, Gene.

[6]  G. Perrière,et al.  Use and misuse of correspondence analysis in codon usage studies. , 2002, Nucleic acids research.

[7]  A Ikai,et al.  Thermostability and aliphatic index of globular proteins. , 1980, Journal of biochemistry.

[8]  D. A. Dougherty,et al.  Cation-pi interactions in structural biology. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Igor N. Berezovsky,et al.  Entropic Stabilization of Proteins and Its Proteomic Consequences , 2005, PLoS Comput. Biol..

[10]  G. Böhm,et al.  The stability of proteins in extreme environments. , 1998, Current opinion in structural biology.

[11]  E. Rocha Codon usage bias from tRNA's point of view: redundancy, specialization, and efficient decoding for translation optimization. , 2004, Genome research.

[12]  J. Mortimer,et al.  Optimum growth temperature and the base composition of open reading frames in prokaryotes , 2003, Extremophiles.

[13]  G. Böhm,et al.  Thermostability of proteins from Thermotoga maritima. , 2001, Methods in enzymology.

[14]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[15]  R. Nussinov,et al.  Factors enhancing protein thermostability. , 2000, Protein engineering.

[16]  K Watanabe,et al.  Archaeal adaptation to higher temperatures revealed by genomic sequence of Thermoplasma volcanium. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[17]  D Eisenberg,et al.  Transproteomic evidence of a loop-deletion mechanism for enhancing protein thermostability. , 1999, Journal of molecular biology.

[18]  G. Böhm,et al.  Stabilization of creatinase from Pseudomonas putida by random mutagenesis , 1993, Protein science : a publication of the Protein Society.

[19]  K. S. Yip,et al.  Protein thermostability above 100 degreesC: a key role for ionic interactions. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Robert B. Russell,et al.  GlobPlot: exploring protein sequences for globularity and disorder , 2003, Nucleic Acids Res..

[21]  P. Sharp,et al.  Variation in the strength of selected codon usage bias among bacteria , 2005, Nucleic acids research.

[22]  G. Olsen,et al.  Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Satoshi Fukuchi,et al.  Compositional changes in RNA, DNA and proteins for bacterial adaptation to higher and lower temperatures. , 2003, Journal of biochemistry.

[24]  Adam Godzik,et al.  Contribution of electrostatic interactions, compactness and quaternary structure to protein thermostability: lessons from structural genomics of Thermotoga maritima. , 2006, Journal of molecular biology.

[25]  M. R. Parsons,et al.  Crystal structure of intact elongation factor EF-Tu from Escherichia coli in GDP conformation at 2.05 A resolution. , 1999, Journal of molecular biology.

[26]  J Schultz,et al.  SMART, a simple modular architecture research tool: identification of signaling domains. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  J O McInerney,et al.  Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  A. Moya,et al.  Mutational and selective pressures on codon and amino acid usage in Buchnera, endosymbiotic bacteria of aphids. , 2003, Genome research.

[29]  K. S. Yip,et al.  Protein thermostability above 100°C: A key role for ionic interactions , 1998 .

[30]  István Simon,et al.  Preformed structural elements feature in partner recognition by intrinsically unstructured proteins. , 2004, Journal of molecular biology.

[31]  K. Umesono,et al.  Directional mutation pressure and transfer RNA in choice of the third nucleotide of synonymous two-codon sets. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[32]  D. Eisenberg,et al.  Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure. , 1983, Journal of molecular biology.

[33]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[34]  C. Dutta,et al.  Compositional variation in bacterial genes and proteins with potential expression level , 2005, FEBS letters.

[35]  Harald Huber,et al.  A new phylum of Archaea represented by a nanosized hyperthermophilic symbiont , 2002, Nature.

[36]  Dieter Söll,et al.  The genome of Nanoarchaeum equitans: Insights into early archaeal evolution and derived parasitism , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Timothy A. Whitehead,et al.  Minimal protein-folding systems in hyperthermophilic archaea , 2004, Nature Reviews Microbiology.

[38]  C. Dutta,et al.  Evolutionary Constraints on Codon and Amino Acid Usage in Two Strains of Human Pathogenic Actinobacteria Tropheryma whipplei , 2006, Journal of Molecular Evolution.

[39]  N. Moran Accelerated evolution and Muller's rachet in endosymbiotic bacteria. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[40]  N. Moran,et al.  Lifestyle evolution in symbiotic bacteria: insights from genomics. , 2000, Trends in ecology & evolution.

[41]  Hervé Seligmann,et al.  Cost-Minimization of Amino Acid Usage , 2003, Journal of Molecular Evolution.

[42]  H. Musto,et al.  The effect of expression levels on codon usage in Plasmodium falciparum , 2004, Parasitology.

[43]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[44]  C. Dutta,et al.  Codon and amino acid usage in two major human pathogens of genus Bartonella--optimization between replicational-transcriptional selection, translational control and cost minimization. , 2005, DNA research : an international journal for rapid publication of reports on genes and genomes.

[45]  H. Weiner,et al.  Crystallization and preliminary X-ray investigation of bovine liver mitochondrial aldehyde dehydrogenase. , 1992, Journal of molecular biology.

[46]  R. Bernander Chromosome replication, nucleoid segregation and cell division in archaea. , 2000, Trends in microbiology.

[47]  C. Vieille,et al.  Bivalent cations and amino-acid composition contribute to the thermostability of Bacillus licheniformis xylose isomerase. , 2001, European journal of biochemistry.

[48]  A. R. Merchant,et al.  High guanine–cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[49]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[50]  V. Tumanyan,et al.  Representation of amino acid sequences in terms of interaction energy in protein globules , 1997, FEBS letters.

[51]  J. Gibrat,et al.  GOR method for predicting protein secondary structure from amino acid sequence. , 1996, Methods in enzymology.

[52]  J. Lobry,et al.  Relationships Between Genomic G+C Content, RNA Secondary Structures, and Optimal Growth Temperature in Prokaryotes , 1997, Journal of Molecular Evolution.

[53]  E. Nevo,et al.  Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[54]  L. Wernisch,et al.  Solving the riddle of codon usage preferences: a test for translational selection. , 2004, Nucleic acids research.

[55]  D. Lynn,et al.  Synonymous codon usage is subject to selection in thermophilic bacteria. , 2002, Nucleic acids research.

[56]  D. Hickey,et al.  Evidence for strong selective constraint acting on the nucleotide composition of 16S ribosomal RNA genes. , 2002, Nucleic acids research.

[57]  G. Böhm,et al.  [33] Thermostability of proteins from Thermotoga maritima , 2001 .

[58]  A. Suyama,et al.  Local stability of DNA and RNA secondary structure and its relation to biological functions. , 1986, Progress in biophysics and molecular biology.

[59]  C. Gautier,et al.  Hydrophobicity, expressivity and aromaticity are the major trends of amino-acid usage in 999 Escherichia coli chromosome-encoded genes. , 1994, Nucleic acids research.

[60]  Jürgen Gadau,et al.  The genome sequence of Blochmannia floridanus: Comparative analysis of reduced genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[61]  E. Querol,et al.  Analysis of protein conformational characteristics related to thermostability. , 1996, Protein engineering.

[62]  M. Gromiha,et al.  Important inter-residue contacts for enhancing the thermal stability of thermophilic proteins. , 2001, Biophysical chemistry.

[63]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.