Core promoter information content correlates with optimal growth temperature

The subtle mechanisms by which protein-DNA interactions remain functional across a wide range of temperatures are largely unknown. In this work, we manually curated available information relating fully sequenced archaeal genomes with organism growth temperatures. We built a motif that represents the core promoter of each species and calculated its information content. We then studied the relation between optimal growth temperature (OGT) and information content (IC) in the promoter region.We found a positive correlation between G + C content and OGT in tRNA regions and not in overall genome. Furthermore, we found that there is a positive correlation between information content and optimal growth temperatures in Archaea. This can’t be explained by an increased C+G composition nor by other obvious mechanisms. These findings suggest that increased information content could produce a positive fitness in organisms living at high temperatures. We suggest that molecular information theory may need to be adapted for hyperthermophiles.

[1]  M. Gouy,et al.  Adaptation to environmental temperature is a major determinant of molecular evolutionary rates in archaea. , 2011, Molecular biology and evolution.

[2]  J. Soppa Basal and regulated transcription in archaea. , 2001, Advances in applied microbiology.

[3]  A. R. Merchant,et al.  High guanine–cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[4]  T. Haran,et al.  Signals for TBP/TATA box recognition. , 2000, Journal of molecular biology.

[5]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[6]  Boyke Bunk,et al.  BacDive—the Bacterial Diversity Metadatabase , 2013, Nucleic Acids Res..

[7]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[8]  J. P. Harrison,et al.  The limits for life under multiple extremes. , 2013, Trends in microbiology.

[9]  E. Susko,et al.  On the correlation between genomic G+C content and optimal growth temperature in prokaryotes: data quality and confounding factors. , 2006, Biochemical and biophysical research communications.

[10]  T. D. Schneider,et al.  Evolution of biological information. , 2000, Nucleic acids research.

[11]  T. D. Schneider,et al.  Information content of binding sites on nucleotide sequences. , 1986, Journal of molecular biology.

[12]  Sławomir Jabłoński,et al.  Methanogenic archaea database containing physiological and biochemical characteristics. , 2015, International journal of systematic and evolutionary microbiology.

[13]  Andreas Wagner,et al.  Growth Temperature and Genome Size in Bacteria Are Negatively Correlated, Suggesting Genomic Streamlining During Thermal Adaptation , 2013, Genome biology and evolution.

[14]  A. Nadra,et al.  Indirect DNA readout on the protein side: coupling between histidine protonation, global structural cooperativity, dynamics, and DNA binding of the human papillomavirus type 16 E2C domain. , 2009, Journal of molecular biology.

[15]  G. Bernardi,et al.  Genomic GC level, optimal growth temperature, and genome size in prokaryotes. , 2006, Biochemical and biophysical research communications.

[16]  R. Kornberg The molecular basis of eukaryotic transcription , 2007, Proceedings of the National Academy of Sciences.

[17]  Dean Laslett,et al.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. , 2004, Nucleic acids research.

[18]  R. Mann,et al.  Origins of specificity in protein-DNA recognition. , 2010, Annual review of biochemistry.

[19]  T. D. Schneider,et al.  70% efficiency of bistate molecular machines explained by information theory, high dimensional geometry and evolutionary convergence , 2010, Nucleic acids research.

[20]  P. Blum Archaea : ancient microbes, extreme environments, and the origin of life , 2001 .

[21]  Scott Mann,et al.  Bacterial genomic G+C composition-eliciting environmental adaptation. , 2010, Genomics.

[22]  T. D. Schneider,et al.  Consensus sequence Zen. , 2002, Applied bioinformatics.

[23]  Songnian Hu,et al.  On the molecular mechanism of GC content variation among eubacterial genomes , 2012, Biology Direct.