Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos

Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily nor do they display under-represented features. We introduce two extensions to address these needs: function logos and inverse logos. Function logos display subfunctions that are over-represented among sequences carrying a specific feature. Inverse logos generalize both sequence logos and function logos by displaying under-represented, rather than over-represented, features or functions in structural alignments. To make inverse logos, a compositional inverse is applied to the feature or function frequency distributions before logo construction, where a compositional inverse is a mathematical transform that makes common features or functions rare and vice versa. We applied these methods to a database of structurally aligned bacterial tDNAs to create highly condensed, birds-eye views of potentially all so-called identity determinants and antideterminants that confer specific amino acid charging or initiator function on tRNAs in bacteria. We recovered both known and a few potentially novel identity elements. Function logos and inverse logos are useful tools for exploratory bioinformatic analysis of structure-function relationships in sequence families and superfamilies.

[1]  D. Crothers,et al.  Is there a discriminator site in transfer RNA? , 1972, Proceedings of the National Academy of Sciences of the United States of America.

[2]  R. Roberts Staphylococcal transfer ribonucleic acids. II. Sequence analysis of isoaccepting glycine transfer ribonucleic acids IA and IB from Staphylococcus epidermidis Texas 26. , 1974, The Journal of biological chemistry.

[3]  Mathias Sprinzl,et al.  Compilation of tRNA sequences , 1979, Nucleic Acids Res..

[4]  R. Cedergren,et al.  Structure in tRNA data. , 1982, Biochimie.

[5]  Mathias Sprinzl,et al.  Compilation of tRNA sequences , 1983, Nucleic Acids Res..

[6]  Gary D. Stormo,et al.  Delila system tools , 1984, Nucleic Acids Res..

[7]  T. D. Schneider,et al.  Information content of binding sites on nucleotide sequences. , 1986, Journal of molecular biology.

[8]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[9]  H B Nicholas,et al.  Differences between transfer RNA molecules. , 1987, Journal of molecular biology.

[10]  H. B. Nicholas,et al.  An algorithm for discriminating sequences and its application to yeast transfer RNA , 1987, Comput. Appl. Biosci..

[11]  W. McClain,et al.  Changing the acceptor identity of a transfer RNA by altering nucleotides in a "variable pocket". , 1988, Science.

[12]  Paul Schimmel,et al.  A simple structural feature is a major determinant of the identity of a transfer RNA , 1988, Nature.

[13]  D. Söll,et al.  Discrimination between glutaminyl-tRNA synthetase and seryl-tRNA synthetase involves nucleotides in the acceptor helix of tRNA. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[14]  P. Schimmel,et al.  Evidence that a major determinant for the identity of a transfer RNA is conserved in evolution. , 1989, Biochemistry.

[15]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[16]  D. Söll,et al.  Anticodon and acceptor stem nucleotides in tRNAGln are major recognition elements for E. coli glutaminyl-tRNA synthetase , 1991, Nature.

[17]  H. Himeno,et al.  Identity determinants of E. coli tryptophan tRNA. , 1991, Nucleic acids research.

[18]  W. McClain,et al.  Four sites in the acceptor helix and one site in the variable pocket of tRNA(Ala) determine the molecule's acceptor identity. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[19]  J. Abelson,et al.  Eight base changes are sufficient to convert a leucine-inserting tRNA into a serine-inserting tRNA. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  D. Söll,et al.  Recognition of bases in Escherichia coli tRNA(Gln) by glutaminyl‐tRNA synthetase: a complete identity set. , 1992, The EMBO journal.

[21]  C. Francklyn,et al.  Overlapping nucleotide determinants for specific aminoacylation of RNA microhelices. , 1992, Science.

[22]  Switching tRNA(Gln) identity from glutamine to tryptophan. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[23]  C. Green,et al.  Staphylococcus aureus has clustered tRNA genes , 1993, Journal of bacteriology.

[24]  Identity elements of tRNA(Trp). Identification and evolutionary conservation. , 1993, The Journal of biological chemistry.

[25]  W. McClain Transfer RNA identity , 1993, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[26]  Sergey Steinberg,et al.  Compilation of tRNA sequences and sequences of tRNA genes , 2004, Nucleic Acids Res..

[27]  I. Willis,et al.  Analysis of acceptor stem base pairing on tRNA(Trp) aminoacylation and function in vivo. , 1994, The Journal of biological chemistry.

[28]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[29]  N. Nameki Identity elements of tRNA(Thr) towards Saccharomyces cerevisiae threonyl-tRNA synthetase. , 1995, Nucleic acids research.

[30]  D. Söll,et al.  Interactions between tRNA identity nucleotides and their recognition sites in glutaminyl-tRNA synthetase determine the cognate amino acid affinity of the enzyme. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[31]  O. Nureki,et al.  Major identity determinants in the "augmented D helix" of tRNA(Glu) from Escherichia coli. , 1996, Journal of molecular biology.

[32]  Gary D. Stormo,et al.  Displaying the information contents of structural RNA alignments: the structure logos , 1997, Comput. Appl. Biosci..

[33]  S. Salzberg,et al.  Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi , 1997, Nature.

[34]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[35]  Mark Borodovsky,et al.  The complete genome sequence of the gastric pathogen Helicobacter pylori , 1997, Nature.

[36]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[37]  R Giegé,et al.  Universal rules and idiosyncratic features in tRNA identity. , 1998, Nucleic acids research.

[38]  K. Musier-Forsyth,et al.  Transfer RNA recognition by aminoacyl‐tRNA synthetases , 1999, Biopolymers.

[39]  M. Liu,et al.  Synthetase recognition determinants of E. coli valine transfer RNA. , 1999, Biochemistry.

[40]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[41]  Henri Grosjean,et al.  tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features. , 2002, RNA.

[42]  Dean Laslett,et al.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. , 2004, Nucleic acids research.

[43]  Dieter Söll,et al.  Aminoacyl-tRNAs: setting the limits of the genetic code. , 2004, Genes & development.

[44]  David H. Ardell,et al.  TFAM detects co-evolution of tRNA identity rules with lateral transfer of histidyl-tRNA synthetase , 2006, Nucleic acids research.