Structural Constraints Identified with Covariation Analysis in Ribosomal RNA

Covariation analysis identifies positions with similar patterns of sequence variation in an alignment of RNA sequences. Such constraints on the evolution of two positions are usually associated with a base pair in a helix. While mutual information (MI) has been used to accurately predict RNA secondary structure and a few tertiary interactions, early studies revealed that phylogenetic event counting methods are more sensitive and provide extra confidence in the prediction of base pairs. We developed a novel phylogenetic event counting (PEC) method for quantifying positional covariation with the Gutell lab’s new RNA Comparative Analysis Database (rCAD). The PEC and MI-based methods each identify unique base pairs, and jointly identify many others. Used together with an N-best and helix-extension strategy, the two methods identify the maximal number of base pairs. While covariation methods have effectively and accurately predicted RNA secondary structure, only a few tertiary structure base pairs have been identified. Analyses presented herein and at the Gutell lab’s Comparative RNA Web (CRW) Site reveal that the majority of these tertiary base pairs do not covary with one another. However, covariation analysis does reveal a weaker, although significant, covariation between sets of nucleotides that are in proximity in the three-dimensional RNA structure. Thus covariation analysis identifies other types of structural constraints beyond the two nucleotides that form a base pair.
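
As an illustration of the MI-based side of the comparison, the following is a minimal sketch of mutual information between two alignment columns. It is not the authors' implementation and does not reproduce the PEC method or rCAD. The function name `mutual_information` and the toy alignment are hypothetical, and gap handling and phylogenetic weighting are deliberately left out.

```python
from collections import Counter
from math import log2


def mutual_information(col_x, col_y):
    """Mutual information (in bits) between two alignment columns.

    Each column is a string holding one nucleotide per sequence.
    Illustrative sketch only: gaps and sequence weighting are ignored.
    """
    if len(col_x) != len(col_y):
        raise ValueError("columns must contain the same number of sequences")
    n = len(col_x)
    freq_x = Counter(col_x)                 # single-column counts
    freq_y = Counter(col_y)
    freq_xy = Counter(zip(col_x, col_y))    # joint counts of observed pairs
    mi = 0.0
    for (a, b), count in freq_xy.items():
        p_ab = count / n
        p_a = freq_x[a] / n
        p_b = freq_y[b] / n
        mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi


# Hypothetical toy alignment: positions 0 and 4 vary together as
# Watson-Crick pairs, while positions 1 to 3 are fully conserved.
alignment = ["GCAUC", "ACAUU", "CCAUG", "UCAUA"]
columns = ["".join(seq[i] for seq in alignment) for i in range(len(alignment[0]))]

covarying = mutual_information(columns[0], columns[4])
conserved = mutual_information(columns[1], columns[4])
```

In this toy case the covarying pair of columns scores higher than a pair that includes a conserved column, because a column with no variation carries no information about any other column. A count of this kind treats every sequence as independent, which is one reason a method that counts changes along a phylogeny can behave differently.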
