Model-Free RNA Sequence and Structure Alignment Informed by SHAPE Probing Reveals a Conserved Alternate Secondary Structure for 16S rRNA

Discovery and characterization of functional RNA structures remains challenging due to deficiencies in de novo secondary structure modeling. Here we describe a dynamic programming approach for model-free sequence comparison that incorporates high-throughput chemical probing data. Based on SHAPE probing data alone, ribosomal RNAs (rRNAs) from three diverse organisms – the eubacteria E. coli and C. difficile and the archeon H. volcanii – could be aligned with accuracies comparable to alignments based on actual sequence identity. When both base sequence identity and chemical probing reactivities were considered together, accuracies improved further. Derived sequence alignments and chemical probing data from protein-free RNAs were then used as pseudo-free energy constraints to model consensus secondary structures for the 16S and 23S rRNAs. There are critical differences between these experimentally-informed models and currently accepted models, including in the functionally important neck and decoding regions of the 16S rRNA. We infer that the 16S rRNA has evolved to undergo large-scale changes in base pairing as part of ribosome function. As high-quality RNA probing data become widely available, structurally-informed sequence alignment will become broadly useful for de novo motif and function discovery.

[1]  J. Holton,et al.  Crystal structure of the bacterial ribosome from Escherichia coli at 3.5 A resolution. , 2014 .

[2]  J. Doudna,et al.  Insights into RNA structure and function from genome-wide studies , 2014, Nature Reviews Genetics.

[3]  Steven Busan,et al.  RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP) , 2014, Nature Methods.

[4]  Sean R Eddy,et al.  Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. , 2014, Annual review of biophysics.

[5]  Ronny Lorenz,et al.  Predicting RNA structure: advances and limitations. , 2014, Methods in molecular biology.

[6]  Kristen K. Dang,et al.  Comparison of SIV and HIV-1 Genomic RNA Structures Reveals Impact of Sequence Evolution on Conserved and Non-Conserved Structural Motifs , 2013, PLoS pathogens.

[7]  D. Mathews,et al.  Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots , 2013, Proceedings of the National Academy of Sciences.

[8]  R. Gutell,et al.  Structural Constraints Identified with Covariation Analysis in Ribosomal RNA , 2012, PloS one.

[9]  C. Waters,et al.  Cyclic Diguanylate Inversely Regulates Motility and Aggregation in Clostridium difficile , 2012, Journal of bacteriology.

[10]  David H Mathews,et al.  RNA structure prediction: an overview of methods. , 2012, Methods in molecular biology.

[11]  K. Weeks,et al.  Exploring RNA structural codes with SHAPE chemistry. , 2011, Accounts of chemical research.

[12]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[13]  Zhili Xu,et al.  Differential assembly of 16S rRNA domains during 30S subunit formation. , 2010, RNA.

[14]  Morgan C. Giddings,et al.  Influence of nucleotide identity on ribose 2'-hydroxyl reactivity in RNA. , 2009, RNA.

[15]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[16]  D. Mathews,et al.  Accurate SHAPE-directed RNA structure determination , 2009, Proceedings of the National Academy of Sciences.

[17]  Sebastian Will,et al.  RNAalifold: improved consensus structure prediction for RNA alignments , 2008, BMC Bioinformatics.

[18]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[19]  K. Weeks,et al.  A fast-acting reagent for accurate analysis of RNA secondary and tertiary structure by SHAPE chemistry. , 2007, Journal of the American Chemical Society.

[20]  E. Westhof,et al.  The interaction networks of structured RNAs. , 2006, Nucleic acids research.

[21]  Andrew D Griffiths,et al.  Amplification of complex gene libraries by emulsion PCR , 2006, Nature Methods.

[22]  J. Holton,et al.  Structures of the Bacterial Ribosome at 3.5 Å Resolution , 2005, Science.

[23]  K. Weeks,et al.  RNA structure analysis at single nucleotide resolution by selective 2'-hydroxyl acylation and primer extension (SHAPE). , 2005, Journal of the American Chemical Society.

[24]  G. Culver,et al.  Mapping structural differences between 30S ribosomal subunit assembly intermediates , 2004, Nature Structural &Molecular Biology.

[25]  Sean R. Eddy,et al.  RSEARCH: Finding homologs of single structured RNA sequences , 2003, BMC Bioinformatics.

[26]  R. Gutell,et al.  The accuracy of ribosomal RNA comparative structure models. , 2002, Current opinion in structural biology.

[27]  Nan Yu,et al.  The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs , 2002, BMC Bioinformatics.

[28]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[29]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[30]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[31]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[32]  G. Stormo,et al.  Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods. , 1992, Nucleic acids research.

[33]  E. Westhof,et al.  Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. , 1990, Journal of molecular biology.

[34]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[35]  B. Ganem RNA world , 1987, Nature.

[36]  H. Noller,et al.  Interconversion of active and inactive 30 S ribosomal subunits is accompanied by a conformational change in the decoding region of 16 S rRNA. , 1986, Journal of molecular biology.

[37]  O. Gotoh An improved algorithm for matching biological sequences. , 1982, Journal of molecular biology.

[38]  C. Smith,et al.  Transferable tetracycline resistance in Clostridium difficile , 1981, Antimicrobial Agents and Chemotherapy.

[39]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .