Regulatory context drives conservation of glycine riboswitch aptamers

In comparison to protein-coding sequences, the impact of mutation and natural selection on the sequence and function of non-coding RNA (ncRNA) genes is not well understood. Many ncRNA genes are narrowly distributed, occurring in only a few organisms, and appear to be rapidly evolving. Assessing ncRNAs poses several challenges that conventional phylogenetic approaches do not address well, including short sequence length, lack of primary sequence conservation, and the importance of secondary structure for biological function. Riboswitches are structured ncRNAs that directly interact with small molecules to regulate gene expression in bacteria. They typically consist of a ligand-binding domain (aptamer) whose folding changes drive changes in gene expression. The glycine riboswitch is among the best studied because of the widespread occurrence of a tandem aptamer arrangement (tandem), wherein two homologous aptamers interact with glycine and with each other to regulate gene expression. However, a significant proportion of glycine riboswitches consist of a single aptamer (singleton). Here we use graph clustering to circumvent the limitations of traditional phylogenetic analysis when studying the relationship between tandem and singleton glycine aptamers. Graph clustering allows a broader range of pairwise comparison measures to be used to assess aptamer similarity. Using this approach, we show that one aptamer of the tandem glycine riboswitch pair is typically much more highly conserved than the other, and that which aptamer is conserved depends on the regulated gene. Furthermore, our analysis reveals that singleton aptamers more closely resemble either the first or the second tandem aptamer, again depending on the regulated gene. Taken together, our findings suggest that tandem glycine riboswitches degrade into functional singletons, with the regulated gene(s) dictating which glycine-binding aptamer is conserved.
Author Summary

The glycine riboswitch is an ncRNA that regulates several distinct gene sets in bacteria. It occurs with either one aptamer (singleton) or two aptamers (tandem), each of which directly senses glycine. Which aptamer is more important for gene regulation, and how tandem and singleton aptamers differ functionally, are long-standing questions in the riboswitch field. Like many biologically functional RNAs, glycine aptamers require a specific 3D folded conformation. As a result, distantly related homologs show low primary sequence similarity and large differences in sequence length, which makes accurate multiple sequence alignments difficult to create and analyze. To better understand the relationship between tandem and singleton aptamers, we used a graph clustering approach that lets us compare aptamers using metrics that measure both sequence and structure similarity. Our investigation reveals that in tandem glycine riboswitches, one aptamer is more highly conserved than the other, and which aptamer is conserved depends on which gene(s) are regulated. Moreover, we find that many singleton glycine riboswitches likely originate from tandem riboswitches in which the ligand-binding site of the non-conserved aptamer has degraded over time.
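To make the idea of graph clustering on pairwise similarities concrete, the following is a minimal sketch and not the method used in this work. It assumes a stand-in similarity measure (Jaccard overlap of 3-mers, implemented in the hypothetical function `toy_similarity`), a fixed cutoff, and connected components as the clustering step (the hypothetical function `cluster`). The similarity function is the part that would be swapped for a measure combining sequence and structure.

```python
from itertools import combinations


def toy_similarity(a: str, b: str) -> float:
    """Stand-in pairwise measure (an assumption): Jaccard overlap of 3-mers."""
    kmers_a = {a[i:i + 3] for i in range(len(a) - 2)}
    kmers_b = {b[i:i + 3] for i in range(len(b) - 2)}
    union = kmers_a | kmers_b
    return len(kmers_a & kmers_b) / len(union) if union else 0.0


def cluster(seqs: dict, threshold: float) -> list:
    """Group sequence names into connected components of a similarity graph."""
    # Nodes are sequences; an edge joins two sequences whose pairwise
    # similarity meets the threshold.
    adjacency = {name: set() for name in seqs}
    for u, v in combinations(seqs, 2):
        if toy_similarity(seqs[u], seqs[v]) >= threshold:
            adjacency[u].add(v)
            adjacency[v].add(u)

    # Depth-first search collects each connected component once.
    seen = set()
    clusters = []
    for start in seqs:
        if start in seen:
            continue
        component = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        clusters.append(component)
    return clusters


# Invented toy sequences for illustration only; not real aptamers.
toy_seqs = {
    "apt_A": "GGAGAGUCC",
    "apt_B": "GGAGAGUCA",
    "apt_C": "UUUUCCCCAA",
}
toy_clusters = cluster(toy_seqs, threshold=0.5)
```

Because the graph is built only from pairwise scores, no multiple sequence alignment is required, which is the property that lets sequences of very different lengths be compared.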
