Computational Identification of Three-Way Junctions in Folded RNAs: A Case Study in Arabidopsis

Three-way junctions in folded RNAs have been investigated both experimentally and computationally. The interest in their analysis stems from the fact that they have significantly been found to possess a functional role. In recent work, three-way junctions have been categorized into families depending on the relative lengths of the segments linking the three helices. Here, based on ideas originating from computational geometry, an algorithm is proposed for detecting three-way junctions in data sets of genes that are related to a metabolic pathway of interest. In its current implementation, the algorithm relies on a moving window that performs energy minimization folding predictions, and is demonstrated on a set of genes that are involved in purine metabolism in plants. The pattern matching algorithm can be extended to other organisms and other metabolic cycles of interest in which three-way junctions have been or will be discovered to play an important role. In the test case presented here with, the computational prediction of a three-way junction in Arabidopsis that was speculated to have an interesting functional role is verified experimentally.

[1]  Durbin,et al.  Biological Sequence Analysis , 1998 .

[2]  M. Gelfand,et al.  Riboswitches: the oldest mechanism for the regulation of gene expression? , 2004, Trends in genetics : TIG.

[3]  Sean R. Eddy,et al.  RSEARCH: Finding homologs of single structured RNA sequences , 2003, BMC Bioinformatics.

[4]  Jeffrey E. Barrick,et al.  New RNA motifs suggest an expanded scope for riboswitches in bacterial genetic control. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  G. Mauri,et al.  An algorithm for finding conserved secondary structure motifs in unaligned RNA sequences , 2008, Journal of Computer Science and Technology.

[6]  D. Ecker,et al.  RNAMotif, an RNA secondary structure definition and search algorithm. , 2001, Nucleic acids research.

[7]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[8]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[9]  Jeffrey E. Barrick,et al.  Metabolite-binding RNA domains are present in the genes of eukaryotes. , 2003, RNA.

[10]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[11]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[12]  H. Ashihara,et al.  Purine and pyrimidine nucleotide metabolism in higher plants. , 2003, Journal of plant physiology.

[13]  R. Breaker,et al.  Genetic Control by Metabolite‐Binding Riboswitches , 2003, Chembiochem : a European journal of chemical biology.

[14]  R. Breaker,et al.  Thiamine pyrophosphate riboswitches are targets for the antimicrobial compound pyrithiamine. , 2005, Chemistry & biology.

[15]  W. Pearson Empirical statistical estimates for sequence similarity searches. , 1998, Journal of molecular biology.

[16]  A. Paterson,et al.  Nucleotide Metabolism: An Introduction , 1973 .

[17]  Stephen R Holbrook,et al.  RNA structure: the long and the short of it , 2005, Current Opinion in Structural Biology.

[18]  M. Gelfand,et al.  Comparative Genomics of the Vitamin B12 Metabolism and Regulation in Prokaryotes* , 2003, Journal of Biological Chemistry.

[19]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[20]  Ronald R. Breaker,et al.  Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression , 2002, Nature.

[21]  Nobuo Yamashita,et al.  Thiamine‐regulated gene expression of Aspergillus oryzae thiA requires splicing of the intron containing a riboswitch‐like domain in the 5′‐UTR , 2003, FEBS letters.

[22]  R. Montange,et al.  Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine , 2004, Nature.

[23]  A. Norman,et al.  Biosynthesis of purine nucleotides in chick liver and intestine. , 1969, Biochimica et biophysica acta.

[24]  M. Zuker On finding all suboptimal foldings of an RNA molecule. , 1989, Science.

[25]  B. Haas,et al.  Searching Genomes for Noncoding RNA Using FastR , 2005, TCBB.

[26]  Zasha Weinberg,et al.  CMfinder - a covariance model based RNA motif finding algorithm , 2006, Bioinform..

[27]  D. Ebbole,et al.  Cloning and characterization of a 12-gene cluster from Bacillus subtilis encoding nine enzymes for de novo purine nucleotide synthesis. , 1987, The Journal of biological chemistry.

[28]  D. Patel,et al.  Adaptive recognition by nucleic acid aptamers. , 2000, Science.

[29]  Mark Stitt,et al.  Pyrimidine and purine biosynthesis and degradation in plants. , 2006, Annual review of plant biology.

[30]  Klara Kedem,et al.  STR2: A structure to string approach for locating G-box riboswitch shapes in pre-selected genes , 2004, Silico Biol..

[31]  Sean R. Eddy,et al.  Rfam: an RNA family database , 2003, Nucleic Acids Res..

[32]  R. Breaker,et al.  An mRNA structure that controls gene expression by binding FMN , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Evgeny Nudler,et al.  Flipping Riboswitches , 2006, Cell.

[34]  A. Serganov,et al.  Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. , 2004, Chemistry & biology.

[35]  Roded Sharan,et al.  A sequence-based filtering method for ncRNA identification and its application to searching for riboswitch elements , 2006, ISMB.

[36]  D. Verma,et al.  Structural organization of de novo purine biosynthesis enzymes in plants: 5-aminoimidazole ribonucleotide carboxylase and 5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase cDNAs from Vigna aconitifolia , 2004, Plant Molecular Biology.

[37]  Dan Mercola,et al.  A Glycine-Dependent Riboswitch That Uses Cooperative Binding to Control Gene Expression , 2004 .

[38]  F. Neidhardt,et al.  Escherichia Coli and Salmonella: Typhimurium Cellular and Molecular Biology , 1987 .

[39]  Jeffrey E. Barrick,et al.  Riboswitches Control Fundamental Biochemical Pathways in Bacillus subtilis and Other Bacteria , 2003, Cell.

[40]  Bo-Yen Chou,et al.  尋找在為排比的核醣核酸的共同二級結構特徵區塊之演算法; An algorithm for finding common secondary structure motifs in unaligned RNA sequences , 2005 .

[41]  E. Nudler,et al.  The riboswitch control of bacterial metabolism. , 2004, Trends in biochemical sciences.

[42]  J. M. Buchanan CHAPTER 35 – Biosynthesis of Purine Nucleotides* , 1960 .

[43]  Bruce A. Shapiro,et al.  An algorithm for comparing multiple RNA secondary structures , 1988, Comput. Appl. Biosci..

[44]  E. Westhof,et al.  Topology of three-way junctions in folded RNAs. , 2006, RNA.

[45]  Margaret S. Ebert,et al.  An mRNA structure in bacteria that controls gene expression by binding lysine. , 2003, Genes & development.

[46]  Evgeny Nudler,et al.  Sensing Small Molecules by Nascent RNA A Mechanism to Control Transcription in Bacteria , 2002, Cell.

[47]  R. Breaker,et al.  Gene regulation by riboswitches , 2004, Nature Reviews Molecular Cell Biology.

[48]  S. Henikoff Multifunctional polypeptides for purine de novo synthesis , 1987, BioEssays : news and reviews in molecular, cellular and developmental biology.

[49]  P. Szankasi,et al.  DNA sequence analysis of the ade6 gene of Schizosaccharomyces pombe. Wild-type and mutant alleles including the recombination host spot allele ade6-M26. , 1988, Journal of molecular biology.