Statistical inference for well-ordered structures in nucleotide sequences

Distinct, local structures are frequently correlated with functional RNA elements involved in post-transcriptional regulation of gene expression. Discovery of microRNAs (miRNAs) suggests that there are a large class of small noncoding RNAs in eukaryotic genomes. These miRNAs have the potential to form distinct fold-back stem-loop structures. The prediction of those well-ordered folding sequences (WFS) in genomic sequences is very helpful for our understanding of RNA-based gene regulation and the determination of local RNA elements with structure-dependent functions. In this study, we describe a novel method for discovering the local WFS in a nucleotide sequence by Monte Carlo simulation and RNA folding. In the approach the quality of a local WFS is assessed by the energy difference (E/sub diff/) between the optimal structure folded in the local segment and its corresponding optimal, restrained structure where all the previous base pairings formed in the optimal structure are prohibited. Distinct WFS can be discovered by scanning successive segments along a sequence for evaluating the difference between Ediff of the natural sequence and those computed from randomly shuffled sequences. Our results indicate that the statistically significant WFS detected in the genomic sequences of Caenorhabditis elegans (C.elegans) F49E12, T07C5, T07D1, T10H9, Y56A3A and Y71G12B are coincident with known fold-back stem-loops found in miRNA precursors. The potential and implications of our method in searching for miRNAs in genomes is discussed.

[1]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[2]  J. Mattick Non‐coding RNAs: the architects of eukaryotic complexity , 2001, EMBO reports.

[3]  Shirley Dex,et al.  JR 旅客販売総合システム(マルス)における運用及び管理について , 1991 .

[4]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[5]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[6]  I. Brierley,et al.  Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot , 1989, Cell.

[7]  C. Smibert,et al.  Translational regulation of maternal mRNAs. , 1996, Current opinion in genetics & development.

[8]  S. Le,et al.  The HIV-1 rev trans-activator acts through a structured target sequence to activate nuclear export of unspliced viral mRNA , 1989, Nature.

[9]  Jih-Hsiang Chen,et al.  Discovering well-ordered folding patterns in nucleotide sequences , 2003, Bioinform..

[10]  P. Hraber,et al.  Estimating the Contributions of Selection and Self-Organization in RNA Secondary Structure , 1999, Journal of Molecular Evolution.

[11]  R D Klausner,et al.  A model for the structure and functions of iron-responsive elements. , 1988, Gene.

[12]  R. Simons,et al.  RNA structure and function , 1998 .

[13]  H. Lipshitz,et al.  RNA localization in development. , 1998, Annual review of biochemistry.

[14]  S. Eddy Non–coding RNA genes and the modern RNA world , 2001, Nature Reviews Genetics.

[15]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[16]  D. Draper,et al.  Strategies for RNA folding. , 1996, Trends in biochemical sciences.

[17]  J. Dubnau,et al.  RNA recognition and translational regulation by a homeodomain protein , 1996, Nature.

[18]  G. Ruvkun,et al.  A uniform system for microRNA annotation. , 2003, RNA.

[19]  L. Lim,et al.  An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans , 2001, Science.

[20]  P. Sarnow,et al.  Internal ribosome entry sites in eukaryotic mRNA molecules. , 2001, Genes & development.

[21]  Kaizhong Zhang,et al.  RNA molecules with structure dependent functions are uniquely folded. , 2002, Nucleic acids research.

[22]  Guy Riddihough The Other RNA World , 2002, Science.