A High-Throughput Approach for Associating microRNAs with Their Activity Conditions

Plant microRNAs (miRNAs) are short RNA sequences that bind to target mRNAs and change their expression levels by redirecting their stabilities and marking them for cleavage. In Arabidopsis thaliana, microRNAs have been shown to regulate development and are believed to impact expression both under various conditions, such as stress and stimuli, as well as in specific tissue types. We present a high throughput approach for associating between microRNAs and conditions in which they act, using novel statistical and algorithmic techniques. Our new tool, miRNAXpress, at first computes a (binary) matrix T denoting the potential targets of microRNAs. Then, using T and an additional predefined matrix X indicating expression of genes under various conditions, it produces a new matrix that predicts associations between microRNAs and the conditions in which they act. Thus, the program comprises two main modules that work in tandem to compute the desired output. The first is an efficient target prediction engine that predicts mRNA targets of query microRNAs by evaluating the optimal duplex that could be formed between the two: given a short query RNA, a long target RNA, and a predefined energy cut-off threshold, the program finds and reports all putative binding sites of the query RNA in the target RNA with hybridization energy bounded by the predefined threshold. The second module realizes an association operation that is computed by a method which relies on an efficient t-test to compute the associations. The calculation of the matrix of microRNAs and their potential targets is the computationally intensive part of the work done by miRNAXpress, and therefore an efficient algorithm for this portion facilitates the entire process. Thus, the target prediction engine is based on an efficient approximate hybridization search algorithm whose efficiency is the result of utilizing the sparsity of the search space without sacrificing the optimality of the results. The time complexity of this algorithm is almost linear in the size of a sparse set of locations where base-pairs are stacked at a height of three or more. Thus miRNAXpress is a novel tool for associating between microRNAs and the conditions in which they act. We employed it to conduct a study, using the plant Arabidopsis thaliana as our model organism. By applying miRNAXpress to 98 microRNAs and 380 conditions, some biologically interesting and statistically strong relations were discovered. For example, mir159C activity is possibly a factor in the misresponse of nph4 mutants to phototropic stimulations.

[1]  Eugene W. Myers,et al.  Chaining multiple-alignment fragments in sub-quadratic time , 1995, SODA '95.

[2]  Julius Brennecke,et al.  Identification of Drosophila MicroRNA Targets , 2003, PLoS biology.

[3]  Temple F. Smith,et al.  Rapid dynamic programming algorithms for RNA secondary structure , 1986 .

[4]  David Eppstein,et al.  Sparse dynamic programming I: linear cost functions , 1992, JACM.

[5]  D. Bartel,et al.  MicroRNA-Directed Cleavage of HOXB8 mRNA , 2004, Science.

[6]  Alok Aggarwal,et al.  Notes on searching in multidimensional monotone arrays , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[7]  João Meidanis,et al.  Introduction to computational molecular biology , 1997 .

[8]  Baruch Schieber,et al.  On-line dynamic programming with applications to the prediction of RNA secondary structure , 1991, SODA '90.

[9]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[10]  Anton J. Enright,et al.  MicroRNA targets in Drosophila , 2003, Genome Biology.

[11]  D. Spaar,et al.  Plant virus infection development as affected by heavy metal stress , 2004 .

[12]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[13]  B. Reinhart,et al.  A biochemical framework for RNA silencing in plants. , 2003, Genes & development.

[14]  Z. Xie,et al.  Negative Feedback Regulation of Dicer-Like1 in Arabidopsis by microRNA-Guided mRNA Degradation , 2003, Current Biology.

[15]  Raffaele Giancarlo,et al.  Speeding up Dynamic Programming with Applications to Molecular Biology , 1989, Theor. Comput. Sci..

[16]  B. Reinhart,et al.  Prediction of Plant MicroRNA Targets , 2002, Cell.

[17]  Peter van Emde Boas,et al.  Design and implementation of an efficient priority queue , 1976, Mathematical systems theory.

[18]  Lawrence L. Larmore,et al.  The least weight subsequence problem , 1987, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[19]  David Eppstein,et al.  Speeding up dynamic programming , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[20]  Chris Chatfield,et al.  Statistics for Technology (A Course in Applied Statistics) , 1984 .

[21]  Diana V. Dugas,et al.  MicroRNA regulation of gene expression in plants. , 2004, Current opinion in plant biology.

[22]  Nikolaus Rajewsky,et al.  Computational identification of microRNA targets , 2004, Genome Biology.

[23]  C. Llave,et al.  Cleavage of Scarecrow-like mRNA Targets Directed by a Class of Arabidopsis miRNA , 2002, Science.

[24]  Javier F. Palatnik,et al.  Control of leaf morphogenesis by microRNAs , 2003, Nature.

[25]  E. Myers,et al.  Sequence comparison with concave weighting functions. , 1988, Bulletin of mathematical biology.

[26]  Edwards Allen,et al.  P1/HC-Pro, a viral suppressor of RNA silencing, interferes with Arabidopsis development and miRNA unction. , 2003, Developmental cell.

[27]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.