A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction

BackgroundDeep-sequencing has enabled the identification of large numbers of miRNAs and siRNAs, making the high-throughput target identification a main limiting factor in defining their function. In plants, several tools have been developed to predict targets, majority of them being trained on Arabidopsis datasets. An extensive and systematic evaluation has not been made for their suitability for predicting targets in species other than Arabidopsis. Nor, these have not been evaluated for their suitability for high-throughput target prediction at genome level.ResultsWe evaluated the performance of 11 computational tools in identifying genome-wide targets in Arabidopsis and other plants with procedures that optimized score-cutoffs for estimating targets. Targetfinder was most efficient [89% ‘precision’ (accuracy of prediction), 97% ‘recall’ (sensitivity)] in predicting ‘true-positive’ targets in Arabidopsis miRNA-mRNA interactions. In contrast, only 46% of true positive interactions from non-Arabidopsis species were detected, indicating low ‘recall’ values. Score optimizations increased the ‘recall’ to only 70% (corresponding ‘precision’: 65%) for datasets of true miRNA-mRNA interactions in species other than Arabidopsis. Combining the results of Targetfinder and psRNATarget delivers high true positive coverage, whereas the intersection of psRNATarget and Tapirhybrid outputs deliver highly ‘precise’ predictions. The large number of ‘false negative’ predictions delivered from non-Arabidopsis datasets by all the available tools indicate the diversity in miRNAs-mRNA interaction features between Arabidopsis and other species. A subset of miRNA-mRNA interactions differed significantly for features in seed regions as well as the total number of matches/mismatches.ConclusionAlthough, many plant miRNA target prediction tools may be optimized to predict targets with high specificity in Arabidopsis, such optimized thresholds may not be suitable for many targets in non-Arabidopsis species. More importantly, non-conventional features of miRNA-mRNA interaction may exist in plants indicating alternate mode of miRNA target recognition. Incorporation of these divergent features would enable next-generation of algorithms to better identify target interactions.

[1]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[2]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[3]  Terry Gaasterland,et al.  Prediction and identification of Arabidopsis thaliana microRNAs and their mRNA targets , 2004, Genome Biology.

[4]  Tobias Dezulian,et al.  Sequence and expression differences underlie functional specialization of Arabidopsis microRNAs miR159 and miR319. , 2007, Developmental cell.

[5]  Z. Yang,et al.  Computational identification of novel microRNAs and targets in Brassica napus , 2007, FEBS letters.

[6]  Daniel W. A. Buchan,et al.  A large-scale evaluation of computational protein function prediction , 2013, Nature Methods.

[7]  Vincent Moulton,et al.  Identification of grapevine microRNAs and their targets using high-throughput sequencing and degradome analysis. , 2010, The Plant journal : for cell and molecular biology.

[8]  Miguel A Andrade-Navarro,et al.  Identification of novel stem cell markers using gap analysis of gene expression data , 2007, Genome Biology.

[9]  N. Rajewsky microRNA target predictions in animals , 2006, Nature Genetics.

[10]  D. Bartel,et al.  Weak Seed-Pairing Stability and High Target-Site Abundance Decrease the Proficiency of lsy-6 and Other miRNAs , 2011, Nature Structural &Molecular Biology.

[11]  C. Llave,et al.  Cleavage of Scarecrow-like mRNA Targets Directed by a Class of Arabidopsis miRNA , 2002, Science.

[12]  Jason S. Cumbie,et al.  High-Throughput Sequencing of Arabidopsis microRNAs: Evidence for Frequent Birth and Death of MIRNA Genes , 2007, PloS one.

[13]  Baohong Zhang,et al.  Bioinformatics Applications Note Data and Text Mining Target-align: a Tool for Plant Microrna Target Identification , 2022 .

[14]  Heinz Saedler,et al.  The miRNA156/157 recognition element in the 3' UTR of the Arabidopsis SBP box gene SPL3 prevents early flowering by translational inhibition in seedlings. , 2007, The Plant journal : for cell and molecular biology.

[15]  A. Millar,et al.  Genetic and Molecular Approaches to Assess MicroRNA Function , 2012 .

[16]  E. Sontheimer,et al.  Origins and Mechanisms of miRNAs and siRNAs , 2009, Cell.

[17]  Tyler W. H. Backman,et al.  Update of ASRP: the Arabidopsis Small RNA Project database , 2007, Nucleic Acids Res..

[18]  Yves Van de Peer,et al.  TAPIR, a web server for the prediction of plant microRNA targets, including target mimics , 2010, Bioinform..

[19]  Ashwani Jha,et al.  Employing machine learning for reliable miRNA target identification in plants , 2011, BMC Genomics.

[20]  T. Unver,et al.  Genome-Wide Identification of miRNAs Responsive to Drought in Peach (Prunus persica) by High-Throughput Deep Sequencing , 2012, PloS one.

[21]  Shuigeng Zhou,et al.  imiRTP: An Integrated Method to Identifying miRNA-target Interactions in Arabidopsis thaliana , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[22]  Q Zou,et al.  Benchmark comparison of ab initio microRNA identification methods and software. , 2012, Genetics and molecular research : GMR.

[23]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[24]  Xuemei Chen,et al.  MicroRNAs Inhibit the Translation of Target mRNAs on the Endoplasmic Reticulum in Arabidopsis , 2013, Cell.

[25]  Baohong Zhang,et al.  Identification of soybean microRNAs and their targets , 2008, Planta.

[26]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[27]  Xiaofeng Cao,et al.  Degradome sequencing reveals endogenous small RNA targets in rice (Oryza sativa L. ssp. indica) , 2010, Frontiers in Biology.

[28]  Detlef Weigel,et al.  Gene silencing in plants using artificial microRNAs and other small RNAs. , 2008, The Plant journal : for cell and molecular biology.

[29]  D. Bartel,et al.  Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. , 2004, Molecular cell.

[30]  Meng Wang,et al.  PsRobot: a web-based plant small RNA meta-analysis toolbox , 2012, Nucleic Acids Res..

[31]  Sean R Eddy,et al.  How do RNA folding algorithms work? , 2004, Nature Biotechnology.

[32]  R. Sunkar MicroRNAs in Plant Development and Stress Responses , 2012, Signaling and Communication in Plants.

[33]  Jan Krüger,et al.  RNAhybrid: microRNA target prediction easy, fast and flexible , 2006, Nucleic Acids Res..

[34]  Nicolas Bouché,et al.  microRNA-directed cleavage and translational repression of the copper chaperone for superoxide dismutase mRNA in Arabidopsis. , 2010, The Plant journal : for cell and molecular biology.

[35]  Robert Powers,et al.  Protein NMR recall, precision, and F-measure scores (RPF scores): structure quality assessment measures based on information retrieval statistics. , 2005, Journal of the American Chemical Society.

[36]  A. Komamine,et al.  RNAi and Plant Gene Function Analysis , 2011, Methods in Molecular Biology.

[37]  M. Zanetti,et al.  Insights into post-transcriptional regulation during legume-rhizobia symbiosis , 2013, Plant signaling & behavior.

[38]  I. Baldwin,et al.  Herbivory-induced changes in the small-RNA transcriptome and phytohormone signaling in Nicotiana attenuata , 2008, Proceedings of the National Academy of Sciences.

[39]  P. Rouzé,et al.  Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Narendra Tuteja,et al.  microRNAs targeting DEAD-box helicases are involved in salinity stress response in rice (Oryza sativa L.) , 2012, BMC Plant Biology.

[41]  Adam M. Gustafson,et al.  microRNA-Directed Phasing during Trans-Acting siRNA Biogenesis in Plants , 2005, Cell.

[42]  Barbara Baker,et al.  SoMART: a web server for plant miRNA, tasiRNA and target gene analysis. , 2012, The Plant journal : for cell and molecular biology.

[43]  Shivakundan Singh Tej,et al.  Elucidation of the Small RNA Component of the Transcriptome , 2005, Science.

[44]  I. Baldwin,et al.  RNA-directed RNA polymerase 1 (RdR1) mediates the resistance of Nicotiana attenuata to herbivore attack in nature. , 2007, The Plant journal : for cell and molecular biology.

[45]  M. Shamimuzzaman,et al.  Identification of soybean seed developmental stage-specific and tissue-specific miRNA targets by degradome sequencing , 2012, BMC Genomics.

[46]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[47]  David M. Goodstein,et al.  Phytozome: a comparative platform for green plant genomics , 2011, Nucleic Acids Res..

[48]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature reviews genetics.

[49]  Wen-Wu Guo,et al.  Identification and Comparative Profiling of miRNAs in an Early Flowering Mutant of Trifoliate Orange and Its Wild Type by Genome-Wide Deep Sequencing , 2012, PloS one.

[50]  L. Sieburth,et al.  Widespread Translational Inhibition by Plant miRNAs and siRNAs , 2008, Science.

[51]  Sunghwan Sohn,et al.  Abbreviation definition identification based on automatic precision estimates , 2008, BMC Bioinformatics.

[52]  Rui Shi,et al.  Computational prediction of plant miRNA targets. , 2011, Methods in molecular biology.

[53]  Z. Chen,et al.  Roles of target site location and sequence complementarity in trans-acting siRNA formation in Arabidopsis. , 2012, The Plant journal : for cell and molecular biology.

[54]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[55]  D. Nettleton,et al.  The Arabidopsis MicroRNA396-GRF1/GRF3 Regulatory Module Acts as a Developmental Regulator in the Reprogramming of Root Cells during Cyst Nematode Infection1[W][OA] , 2012, Plant Physiology.

[56]  I. Baldwin,et al.  RNA-Directed RNA Polymerase3 from Nicotiana attenuata Is Required for Competitive Growth in Natural Environments1[W][OA] , 2008, Plant Physiology.

[57]  Yuanji Zhang,et al.  miRU: an automated plant miRNA target prediction server , 2005, Nucleic Acids Res..

[58]  Stefan L Ameres,et al.  Diversifying microRNA sequence and function , 2013, Nature Reviews Molecular Cell Biology.

[59]  Patrick Xuechun Zhao,et al.  psRNATarget: a plant small RNA target analysis server , 2011, Nucleic Acids Res..

[60]  A. Adai,et al.  Computational prediction of miRNAs in Arabidopsis thaliana. , 2005, Genome research.

[61]  I. Somssich,et al.  The Role of WRKY Transcription Factors in Plant Immunity[W] , 2009, Plant Physiology.

[62]  V. Baev,et al.  miRTour: Plant miRNA and target prediction tool , 2011, Bioinformation.

[63]  R. Overbeek,et al.  Searching for patterns in genomic data. , 1997, Trends in genetics : TIG.

[64]  Diana V. Dugas,et al.  Sucrose induction of Arabidopsis miR398 represses two Cu/Zn superoxide dismutases , 2008, Plant Molecular Biology.

[65]  D. Lipman,et al.  Rapid and sensitive protein similarity searches. , 1985, Science.

[66]  O. Voinnet Origin, Biogenesis, and Activity of Plant MicroRNAs , 2009, Cell.

[67]  Shuigeng Zhou,et al.  Finding MicroRNA Targets in Plants: Current Status and Perspectives , 2012, Genom. Proteom. Bioinform..

[68]  Yun Zheng,et al.  Transcriptome-wide identification of microRNA targets in rice. , 2010, The Plant journal : for cell and molecular biology.

[69]  Patrick Xuechun Zhao,et al.  Computational analysis of miRNA targets in plants: current status and challenges , 2011, Briefings Bioinform..

[70]  Xuemei Chen,et al.  Biogenesis, Turnover, and Mode of Action of Plant MicroRNAs[OPEN] , 2013, Plant Cell.

[71]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.

[72]  Anna Tramontano,et al.  Evaluation of residue–residue contact prediction in CASP10 , 2014, Proteins.

[73]  D. Bartel,et al.  MicroRNAS and their regulatory roles in plants. , 2006, Annual review of plant biology.

[74]  Scott A. Givan,et al.  ASRP: the Arabidopsis Small RNA Project Database , 2004, Nucleic Acids Res..

[75]  S. Luo,et al.  Global identification of microRNA–target RNA pairs by parallel analysis of RNA ends , 2008, Nature Biotechnology.

[76]  A. Tramontano,et al.  Evaluation of residue–residue contact predictions in CASP9 , 2011, Proteins.