RIscoper: a tool for RNA-RNA interaction extraction from the literature

Motivation Numerous experimental and computational studies in the biomedical literature have provided considerable amounts of data on diverse RNA-RNA interactions (RRIs). However, few text mining systems for RRIs information extraction are available. Results RIscoper (RNA Interactome Scoper) represents the first tool for full-scale RNA interactome scanning and was developed for extracting RRIs from the literature based on the N-gram model. Notably, a reliable RRI corpus was integrated in RIscoper, and more than 13,300 manually curated sentences with RRI information were recruited. RIscoper allows users to upload full texts or abstracts, and provides an online search tool that is connected with PubMed (PMID and keyword input), and these capabilities are useful for biologists. RIscoper has a strong performance (90.4% precision and 93.9% recall), integrates natural language processing techniques and has a reliable RRI corpus. Availability The standalone software and web server of RIscoper are freely available at www.rna-society.org/riscoper/. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Karin M. Verspoor,et al.  BioLemmatizer: a lemmatization tool for morphological processing of biomedical text , 2012, J. Biomed. Semant..

[2]  Kwong-Sak Leung,et al.  ViRBase: a resource for virus–host ncRNA-associated interactions , 2014, Nucleic Acids Res..

[3]  Zhiyong Lu,et al.  Beyond accuracy: creating interoperable and scalable text-mining web services , 2016, Bioinform..

[4]  Dong Xu,et al.  DTMiner: identification of potential disease targets through biomedical literature mining , 2016, Bioinform..

[5]  Hakim Tafer,et al.  RNAplex: a fast tool for RNA-RNA interaction search , 2008, Bioinform..

[6]  Yan Huang,et al.  RNALocate: a resource for RNA subcellular localizations , 2016, Nucleic Acids Res..

[7]  Xia Li,et al.  ncRDeathDB: A comprehensive bioinformatics resource for deciphering network organization of the ncRNA-mediated cell death system , 2015, Autophagy.

[8]  Jan Gorodkin,et al.  RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model , 2012, Bioinform..

[9]  J. Ule,et al.  Protein–RNA interactions: new genomic technologies and perspectives , 2012, Nature Reviews Genetics.

[10]  Sampo Pyysalo,et al.  Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer , 2017, Bioinform..

[11]  Yue Zhao,et al.  RAID v2.0: an updated resource of RNA-associated interactions across organisms , 2016, Nucleic Acids Res..

[12]  Yue Zhao,et al.  MNDR v2.0: an updated resource of ncRNA–disease associations in mammals , 2017, Nucleic Acids Res..

[13]  Daniel Lai,et al.  A comprehensive comparison of general RNA–RNA interaction prediction methods , 2015, Nucleic acids research.

[14]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[15]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[16]  Hongfei Lin,et al.  Drug drug interaction extraction from biomedical literature using syntax convolutional neural network , 2016, Bioinform..

[17]  Katz Estimation of probabilities from spase data for the language model component of a speech recognitizer , 1987 .

[18]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[19]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[20]  Wei Zheng,et al.  Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths , 2017, Bioinform..

[21]  Paul P. Gardner,et al.  A comprehensive benchmark of RNA–RNA interaction prediction tools for all domains of life , 2016, Bioinform..

[22]  Niranjan Nagarajan,et al.  @MInter: automated text-mining of microbial interactions , 2016, Bioinform..

[23]  Zhiyong Lu,et al.  BioCreative III interactive task: an overview , 2011, BMC Bioinformatics.