MyMiner: a web application for computer-assisted biocuration and text annotation

MOTIVATION The exponential growth of scientific literature has resulted in a massive amount of unstructured natural language data that cannot be directly handled by means of bioinformatics tools. Such tools generally require structured data, often generated through a cumbersome process of manual literature curation. Herein, we present MyMiner, a free and user-friendly text annotation tool aimed to assist in carrying out the main biocuration tasks and to provide labelled data for the development of text mining systems. MyMiner allows easy classification and labelling of textual data according to user-specified classes as well as predefined biological entities. The usefulness and efficiency of this application have been tested for a range of real-life annotation scenarios of various research topics. AVAILABILITY http://myminer.armi.monash.edu.au.

[1]  Burr Settles ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text , 2005 .

[2]  Philip V. Ogren,et al.  Knowtator: A Protégé plug-in for annotated corpus construction , 2006, NAACL.

[3]  Winston A Hide,et al.  Big data: The future of biocuration , 2008, Nature.

[4]  A. Valencia,et al.  Creating Reference Datasets for Systems Biology Applications Using Text Mining , 2009, Annals of the New York Academy of Sciences.

[5]  Martin H. Schaefer,et al.  MedlineRanker: flexible ranking of biomedical literature , 2009, Nucleic Acids Res..

[6]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[7]  Goran Nenadic,et al.  LINNAEUS: A species name identification system for biomedical literature , 2010, BMC Bioinformatics.

[8]  Martin Krallinger Importance of negations and experimental qualifiers in biomedical literature , 2010, NeSp-NLP@ACL.

[9]  Roser Morante,et al.  Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, NeSp-NLP@ACL 2010, Uppsala, Sweden, July 10, 2010 , 2010, NeSp-NLP@ACL.

[10]  Zhiyong Lu,et al.  Benchmarking of the 2010 BioCreative Challenge III text-mining competition by the BioGRID and MINT interaction databases , 2011 .

[11]  Zhiyong Lu,et al.  BioCreative III interactive task: an overview , 2011, BMC Bioinformatics.

[12]  Tim Clark,et al.  Open semantic annotation of scientific publications using DOMEO , 2012, J. Biomed. Semant..

[13]  Paolo Ciccarese,et al.  DOMEO: a web-based tool for semantic annotation of online documents , 2012 .

[14]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[15]  W. John Wilbur,et al.  PIE the search: searching PubMed literature for protein interaction information , 2012, Bioinform..