A Multi-Platform Annotation Ecosystem for Domain Adaptation

This paper describes an ecosystem consisting of three independent text annotation platforms. To demonstrate their ability to work in concert, we illustrate how to use them to address an interactive domain adaptation task in biomedical entity recognition. The platforms and the approach are largely domain-independent and can be readily applied to other areas of science.
