SANTO: A Web-based Annotation Tool for Ontology-driven Slot Filling

Supervised machine learning algorithms require training data whose generation for complex relation extraction tasks tends to be difficult. Being optimized for relation extraction at sentence level, many annotation tools lack in facilitating the annotation of relational structures that are widely spread across the text. This leads to non-intuitive and cumbersome visualizations, making the annotation process unnecessarily time-consuming. We propose SANTO, an easy-to-use, domain-adaptive annotation tool specialized for complex slot filling tasks which may involve problems of cardinality and referential grounding. The web-based architecture enables fast and clearly structured annotation for multiple users in parallel. Relational structures are formulated as templates following the conceptualization of an underlying ontology. Further, import and export procedures of standard formats enable interoperability with external sources and tools.

[1]  Bridget T. McInnes,et al.  Literature Based Discovery: Models, methods, and trends , 2017, J. Biomed. Informatics.

[2]  João Rocha,et al.  Semantic annotation tools survey , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[3]  Lora Aroyo,et al.  Knowledge-Based Linguistic Annotation of Digital Cultural Heritage Collections , 2009, IEEE Intelligent Systems.

[4]  Katrin Erk,et al.  SALTO - A Versatile Multi-Level Annotation Tool , 2006, LREC.

[5]  Iryna Gurevych,et al.  WebAnno: A Flexible, Web-based and Visually Supported System for Distributed Annotations , 2013, ACL.

[6]  Philipp Cimiano,et al.  Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-Based Information Extraction , 2018, NLDB.

[7]  Cheng Zhang,et al.  Biomedical text mining and its applications in cancer research , 2013, J. Biomed. Informatics.

[8]  Val Goranko,et al.  12th EUROPEAN SUMMER SCHOOL IN LOGIC, LANGUAGE AND INFORMATION , 2000 .

[9]  Christian Bizer,et al.  Extracting attribute-value pairs from product specifications on the web , 2017, WI.

[10]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[11]  Thomas S. Morton,et al.  WordFreak: An Open Tool for Linguistic Annotation , 2003, HLT-NAACL.

[12]  Laurel D. Riek,et al.  Callisto: A Configurable Annotation Workbench , 2004, LREC.

[13]  Lidong Bing,et al.  Unsupervised Extraction of Popular Product Attributes from E-Commerce Web Sites by Considering Customer Reviews , 2016, TOIT.

[14]  Shuying Shen,et al.  A Prototype Tool Set to Support Machine-Assisted Annotation , 2012, BioNLP@HLT-NAACL.

[15]  Heike Adel,et al.  Comparing Convolutional Neural Networks to Traditional Models for Slot Filling , 2016, NAACL.

[16]  Philipp Cimiano,et al.  SCIO: An Ontology to Support the Formalization of Pre-Clinical Spinal Cord Injury Experiments , 2017, JOWO.

[17]  Tuomo Kakkonen DepAnn - An Annotation Tool for Dependency Treebanks , 2006, ArXiv.

[18]  Christoph Müller,et al.  Multi-level annotation of linguistic data with MMAX 2 , 2006 .

[19]  Valentina Bartalesi Lenzi,et al.  CAT: the CELCT Annotation Tool , 2012, LREC.

[20]  Philip V. Ogren,et al.  Knowtator: A Protégé plug-in for annotated corpus construction , 2006, NAACL.

[21]  Dayne Freitag,et al.  Machine Learning for Information Extraction in Informal Domains , 2000, Machine Learning.

[22]  Kalina BontchevaHamish,et al.  Universities of Leeds, Sheffield and York , 2022 .

[23]  Danqi Chen,et al.  Position-aware Attention and Supervised Data Improve Slot Filling , 2017, EMNLP.

[24]  Lora Aroyo,et al.  Hacking History: Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections , 2011, DeRiVE@ISWC.