KOKO is a declarative information extraction system that incorporates advances in natural language processing techniques in its extraction language. KOKO’s extraction language supports simultaneous specification of conditions over the surface syntax and on the structure of the dependency parse tree of sentences, thereby allowing for more refined extractions. Furthermore, the KOKO extraction language allows for aggregating evidence from an input document and supports conditions that are tolerant of linguistic variation of expressing concepts. In this demo, we outline the design of KOKO, a system for extracting information and understanding the results of the extraction. KOKO provides an interactive interface that allows participants to write queries, understand the input and results of the queries. In particular, the user can customize the input text, visualize the input text’s dependency parse trees, and understand the correspondences between query components, dependency tree nodes, text tokens, and the computation and associated scores that led to an extraction. PVLDB Reference Format: Xiaolan Wang, Jiyu Komiya, Yoshihiko Suhara, Aaron Feng, Behzad Golshan, Alon Halevy, Wang-Chiew Tan. Koko: A System for Scalable Semantic Querying of Text. PVLDB, 11 (12): 2018-2021, 2018. DOI: https://doi.org/10.14778/3229863.3236249
[1]
Steven J. DeRose,et al.
XML Path Language (XPath)
,
1999
.
[2]
Frederick Reiss,et al.
Rule-Based Information Extraction is Dead! Long Live Rule-Based Information Extraction Systems!
,
2013,
EMNLP.
[3]
Alon Y. Halevy,et al.
HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments
,
2018,
LREC.
[4]
Slav Petrov,et al.
A Universal Part-of-Speech Tagset
,
2011,
LREC.
[5]
Joakim Nivre,et al.
Universal Dependency Annotation for Multilingual Parsing
,
2013,
ACL.
[6]
Hidekazu Oiwa,et al.
Scalable Semantic Querying of Text
,
2018,
Proc. VLDB Endow..