OpenKN at TAC KBP 2016

This paper describes the system OpenKN which we established in the TAC KBP 2016. In TAC KBP 2016, we participated in one track: Cold Start KB Track. In order to complete the task, we developed a fivestep system which solves the problem of building knowledge base from a document collection of unstructured text. These five steps to complete this task are documentprocessing, relation extraction, crossdocument co-reference resolution, inference, and post-processing, where the relation extractor is the combination of four methods: rule-based pattern extractor, bootstrapping, OpenIE and an Implicit Relation Information Extractor.

[1]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[2]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[3]  Oren Etzioni,et al.  Open Language Learning for Information Extraction , 2012, EMNLP.

[4]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[5]  Oren Etzioni,et al.  Open Information Extraction to KBP Relations in 3 Hours , 2013, TAC.

[6]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[7]  Jason Weston,et al.  Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction , 2013, EMNLP.

[8]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[9]  Heng Ji,et al.  RPI BLENDER TAC-KBP2016 System Description , 2015, TAC.

[10]  Yifan He,et al.  The NYU Cold Start System for TAC 2015 , 2015, TAC.

[11]  Silviu Cucerzan,et al.  TAC Entity Linking by Performing Full-document Entity Extraction and Disambiguation , 2011, TAC.

[12]  Daniel S. Weld,et al.  University of Washington System for 2015 KBP Cold Start Slot Filling , 2015 .

[13]  Yuanzhuo Wang,et al.  Locally Adaptive Translation for Knowledge Graph Embedding , 2015, AAAI.

[14]  Xiaohui Yan,et al.  A biterm topic model for short texts , 2013, WWW.

[15]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[16]  Yifan He,et al.  ICE: Rapid Information Extraction Customization for NLP Novices , 2015, HLT-NAACL.