Knowledge Base Population: Successful Approaches and Challenges

This paper gives an overview of the Knowledge Base Population (KBP) track at the 2010 Text Analysis Conference (TAC). The main goal of KBP is to promote research on discovering facts about entities and augmenting a knowledge base (KB) with these facts. The track comprises two tasks: Entity Linking, which links names in context to entities in the KB, and Slot Filling, which adds information about an entity to the KB. Systems discover this information in a large source collection of newswire and web documents. The reference KB is built from attributes ("slots") derived from Wikipedia infoboxes. We survey the techniques that can serve as a basis for a good KBP system, lay out the remaining challenges by comparison with traditional Information Extraction (IE) and Question Answering (QA) tasks, and offer some suggestions for addressing these challenges.
