Paper information - Stanford's Distantly-Supervised Slot-Filling System

This paper describes the design and implementation of the slot filling system prepared by Stanford’s natural language processing group for the 2011 Knowledge Base Population (KBP) track at the Text Analysis Conference (TAC). Our system relies on a simple distant supervision approach that mainly uses resources furnished by the track’s organizers: we took slot examples from the provided knowledge base and mapped them to documents from several corpora (those distributed by the organizers, Wikipedia, and web snippets). This system is a descendant of Stanford’s system from last year, with several improvements: an inference process that allows for multi-label predictions and uses world knowledge to validate outputs; model combination; and a tighter integration of entity coreference and web snippets in the training process. Our submissions scored 16 F1 points using web snippets and 13.5 F1 points without web snippets; both scores are higher than the median score of 12.7 F1. We also describe our temporal slot filling system, which achieved 37.0 F1 on the diagnostic temporal task on the development queries.
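The distant supervision step in the abstract, mapping known slot values from a knowledge base onto text, can be illustrated with a minimal sketch. This is not the paper's implementation: the toy knowledge base, the sentences, the function name `distant_label`, and the plain substring matching are all assumptions made for illustration. The slot names only follow the general TAC KBP naming style.

```python
from collections import defaultdict

# Hypothetical toy knowledge base of (entity, slot, value) triples.
KB = [
    ("Barack Obama", "per:city_of_birth", "Honolulu"),
    ("Barack Obama", "per:spouse", "Michelle Obama"),
]

# Hypothetical corpus; a real system would draw on large document collections.
SENTENCES = [
    "Barack Obama was born in Honolulu, Hawaii.",
    "Barack Obama and Michelle Obama visited Honolulu.",
    "Honolulu is the capital of Hawaii.",
]


def distant_label(kb, sentences):
    """Label each sentence with every slot whose entity and value both occur in it.

    Matching is naive substring search (an assumption for this sketch); a
    sentence may receive several slot labels, and some labels will be noisy.
    """
    labels = defaultdict(set)
    for entity, slot, value in kb:
        for sentence in sentences:
            if entity in sentence and value in sentence:
                labels[sentence].add(slot)
    return {sentence: sorted(slots) for sentence, slots in labels.items()}


labeled = distant_label(KB, SENTENCES)
for sentence, slots in labeled.items():
    print(slots, "<-", sentence)
```

In this toy run the second sentence receives both labels even though it does not state a birthplace. That kind of noise and label overlap is what motivates the multi-label inference and output validation mentioned in the abstract.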
