论文信息 - Learning the Central Events and Participants in Unlabeled Text

Learning the Central Events and Participants in Unlabeled Text

The majority of information on the Internet is expressed in written text. Understanding and extracting this information is crucial to building intelligent systems that can organize this knowledge. Today, most algorithms focus on learning atomic facts and relations. For instance, we can reliably extract facts like "Annapolis is a City" by observing redundant word patterns across a corpus. However, these facts do not capture richer knowledge like the way detonating a bomb is related to destroying a building, or that the perpetrator who was convicted must have been arrested. A structured model of these events and entities is needed for a deeper understanding of language. This talk describes unsupervised approaches to learning such rich knowledge.

Nathanael Chambers | Daniel Jurafsky

[1] Roger C. Schank,et al. SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[2] Ellen Riloff,et al. Inducing Domain-Specific Semantic Class Taggers from (Almost) Nothing , 2010, ACL.

[3] Oren Etzioni,et al. Open Information Extraction from the Web , 2007, CACM.

[4] Estevam R. Hruschka,et al. Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[5] Pedro M. Domingos,et al. Extracting Semantic Networks from Text Via Relational Clustering , 2008, ECML/PKDD.