Learning the Central Events and Participants in Unlabeled Text
暂无分享,去创建一个
The majority of information on the Internet is expressed in written text. Understanding and extracting this information is crucial to building intelligent systems that can organize this knowledge. Today, most algorithms focus on learning atomic facts and relations. For instance, we can reliably extract facts like "Annapolis is a City" by observing redundant word patterns across a corpus. However, these facts do not capture richer knowledge like the way detonating a bomb is related to destroying a building, or that the perpetrator who was convicted must have been arrested. A structured model of these events and entities is needed for a deeper understanding of language. This talk describes unsupervised approaches to learning such rich knowledge.
[1] Roger C. Schank,et al. SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .
[2] Ellen Riloff,et al. Inducing Domain-Specific Semantic Class Taggers from (Almost) Nothing , 2010, ACL.
[3] Oren Etzioni,et al. Open Information Extraction from the Web , 2007, CACM.
[4] Estevam R. Hruschka,et al. Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.
[5] Pedro M. Domingos,et al. Extracting Semantic Networks from Text Via Relational Clustering , 2008, ECML/PKDD.