This paper describes a system that learns discourse rules for domain-specific analysis of unrestricted text. The goal of discourse analysis in this context is to transform locally identified references to relevant information in the text into a coherent representation of the entire text. This involves a complex series of decisions about merging coreferential objects, filtering out irrelevant information, inferring missing information, and identifying logical relations between domain objects. The Wrap-Up discourse analyzer induces a set of classifiers from a training corpus to handle these discourse decisions. Wrap-Up is fully trainable: it not only determines what classifiers are needed based on domain output specifications, but also automatically selects the features needed by each classifier. Wrap-Up’s classifiers blend linguistic knowledge with real-world domain knowledge.