This paper discusses how to automatically generate slide shows. The reported presentation system inputs documents annotated with the GDA tagset, an XML tagset which allows machines to automatically infer the semantic structure underlying the raw documents. The system picks up important topics in the input document on the basis of the semantic dependencies and coreferences identified from the tags. This topic selection depends also on interactions with the audience, leading to dynamic adaptation of the presentation. A slide is composed for each topic by extracting relevant sentences and paraphrasing them to an itemized summary. Some heuristics are employed here for paraphrasing and layout. Since the GDA tagset is independent of the domain and style of documents and applicable to diverse natural languages, the reported system is also domain/style independent and easy to adapt to different languages.
[1]
Scott Weinstein,et al.
Centering: A Framework for Modeling the Local Coherence of Discourse
,
1995,
CL.
[2]
Oren Etzioni,et al.
Adaptive Web Sites: an AI Challenge
,
1997,
IJCAI.
[3]
Kôiti Hasida,et al.
Automatic Text Summarization Based on the Global Document Annotation
,
1998,
COLING-ACL.
[4]
Beatrice Santorini,et al.
Building a Large Annotated Corpus of English: The Penn Treebank
,
1993,
CL.
[5]
Oren Etzioni,et al.
Adaptive Web Sites: Automatically Synthesizing Web Pages
,
1998,
AAAI/IAAI.