论文信息 - Summary Generation and Evaluation in SumUM

Summary Generation and Evaluation in SumUM

We describe and evaluate Sum UM, a text summarization system that produces indicative-informative abstracts for technical papers. Our approach consists of the shallow syntactic and conceptual analysis of the source document and of the implementation of text regeneration techniques based on a study of abstracts produced by professional abstractors. In an evaluation of indicative content in a categorization task, we observed no differences with other automatic method, while differences are observed in an evaluation of informative content. In an evaluation of text quality, the abstracts were considered acceptable when compared with other automatic abstracts.

Horacio Saggion | Guy Lapalme

[1] Donia Scott,et al. A Discourse Model for Gist Preservation , 1996, SBIA.

[2] H. P. Edmundson,et al. New Methods in Automatic Extracting , 1969, JACM.

[3] Chris D. Paice,et al. The identification of important concepts in highly structured technical papers , 1993, SIGIR.

[4] Daniel Marcu,et al. From discourse structures to text summaries , 1997 .

[5] Horacio Saggion,et al. Concept Identification and Presentation in the Context of Technical Text Summarization , 2000 .

[6] Inderjeet Mani,et al. The Tipster Summac Text Summarization Evaluation , 1999, EACL.

[7] Gerard Salton,et al. Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[8] Elizabeth Du,et al. The discourse-level structure of empirical abstracts: an exploratory study , 1991, Inf. Process. Manag..