ESTSUM – ESTONIAN NEWSPAPER TEXTS SUMMARIZER
暂无分享,去创建一个
This article describes an experimental software system for automatic summary generation of Estonian newspaper texts called EstSum. EstSum constructs short summaries of text by selecting the key sentences that characterize the document. Sentences are ranked for potential inclusion in the summary using a weighted combination of statistical, linguistic and typographic features like the position, format and type of sentence, and the word frequency. During the testing, a corpus of 10 hand-created summaries of neswpaper articles was used. The summarizer's output was compared to the handmade summaries and the percentage of overlapping sentences was 60% in average.
[1] Dragomir R. Radev,et al. Introduction to the Special Issue on Summarization , 2002, CL.
[2] H. Dalianis,et al. 1 From SweSum to ScandSum-Automatic text summarization for the Scandinavian languages , 2003 .
[3] Martin Hassel. Exploitation of Named Entities in Automatic Text Summarization for Swedish , 2003 .
[4] M. Maybury,et al. Automatic Summarization , 2002, Computational Linguistics.