Generation of Summaries that Appropriately and Adequately Express the Contents of Original Documents Using Word-Association Knowledge

In this study, our purpose was to make a short summary for sentences. For example, we aimed to make a short summary “terror” for sentences “A bomb went off. Some people were killed. This was triggered by rebel campaign.” In this study, we proposed a new method that generates summaries that can appropriately and adequately express the contents of their respective original documents using word-association knowledge. In this method, we assumed that a good summary comprises words that can express the contents of the original document and does not contain words that are unable to express the contents of the original document. Using statistical tests, we confirmed that the use of elements in our method was beneficial. Our method obtained 0.75 as the ratio where the top 10 summaries for each document include a correct summary and 0.45 as the mean reciprocal rank (MRR) in the “lenient” case of experiments.

[1]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[2]  Daniel Marcu,et al.  Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..

[3]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[4]  K. Kondo Summarization with Dictionary-based Paraphrasing , 1997 .

[5]  Carolyn Penstein Rosé,et al.  SIDE: The Summarization Integrated Development Environment , 2008, ACL.

[6]  Sur-Jin Ker,et al.  A Text Categorization Based on a Summarization Extraction , 2000 .

[7]  Zhu Zhang,et al.  NewsInEssence: A System For Domain-Independent, Real-Time News Clustering and Multi-Document Summarization , 2001, HLT.

[8]  Michele Banko,et al.  Headline Generation Based on Statistical Translation , 2000, ACL.

[9]  Noriko Kando,et al.  MuST: Workshop on MUltimodal Summarization for Trend Information , 2005, NTCIR.

[10]  Elizabeth D. Liddy,et al.  Advances in Automatic Text Summarization , 2001, Information Retrieval.

[11]  Chin-Yew Lin,et al.  Automated Text Summarization , 2005, IJCNLP.

[12]  Hongyan Jing,et al.  Sentence Reduction for Automatic Text Summarization , 2000, ANLP.

[13]  Eduard Hovy,et al.  The Potential and Limitations of Automatic Sentence Extraction for Summarization , 2003, HLT-NAACL 2003.

[14]  Jade Goldstein-Stewart,et al.  Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.

[15]  Regina Barzilay,et al.  Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization , 2004, NAACL.

[16]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.