论文信息 - Automatic multidocument summarization of research abstracts: Design and user evaluation

Automatic multidocument summarization of research abstracts: Design and user evaluation

The purpose of this study was to develop a method for automatic construction of multidocument summaries of sets of research abstracts that may be retrieved by a digital library or search engine in response to a user query. Sociology dissertation abstracts were selected as the sample domain in this study. A variable-based framework was proposed for integrating and organizing research concepts and relationships as well as research methods and contextual relations extracted from different dissertation abstracts. Based on the framework, a new summarization method was developed, which parses the discourse structure of abstracts, extracts research concepts and relationships, integrates the information across different abstracts, and organizes and presents them in a Web-based interface. The focus of this article is on the user evaluation that was performed to assess the overall quality and usefulness of the summaries. Two types of variable-based summaries generated using the summarization methodwith or without the use of a taxonomywere compared against a sentence-based summary that lists only the research-objective sentences extracted from each abstract and another sentence-based summary generated using the MEAD system that extracts important sentences. The evaluation results indicate that the majority of sociological researchers (70%) and general users (64%) preferred the variable-based summaries generated with the use of the taxonomy. © 2007 Wiley Periodicals, Inc.

Christopher S. G. Khoo | Dion Hoe-Lian Goh | Shiyan Ou

[1] John M. Conroy,et al. Machine and human performance for single and multidocument summarization , 2003 .

[2] Zhu Zhang,et al. Towards CST-enhanced summarization , 2002, AAAI/IAAI.

[3] Francine Chen,et al. A trainable document summarizer , 1995, SIGIR '95.

[4] Christopher S. G. Khoo,et al. A Hierarchical Framework for Multi-document Summarization of Dissertation Abstracts , 2002, ICADL.

[5] Gustave J. Rath,et al. The formation of abstracts by the selection of sentences , 1961 .

[6] Therese Firmin Hand,et al. A Proposal for Task-based Evaluation of Text Summarization Systems , 1997, Workshop On Intelligent Scalable Text Summarization.

[7] Panagiotis Stamatopoulos,et al. Summarization from Medical Documents: A Survey , 2005, Artif. Intell. Medicine.

[8] H. P. Edmundson,et al. New Methods in Automatic Extracting , 1969, JACM.

[9] Timo Järvinen,et al. A non-projective dependency parser , 1997, ANLP.

[10] Karen Sparck Jones,et al. Book Reviews: Evaluating Natural Language Processing Systems: An Analysis and Review , 1996, CL.

[11] Jade Goldstein-Stewart,et al. Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.