SimSum: an empirically founded simulation of summarizing

SimSum (Simulation of Summarizing) simulates 20 real-world working steps of expert summarizers. It presents an empirically founded cognitive model of summarizing and demonstrates that human summarization strategies can be simulated. The cognitive model operationalizes the discourse processing model developed by Kintsch and van Dijk. Knowledge engineering followed the KADS approach, empirical modeling used methods of grounded theory development. The observed strategies of expert summarizers have given rise to cooperating object-oriented agents communicating through dedicated blackboards. Each agent is implemented as a CLOS object with an assigned actor at the multimedia user interface. The interface is realized with Macromedia Director. Communication between CLOS and Macromedia Director is mediated by Apple Events. According to the first evaluation results in an educational environment, SimSum transmits summarization know-how effectively. It is, however, not designed as a tutorial system and serves active and curious users best. We are starting its expansion to summarizing in the WWW.

[1]  D. Corkill Blackboard Systems , 1991 .

[2]  Robert P. Futrelle,et al.  Summarization of Documents That Include Graphics , 1998 .

[3]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[4]  Chris Buckley,et al.  Automatic Text Summarization by Paragraph Extraction , 1997 .

[5]  Victor Lesser,et al.  The Evolution of Blackboard Control Architectures , 1992 .

[6]  Guus Schreiber,et al.  KADS : a principled approach to knowledge-based system development , 1993 .

[7]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[8]  Elisabeth Andr,et al.  AiA: Adaptive Communication Assistant for Effective Infobahn Access , 1997 .

[9]  Karen Spärck Jones Automatic summarising: factors and directions , 1998, ArXiv.

[10]  K. A. Ericsson,et al.  Verbal reports as data. , 1980 .

[11]  Kathleen McKeown,et al.  Text generation: using discourse strategies and focus constraints to generate natural language text , 1985 .

[12]  K. A. Ericsson,et al.  Protocol Analysis: Verbal Reports as Data , 1984 .

[13]  Lisa F. Rau,et al.  SCISOR: extracting information from on-line news , 1990, CACM.

[14]  Kathleen R. McKeown,et al.  Generating natural language summaries from multiple on-line sources , 1998 .

[15]  O. G. Selfridge,et al.  Pandemonium: a paradigm for learning , 1988 .

[16]  유창조 Naturalistic Inquiry , 2022, The SAGE Encyclopedia of Research Design.

[17]  Marilyn Bohl,et al.  Information processing , 1971 .

[18]  Udo Hahn,et al.  Text Summarization Based on Terminological Logics , 1998, ECAI.

[19]  Brigitte Endres-Niggemeyer,et al.  How to Implement a Naturalistic Model of Abstracting: Four Core Working Steps of an Expert Abstractor , 1995, Inf. Process. Manag..

[20]  Therese Firmin Hand,et al.  A Proposal for Task-based Evaluation of Text Summarization Systems , 1997, Workshop On Intelligent Scalable Text Summarization.

[21]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[22]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[23]  Dragomir R. Radev,et al.  Generating summaries of multiple news articles , 1995, SIGIR '95.

[24]  Daniel Marcu,et al.  The rhetorical parsing, summarization, and generation of natural language texts , 1998 .

[25]  T. Trabasso,et al.  Causal relatedness and importance of story events , 1985 .

[26]  B. Boguraev Dynamic presentation of document content for rapid on-line skimming , 1998, AAAI 1998.

[27]  M. Scardamalia,et al.  The psychology of written composition , 1987 .

[28]  Lynette Hirschman,et al.  Evaluating Message Understanding Systems: An Analysis of the Third Message Understanding Conference (MUC-3) , 1993, CL.

[29]  H. P Nii,et al.  Blackboard Systems , 1986 .

[30]  Giovanni Guida,et al.  A propositional language for text representation , 1984 .

[31]  Chris D. Paice,et al.  The identification of important concepts in highly structured technical papers , 1993, SIGIR.

[32]  Christian Plaunt,et al.  Subtopic structuring for full-length document access , 1993, SIGIR.

[33]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[34]  Giovanni Guida,et al.  Forward And Backward Reasoning In Automatic Abstracting , 1982, COLING.

[35]  Kathleen McKeown Generating Multimedia Briefings: Language Generation in a Coordinated Multimedia Environment , 1997, IJCAI.

[36]  Giovanni Guida,et al.  Computational models of natural language processing , 1984 .

[37]  Kathleen R. McKeown Generating Patient-Specific Summaries of Online Literature , 2002 .

[38]  Eduard H. Hovy,et al.  Automated Discourse Generation Using Discourse Structure Relations , 1993, Artif. Intell..

[39]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[40]  Kavi Mahesh Hypertext Summary Extraction for Fast Document Browsing , 1997 .

[41]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[42]  Lisa F. Rau,et al.  Innovations in Text Interpretation , 1993, Artif. Intell..

[43]  Mark T. Maybury,et al.  Generating Summaries from Event Data , 1995, Inf. Process. Manag..

[44]  Brigitte Endres-Niggemeyer,et al.  Summarizing information , 1998 .

[45]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[46]  Dragomir R. Radev,et al.  Generating Natural Language Summaries from Multiple On-Line Sources , 1998, CL.

[47]  Giovanni Guida,et al.  Evaluating Importance: A Step Towards Text Summarization , 1985, IJCAI.

[48]  Kathleen McKeown,et al.  Generating Concise Natural Language Summaries , 1995, Inf. Process. Manag..