Using librarian techniques in automatic text summarization for information retrieval

A current application of automatic text summarization is to provide an overview of the relevant documents returned by an information retrieval (IR) system. This paper examines how Centrifuser, one such summarization system, was designed with respect to methods used in the library community. We reviewed the techniques that expert librarians use to assist information seekers and codified them into eight distinct strategies. We detail how we have operationalized six of these strategies in Centrifuser by computing an informative extract, indicative differences between documents, and navigational links to narrow or broaden a user's query. We conclude the paper with results from a preliminary evaluation.