Text Structure Analysis as a Tool to Make Retrieved Documents Usable

This paper describes an information retrieval system with the function to support user's use of the retrieved documents using the text-level structure of documents. The text-level structure of each document is described by the occurrence of typical functional components in the text. Automatic detection of the components has been attempted in previous works using surface-level language processing. The proposed system firstly utilizes the text structure to conduct high-precision searches of documents or passages by distinguishing the role or function each concept plays in the text. It also allows browsing or skimming of retrieved texts, creating summaries on-the-fly with various levels of condensation specified by the user. Moreover, the system can search and display any unit of a text such as a sentence, a paragraph or a chapter. Comparison of relevant passages in retrieved documents across multiple texts is helpful for users to examine, analyze, compare and integrate texts and undertake such information work as decision making, problem solving or writing papers, etc., based on the information obtained from retrieved documents.

[1]  Sung-Hyon Myaeng,et al.  Development of a Document Summarization System for Effective Information Services , 1997, RIAO.

[2]  Michael P. Oakes,et al.  The Automatic Generation of Templates for Automatic Abstracting , 1999, BCS-IRSG Annual Colloquium on IR Research.

[3]  Robert A. Day How to write and publish a scientific paper , 1979 .

[4]  Ann Peterson Bishop,et al.  Document Structure and Digital Libraries: How Researchers Mobilize Information in Journal Articles , 1999, Inf. Process. Manag..

[5]  D. Swanson Undiscovered Public Knowledge , 1986 .

[6]  Andrew Dillon,et al.  Designing Usable Electronic Text: Ergonomic Aspects Of Human Information Usage , 1994 .

[7]  S. Jay Samuels,et al.  Adults' use of text structure in the recall of a scientific journal article , 1988 .

[8]  Sherrilynne Shirley Fuller Schema theory in the representation and analysis of text , 1985 .

[9]  学術情報センター National Center for Science Information Systems , 1986 .

[10]  Kyo Kageura,et al.  Phrase processing methods for Japanese text retrieval , 1998, SIGF.

[11]  James Allan,et al.  Approaches to passage retrieval in full text information systems , 1993, SIGIR.

[12]  Chris D. Paice,et al.  The identification of important concepts in highly structured technical papers , 1993, SIGIR.

[13]  Marie-Francine Moens,et al.  Automatic text structuring and categorization as a first step in summarizing legal cases , 1997, Inf. Process. Manag..

[14]  Seiji Miike,et al.  A full-text retrieval system with a dynamic abstract generation function , 1994, SIGIR '94.

[15]  Lisa F. Rau,et al.  SCISOR: extracting information from on-line news , 1990, CACM.

[16]  Robert N. Oddy,et al.  Towards the Use of Situational Information in Information Retrieval , 1992, J. Documentation.

[17]  H.B. Michaelson,et al.  How to write and publish a scientific paper , 1981, Proceedings of the IEEE.

[18]  Elizabeth Du,et al.  The discourse-level structure of empirical abstracts: an exploratory study , 1991, Inf. Process. Manag..

[19]  R. G. Harris Book Review: How to Write and Publish a Scientific Paper 3rd Ed , 1991 .

[20]  Brigitte Endres-Niggemeyer,et al.  How to Implement a Naturalistic Model of Abstracting: Four Core Working Steps of an Expert Abstractor , 1995, Inf. Process. Manag..

[21]  Padmini Srinivasan,et al.  An investigation of content representation using text grammars , 1993, TOIS.

[22]  John M. Swales,et al.  Genre Analysis: English in Academic and Research Settings , 1993 .

[23]  Abderrafih Lehmam,et al.  Text Structuration Leading to an Automatic Summary System: RAFI , 1999, Inf. Process. Manag..

[24]  Simone Teufel,et al.  Sentence extraction as a classification task , 1997 .

[25]  Bryce Allen Text Structures and the User-Intermediary Interaction , 1988 .

[26]  Ann Peterson Bishop,et al.  Digital libraries and knowledge disaggregation: the use of journal article components , 1998, DL '98.

[27]  Bryce Allen Recall cues in known-item retrieval , 1989, JASIS.