The function of documents

Abstract The purpose of a document is to facilitate the transfer of information from its author to its readers. It is the author's job to design the document so that the information it contains can be interpreted accurately and efficiently. To do this, the author can make use of a set of stylistic tools. In this paper, we introduce the concept of document functionality, which attempts to describe the roles of documents and their components in the process of transferring information. A functional description of a document provides insight into the type of the document, into its intended uses, and into strategies for automatic document interpretation and retrieval. To demonstrate these ideas, we define a taxonomy of functional document components and show how functional descriptions can be used to reverse-engineer the intentions of the author, to navigate in document space, and to provide important contextual information to aid in interpretation.

[1]  E. Rosch,et al.  Cognition and Categorization , 1980 .

[2]  Rama Chellappa,et al.  Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Kazuhiko Yamamoto,et al.  Structured Document Image Analysis , 1992, Springer Berlin Heidelberg.

[4]  Frank Y. Shih,et al.  Adaptive document block segmentation and classification , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[5]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  L. Stark,et al.  Dissertation Abstract , 1994, Journal of Cognitive Education and Psychology.

[7]  Azriel Rosenfeld,et al.  Recognition by Functional Parts , 1995, Comput. Vis. Image Underst..

[8]  Anil K. Jain,et al.  Page segmentation using tecture analysis , 1996, Pattern Recognit..

[9]  Rama Chellappa,et al.  Multiscale Document Page Segmentation Using Soft Decision Integration , 1997 .

[10]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Azriel Rosenfeld,et al.  Navigational Functionalities , 1995, Comput. Vis. Image Underst..

[12]  K. S. Baird,et al.  Anatomy of a versatile page reader , 1992, Proc. IEEE.

[13]  Mahesh Viswanathan,et al.  Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Kevin W. Bowyer,et al.  Function-based generic recognition for multiple object categories , 1994 .

[15]  Kenneth A. Kaufman,et al.  Inductive Learning System AQ15c: The Method and User's Guide , 1995 .

[16]  Harry Wechsler,et al.  Classification of binary document images into textual or nontextual data blocks using network models , 1995 .

[17]  K. Koffka Principles Of Gestalt Psychology , 1936 .