Integration of document representation, processing and management

This paper describes a method of document representation and proposes an approach towards an integrated document processing and management system. The approach is intended to capture essentially freely structured documents, such as those typically used in the office domain. The document analysis system ANASTASIL is capable of revealing the structure of complex paper documents, as well as the logical objects within them, such as receiver, footnote, and date. Moreover, it facilitates the handling of the information they contain. Analyzed documents are stored by the management system KRISYS, which is connected to several different subsequent services. The integrated system described can be considered an ideal extension of the human clerk, easing the clerk's information processing tasks. The symbolic representation of the analysis results allows an easy transformation into a given international standard, e.g., ODA/ODIF or SGML, and interchange of the results via global networks.
