Conceptual modelling for invoice document processing

This paper presents a declarative knowledge base, the Conceptual Model, which describes the invoice domain as generally as possible. The model is based on a semantic network that describes the invoice domain at different levels of abstraction. The Conceptual Model can be used to label the physical rectangles extracted from invoices, in order to construct a model (the Document Model) for each class of invoices. The Document Model contains the physical coordinates of each rectangle, which can be estimated from an invoice, together with the related semantic label. Once constructed, the Document Model can be applied to understand an invoice instance, whose class is uniquely identified by its logo.
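As a rough illustration of the Document Model idea described above, the following Python sketch stores labelled rectangles per invoice class and uses them to assign semantic labels to text found on an invoice instance. It is only a sketch under stated assumptions, not the paper's implementation: the names `Rectangle`, `DocumentModel`, and `understand`, the labels `invoice_number` and `total_amount`, the logo identifier `acme`, and the token format `(text, x, y)` are all hypothetical, and logo recognition is assumed to have already produced the class identifier.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Rectangle:
    """Physical rectangle on the page (hypothetical pixel coordinates)."""
    x: int
    y: int
    width: int
    height: int

    def contains(self, px: int, py: int) -> bool:
        # Half-open bounds: the right and bottom edges are excluded.
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


@dataclass
class DocumentModel:
    """One model per invoice class: semantic label -> rectangle."""
    invoice_class: str  # assumed to be the identifier of the recognised logo
    regions: Dict[str, Rectangle] = field(default_factory=dict)

    def label_at(self, px: int, py: int) -> Optional[str]:
        # Return the label of the first rectangle containing the point.
        for label, rect in self.regions.items():
            if rect.contains(px, py):
                return label
        return None


def understand(models: Dict[str, DocumentModel],
               logo_id: str,
               tokens: List[Tuple[str, int, int]]) -> Dict[str, List[str]]:
    """Select the model by logo, then group token texts by semantic label."""
    model = models[logo_id]
    result: Dict[str, List[str]] = {}
    for text, px, py in tokens:
        label = model.label_at(px, py)
        if label is not None:  # tokens outside every rectangle are ignored
            result.setdefault(label, []).append(text)
    return result


# Hypothetical Document Model for one invoice class.
models = {
    "acme": DocumentModel("acme", {
        "invoice_number": Rectangle(400, 50, 150, 30),
        "total_amount": Rectangle(400, 700, 150, 30),
    })
}

# Hypothetical tokens (text, x, y) extracted from an invoice instance.
tokens = [("INV-001", 410, 60), ("1250.00", 420, 710), ("Thank you", 50, 900)]
fields = understand(models, "acme", tokens)
print(fields)
```

In this sketch the semantic labels are plain strings; in the approach described above they would instead come from the Conceptual Model's semantic network, which is not modelled here.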
