Using Physical and Logical Constraints for Invoice Understanding

Abstract:This work presents a methodology for invoice understanding. The invoices of our domain can be grouped into classes according to their logo. The understanding phase is based on two knowledge levels: a specific knowledge for each class, called a document model; and knowledge on the whole domain of interest, called a domain model. The invoices of a known class are understood by its document model, while the invoices of an unknown class are understood by using the domain model. The main contribution of this work is related to the use of the physical and logical constraints of the domain of interest for document understanding, without using an OCR system. Our approach has been tested by some experiments that are intended to identify some regions within invoices of unknown classes. In most cases, the results have shown the reliability of the approach.

[1]  DengelAndreas,et al.  From Paper to Office Document Standard Representation , 1992 .

[2]  Francesca Cesarini,et al.  Rectangle labelling for an invoice understanding system , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[3]  Karl-Hans Bläsius,et al.  Knowledge-based document analysis , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[4]  Horst Bunke,et al.  Model-Based Analysis and Understanding of Check Forms , 1994, Int. J. Pattern Recognit. Artif. Intell..

[5]  Hanno Walischewski,et al.  Automatic knowledge acquisition for spatial document interpretation , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[6]  Yuan Yan Tang,et al.  Financial document processing based on staff line and description language , 1995, IEEE Trans. Syst. Man Cybern..

[7]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[8]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Andreas Dengel,et al.  ANASTASIL: A Hybrid Knowledge-Based System for Document Layout Analysis , 1989, IJCAI.

[10]  Francesca Cesarini,et al.  A neural-based architecture for spot-noisy logo recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[11]  Rainer Hoch,et al.  From paper to office document standard representation , 1992, Computer.

[12]  Giovanni Soda,et al.  Logo Recognition by Recursive Neural Networks , 1997, GREC.

[13]  Yuan Yan Tang,et al.  Document Processing for Automatic Knowledge Acquisition , 1994, IEEE Trans. Knowl. Data Eng..

[14]  Francesca Cesarini,et al.  Structured document segmentation and representation by the modified X-Y tree , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[15]  Donato Malerba,et al.  Multistrategy Learning for Document Recognition , 1994, Appl. Artif. Intell..

[16]  T. Pavlidis Algorithms for Graphics and Image Processing , 1981, Springer Berlin Heidelberg.

[17]  Francesca Cesarini,et al.  INFORMys: A Flexible Invoice-Like Form-Reader System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Toyohide Watanabe,et al.  Layout Recognition of Multi-Kinds of Table-Form Documents , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Mohamed Cheriet,et al.  Documents analysis and understanding : a brief survey , 1991 .

[20]  Robert M. Haralick,et al.  Automatic text skew estimation in document images , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.