Seizing the Treasure: Transferring Knowledge in Invoice Analysis

This paper deals with the transfer of knowledge on invoice document layout and extraction strategies, collected by users of the invoice recognition software smartFIX over several years of productive use, to other user's systems. The results of a project analyzing this 'treasure' of knowledge and putting it to use in the smartFIX system are presented. The evaluation shows that this transfer of knowledge using state-of-the-art techniques in transfer learning achieves significantly higher initial recognition rates than the unaugmented system, delivering instant economic advantages by reducing accountant personnel workload.

[1]  Yolande Belaïd,et al.  Case-Based Reasoning for Invoice Analysis and Recognition , 2007, ICCBR.

[2]  Bertin Klein,et al.  smartFIX: A Requirements-Driven System for Document Analysis and Understanding , 2002, Document Analysis Systems.

[3]  Yolande Belaïd,et al.  A Case-Based Reasoning Approach for Invoice Structure Extraction , 2007 .

[4]  Francesca Cesarini,et al.  A two level knowledge approach for understanding documents of a multi-class domain , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[5]  Bertin Klein,et al.  On Benchmarking of Invoice Analysis Systems , 2006, Document Analysis Systems.

[6]  Thomas M. Breuel,et al.  Example-Based Logical Labeling of Document Title Page Images , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[7]  Thomas Bayer,et al.  A generic system for processing invoices , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[8]  Daniel P. Lopresti,et al.  A fast technique for comparing graph representations with applications to performance evaluation , 2003, Document Analysis and Recognition.

[9]  Francesca Cesarini,et al.  Analysis and understanding of multi-class invoices , 2003, Document Analysis and Recognition.

[10]  Horst Bunke,et al.  Similarity Measures for Structured Representations , 1993, EWCBR.

[11]  Bertin Klein,et al.  Results of a Study on Invoice-Reading Systems in Germany , 2004, Document Analysis Systems.