Case-Based Reasoning for Invoice Analysis and Recognition

This paper introduces CBRDIA (Case-Based Reasoning for Document Invoice Analysis), an approach that uses the principles of case-based reasoning (CBR) to analyze, recognize, and interpret invoices. CBRDIA performs two CBR cycles sequentially. The first checks whether a similar document has already been processed, which makes interpreting the current one straightforward. The second cycle runs only if the first fails: it processes the document by analyzing and interpreting its structural elements (addresses, amounts, tables, etc.) one by one. Together, the two cycles allow documents from both known and unknown classes to be processed. Applied to 923 invoices, CBRDIA reaches a recognition rate of 85.22% for documents of known classes and 74.90% for documents of unknown classes.
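The two sequential cycles can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the case representation, the similarity measure, or any threshold, so the `Case` class, the Jaccard similarity over feature sets, the `0.8` threshold, and the function names below are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    """A stored case (hypothetical representation)."""
    problem: frozenset  # features describing a document or one structure
    solution: str       # interpretation stored for that problem


def jaccard(a: frozenset, b: frozenset) -> float:
    """Set-overlap similarity; an assumed stand-in for the paper's measure."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def retrieve(problem, case_base, threshold):
    """Return the most similar case, or None if nothing reaches the threshold."""
    best = max(case_base, key=lambda c: jaccard(problem, c.problem), default=None)
    if best is not None and jaccard(problem, best.problem) >= threshold:
        return best
    return None


def interpret(doc_features, structures, document_cases, structure_cases,
              threshold=0.8):
    """Run the document-level cycle first, then fall back to structures."""
    # Cycle 1: has a similar whole document already been processed?
    match = retrieve(doc_features, document_cases, threshold)
    if match is not None:
        return ("document", [match.solution])

    # Cycle 2: cycle 1 failed, so interpret each structure one by one.
    results = []
    for structure in structures:
        m = retrieve(structure, structure_cases, threshold)
        results.append(m.solution if m is not None else None)
    return ("structure", results)
```

In this sketch a document of a known class is resolved entirely by the first cycle, while a document of an unknown class is resolved structure by structure, with `None` marking any structure for which no sufficiently similar case exists.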
