Deep learning for automatic sale receipt understanding

As a general rule, data analytics are now mandatory for companies. Scanned document analysis brings additional challenges introduced by paper damages and scanning quality. In an industrial context, this work focuses on the automatic understanding of sale receipts which enable access to essential and accurate consumption statistics. Given an image acquired with a smart-phone, the proposed work mainly focuses on the first steps of the full tool chain which aims at providing essential information such as the store brand, purchased products and related prices with the highest possible confidence. To get this high confidence level, even if scanning is not perfectly controlled, we propose a double check processing tool-chain using Deep Convolutional Neural Networks (DCNNs) on one hand and more classical image and text processings on another hand. The originality of this work relates in this double check processing and in the joint use of DCNNs for different applications and text analysis.

[1]  Andrew Zisserman,et al.  Deep Features for Text Spotting , 2014, ECCV.

[2]  Jean-Philippe Domenger,et al.  Semi-structured document image matching and recognition , 2013, Electronic Imaging.

[3]  Raymond Smith,et al.  Adapting the Tesseract open source OCR engine for multilingual OCR , 2009, MOCR '09.

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Raimondo Schettini,et al.  Logo Recognition Using CNN Features , 2015, ICIAP.

[6]  Joachim Denzler,et al.  Fine-Tuning Deep Neural Networks in Continuous Learning Scenarios , 2016, ACCV Workshops.

[7]  Shaogang Gong,et al.  Deep Learning Logo Detection with Data Expansion by Synthesising Context , 2016, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8]  Christian Wolf,et al.  Paragraph text segmentation into lines with Recurrent Neural Networks , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[9]  Jiri Matas,et al.  Scene Text Localization and Recognition with Oriented Stroke Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[10]  Siddharth Garimella,et al.  Identification of Receipts in a Multi-receipt Image using Spectral Clustering , 2016 .

[11]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Corneliu Florea,et al.  Local description using multi-scale complete rank transform for improved logo recognition , 2014, 2014 10th International Conference on Communications (COMM).

[13]  Alexandre Benoit,et al.  Lecture automatique d'un ticket de caisse par vision embarquée sur un téléphone mobile , 2016 .