论文信息 - Deep Learning for the Detection of Tabular Information from Electronic Component Datasheets

Deep Learning for the Detection of Tabular Information from Electronic Component Datasheets

The global electronic components supply chain consists of tens of thousands of e-component manufacturers who fabricate over a billion distinct components. These are described in datasheets that differ in style, layout and content, and frequently publish the salient product information in tables. Keeping up-to-date on this information consumes a great deal of human effort and corporate resources. Based on the motivation that AI-based techniques are strong candidates to minimize human intervention in many applications, in this paper, we aim at the first stage of this problem and conduct a comparison of deep learning methods in detecting tabular elements in these documents. Deep learning-based object detectors are shown to be state of the art in detection tasks in different domains therefore we chose two cutting-edge models to adapt to this field, namely Faster-RCNN and RetinaNet. We use backbone networks which are pre-trained on visually salient datasets then employ transfer learning techniques to adapt to our domain. We compare the two networks under two different datasets, namely a dataset that is widely used in academic studies and a private dataset that is used by the suppliers in real supply chains. Our numerical results show that the two networks adapt well to the domain with Faster-RCNN exhibiting marginally better precision with more than 1% difference. However, RetinaNet stands out with promising recall values indicating Feature Pyramid Network architecture can potentially detect technical documents better.

[1] Jianguo Zhang,et al. The PASCAL Visual Object Classes Challenge , 2006 .

[2] Zhuowen Tu,et al. Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Concetto Spampinato,et al. A Saliency-based Convolutional Neural Network for Table and Chart Detection in Digitized Documents , 2018, ICIAP.

[5] In Seop Na,et al. Table Detection from Document Image using Vertical Arrangement of Text Blocks , 2015 .

[6] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Wei Yu,et al. A Survey of Deep Learning: Platforms, Applications and Emerging Research Trends , 2018, IEEE Access.

[9] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10] Sergio Guadarrama,et al. Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Andreas Dengel,et al. DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[12] Zach G. Zacharia,et al. DEFINING SUPPLY CHAIN MANAGEMENT , 2001 .

[13] Quan Wang,et al. An Efficient Approach for Polyps Detection in Endoscopic Videos Based on Faster R-CNN , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[14] Faisal Shafait,et al. Table detection in heterogeneous documents , 2010, DAS '10.

[15] Leonid Karlinsky,et al. A CNN based method for automatic mass detection and classification in mammograms , 2019, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[16] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[17] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[19] Jean Serra,et al. Image Analysis and Mathematical Morphology , 1983 .

[20] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Steven C. H. Hoi,et al. Face Detection using Deep Learning: An Improved Faster RCNN Approach , 2017, Neurocomputing.

[22] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.