Deep Learning for the Detection of Tabular Information from Electronic Component Datasheets

The global electronic components supply chain consists of tens of thousands of e-component manufacturers who fabricate over a billion distinct components. These are described in datasheets that differ in style, layout and content, and frequently publish the salient product information in tables. Keeping up-to-date on this information consumes a great deal of human effort and corporate resources. Based on the motivation that AI-based techniques are strong candidates to minimize human intervention in many applications, in this paper, we aim at the first stage of this problem and conduct a comparison of deep learning methods in detecting tabular elements in these documents. Deep learning-based object detectors are shown to be state of the art in detection tasks in different domains therefore we chose two cutting-edge models to adapt to this field, namely Faster-RCNN and RetinaNet. We use backbone networks which are pre-trained on visually salient datasets then employ transfer learning techniques to adapt to our domain. We compare the two networks under two different datasets, namely a dataset that is widely used in academic studies and a private dataset that is used by the suppliers in real supply chains. Our numerical results show that the two networks adapt well to the domain with Faster-RCNN exhibiting marginally better precision with more than 1% difference. However, RetinaNet stands out with promising recall values indicating Feature Pyramid Network architecture can potentially detect technical documents better.

[1]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[2]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Concetto Spampinato,et al.  A Saliency-based Convolutional Neural Network for Table and Chart Detection in Digitized Documents , 2018, ICIAP.

[5]  In Seop Na,et al.  Table Detection from Document Image using Vertical Arrangement of Text Blocks , 2015 .

[6]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Wei Yu,et al.  A Survey of Deep Learning: Platforms, Applications and Emerging Research Trends , 2018, IEEE Access.

[9]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10]  Sergio Guadarrama,et al.  Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Andreas Dengel,et al.  DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[12]  Zach G. Zacharia,et al.  DEFINING SUPPLY CHAIN MANAGEMENT , 2001 .

[13]  Quan Wang,et al.  An Efficient Approach for Polyps Detection in Endoscopic Videos Based on Faster R-CNN , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[14]  Faisal Shafait,et al.  Table detection in heterogeneous documents , 2010, DAS '10.

[15]  Leonid Karlinsky,et al.  A CNN based method for automatic mass detection and classification in mammograms , 2019, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[16]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[17]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[19]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[20]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Steven C. H. Hoi,et al.  Face Detection using Deep Learning: An Improved Faster RCNN Approach , 2017, Neurocomputing.

[22]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.