论文信息 - DeepTabStR: Deep Learning based Table Structure Recognition

DeepTabStR: Deep Learning based Table Structure Recognition

This paper presents a novel method for the analysis of tabular structures in document images using the potential of deformable convolutional networks. In order to assess the suitability of the model to the task of table structure recognition, most of the prior methods have been tested on the smaller ICDAR-13 table structure recognition dataset comprising of just 156 tables. We curated a new image-based table structure recognition dataset, TabStructDB2, comprising of 1081 tables densely labeled with row and column information. Instead of collecting new images for this purpose, we leveraged the famous Page-Object Detection dataset from ICDAR-17, and added structural information for all the tabular regions present in the dataset. This new publicly available dataset will enable the development of more sophisticated table structure recognition techniques in the future. We performed extensive evaluation on the two datasets (ICDAR-13 and TabStructDB) including crossdataset testing in order to evaluate the efficacy of the proposed approach. We achieved state-of-the-art results with deformable models on ICDAR-13 with an average F-Measure of 92.98% (89.42% for rows and 96.55% for columns) and report baseline results on TabStructDB for guiding future research efforts with an F-Measure of 93.72% (91.26% for rows and 95.59% for columns). Despite promising results, structural analysis of tables with arbitrary layouts is still far from achievable at this point.

[1] Marcus Liwicki,et al. Improved Automatic Analysis of Architectural Floor Plans , 2011, 2011 International Conference on Document Analysis and Recognition.

[2] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[3] Andreas Dengel,et al. DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[4] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Andreas Dengel,et al. DeCNT: Deep Deformable CNN for Table Detection , 2018, IEEE Access.

[6] Abdel Belaïd,et al. Table information extraction and structure recognition using query patterns , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[7] Alexey O. Shigarov,et al. Configurable Table Structure Recognition in Untagged PDF documents , 2016, DocEng.

[8] Paul Lukowicz,et al. D-StaR: A Generic Method for Stamp Segmentation from Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[9] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Roman Kern,et al. A Comparison of Two Unsupervised Table Recognition Methods from Digital Scientific Articles , 2014, D Lib Mag..

[11] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[12] Yi Li,et al. Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[13] Zhoujun Li,et al. TableBank: Table Benchmark for Image-based Table Detection and Recognition , 2019, LREC.

[14] Muhammad Imran Malik,et al. Table Detection Using Deep Learning , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[15] Zhi Tang,et al. ICDAR2017 Competition on Page Object Detection , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[16] Yalin Wang,et al. Table structure understanding and its performance evaluation , 2004, Pattern Recognit..

[17] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Thomas Kieninger,et al. The T-Recs Table Recognition and Analysis System , 1998, Document Analysis Systems.

[19] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.

[20] Hye-Young Paik,et al. TEXUS: A unified framework for extracting and understanding tables in PDF documents , 2019, Inf. Process. Manag..