论文信息 - TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive capabilities of various systems such as search engines and Knowledge Graphs. Addressing the two main problems, namely table detection (TD) and table structure recognition (TSR), has traditionally been approached independently. In this research, we propose an end-to-end pipeline that integrates deep learning models, including DETR, Cascade TabNet, and PP OCR v2, to achieve comprehensive image-based table recognition. This integrated approach effectively handles diverse table styles, complex structures, and image distortions, resulting in improved accuracy and efficiency compared to existing methods like Table Transformer. Our system achieves simultaneous table detection, table structure recognition, and table content recognition (TCR), preserving table structures and accurately extracting tabular data from document images. The integration of multiple models addresses the intricacies of table recognition, making our approach a promising solution for image-based table understanding, data extraction, and information retrieval applications. Our proposed approach achieves an IOU of 0.96 and an OCR Accuracy of 78%, showcasing a remarkable improvement of approximately 25% in the OCR Accuracy compared to the previous Table Transformer approach.

[1] H. Takeda,et al. Rethinking Image-based Table Recognition Using Weakly Supervised Methods , 2023, ICPRAM.

[2] R. Guo,et al. YOLO-table: disclosure document table detection with involution , 2022, International Journal on Document Analysis and Recognition (IJDAR).

[3] P. Staar,et al. TableFormer: Table Structure Understanding with Transformers , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4] D. Nguyen. TableSegNet: a fully convolutional network for table detection and segmentation in document images , 2021, Int. J. Document Anal. Recognit..

[5] Dianhai Yu,et al. PP-LCNet: A Lightweight CPU Convolutional Neural Network , 2021, ArXiv.

[6] Dianhai Yu,et al. PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System , 2021, ArXiv.

[7] Julian Risch,et al. Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models , 2021, MRQA.

[8] Mayank Singh,et al. ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX , 2021, ICDAR.

[9] Peng Gao,et al. PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex , 2021, ArXiv.

[10] Peng Gao,et al. PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML , 2021, ArXiv.

[11] Didier Stricker,et al. Current Status and Performance Analysis of Table Recognition in Document Images With Deep Neural Networks , 2021, IEEE Access.

[12] Nicolas Usunier,et al. End-to-End Object Detection with Transformers , 2020, ECCV.

[13] Lucian Popa,et al. Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[14] Burak Kantarci,et al. Holistic design for deep learning-based discovery of tabular structures in datasheet images , 2020, Eng. Appl. Artif. Intell..

[15] D. Prasad,et al. CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16] César Domínguez,et al. The Benefits of Close-Domain Fine-Tuning for Table Detection in Document Images , 2019, DAS.

[17] Xianbiao Qi,et al. MASTER: Multi-Aspect Non-local Network for Scene Text Recognition , 2019, Pattern Recognit..

[18] Yang Zhao,et al. Deep High-Resolution Representation Learning for Visual Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Burak Kantarci,et al. Deep Learning for the Detection of Tabular Information from Electronic Component Datasheets , 2019, 2019 IEEE Symposium on Computers and Communications (ISCC).

[20] Martin Holecek,et al. Table Understanding in Structured Documents , 2019, 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW).

[21] Andreas Dengel,et al. DeCNT: Deep Deformable CNN for Table Detection , 2018, IEEE Access.

[22] Xiang Li,et al. Shape Robust Text Detection With Progressive Scale Expansion Network , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Nuno Vasconcelos,et al. Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24] Muhammad Imran Malik,et al. Table Detection Using Deep Learning , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[25] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[26] Zhi Tang,et al. A Table Detection Method for PDF Documents Based on Convolutional Neural Networks , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[27] In Seop Na,et al. Table Detection from Document Image using Vertical Arrangement of Text Blocks , 2015 .

[28] Tamir Hassan,et al. ICDAR 2013 Table Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[29] Yalin Wang,et al. Table structure understanding and its performance evaluation , 2004, Pattern Recognit..

[30] Azriel Rosenfeld,et al. Document structure analysis algorithms: a literature survey , 2003, IS&T/SPIE Electronic Imaging.

[31] Thomas G Kieninger,et al. Table structure recognition based on robust block segmentation , 1998, Electronic Imaging.