论文信息 - Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution - 字舞流文

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Jiaxin Zhang | Lianwen Jin | Shuaitao Zhang | Guozhi Tang | Chongyu Liu | Y. Wu | Jiapeng Wang | Qianying Wang | Mingxiang Cai | Qianying Wang

[1] Eduard H. Hovy,et al. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[2] Ping Gong,et al. PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks , 2020, ArXiv.

[3] Zheng Huang,et al. ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[4] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Fei Wu,et al. TRIE: End-to-End Text Reading and Information Extraction for Document Understanding , 2020, ACM Multimedia.

[6] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.

[7] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.

[8] Seref Sagiroglu,et al. Development of adaptive and intelligent web-based educational systems , 2010, 2010 4th International Conference on Application of Information and Communication Technologies.

[9] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[11] Seunghyun Park,et al. Post-OCR parsing: building simple and robust parser via BIO tagging , 2019 .

[12] Steffen Bickel,et al. Chargrid: Towards Understanding 2D Documents , 2018, EMNLP.

[13] Mauricio Villegas,et al. TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages , 2020, Pattern Recognit. Lett..

[14] Friedrich M. Wahl,et al. Document Analysis System , 1982, IBM J. Res. Dev..

[15] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[16] Furu Wei,et al. LayoutLM: Pre-training of Text and Layout for Document Image Understanding , 2019, KDD.

[17] Regina Barzilay,et al. GraphIE: A Graph-Based Framework for Information Extraction , 2018, NAACL.

[18] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[19] Xiaojing Liu,et al. Graph Convolution for Multimodal Information Extraction from Visually Rich Documents , 2019, NAACL.

[20] K. Minton. Extraction Patterns for Information Extraction Tasks : A Survey , 1999 .

[21] Scott B. Huffman,et al. Learning information extraction patterns from examples , 1995, Learning for Natural Language Processing.

[22] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[23] Alexander Schill,et al. Automatic indexing of scanned documents: a layout-based approach , 2012, Electronic Imaging.

[24] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[25] Xiang Zhang,et al. Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[26] Xiameng Qin,et al. EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[27] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.