Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
