Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Jiaxin Zhang | Lianwen Jin | Shuaitao Zhang | Guozhi Tang | Chongyu Liu | Y. Wu | Jiapeng Wang | Qianying Wang | Mingxiang Cai