LAMBERT: Layout-Aware Language Modeling for Information Extraction
暂无分享,去创建一个
Lukasz Garncarek | Rafal Powalski | Bartosz Topolski | Piotr Halama | Filip Grali'nski | Tomasz Stanislawek | Michal Turski | M. Turski | Lukasz Garncarek | Filip Grali'nski | Rafal Powalski | Tomasz Stanisławek | Bartosz Topolski | Piotr Halama | Tomasz Stanislawek
[1] Ping Gong,et al. PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks , 2020, ArXiv.
[2] Shlomo Argamon,et al. Building a test collection for complex document information processing , 2006, SIGIR.
[3] Seunghyun Park,et al. CORD: A Consolidated Receipt Dataset for Post-OCR Parsing , 2019 .
[4] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[5] Doug Downey,et al. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks , 2020, ACL.
[6] Przemyslaw Biecek,et al. Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout , 2020, ArXiv.
[7] Yiming Yang,et al. Transformer-XL: Attentive Language Models beyond a Fixed-Length Context , 2019, ACL.
[8] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[9] Francesca Cesarini,et al. Analysis and understanding of multi-class invoices , 2003, Document Analysis and Recognition.
[10] Furu Wei,et al. LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding , 2020, ACL.
[11] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[12] Fei Wu,et al. TRIE: End-to-End Text Reading and Information Extraction for Document Understanding , 2020, ACM Multimedia.
[13] Flávio S. Corrêa da Silva,et al. Semantic information extraction from images of complex documents , 2012, Applied Intelligence.
[14] Furu Wei,et al. LayoutLM: Pre-training of Text and Layout for Document Image Understanding , 2019, KDD.
[15] Lysandre Debut,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[16] Christian Reisswig,et al. BERTgrid: Contextualized Embedding for 2D Document Representation and Understanding , 2019, ArXiv.
[17] Steffen Bickel,et al. Chargrid: Towards Understanding 2D Documents , 2018, EMNLP.
[18] Xiaojing Liu,et al. Graph Convolution for Multimodal Information Extraction from Visually Rich Documents , 2019, NAACL.
[19] Vincent Poulain D'Andecy,et al. Field Extraction from Administrative Documents by Incremental Structural Templates , 2013, 2013 12th International Conference on Document Analysis and Recognition.
[20] Quoc V. Le,et al. GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism , 2018, ArXiv.
[21] Bidyut Baran Chaudhuri,et al. An End-to-End Administrative Document Analysis System , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.
[22] Eric Medvet,et al. A probabilistic approach to printed document understanding , 2011, International Journal on Document Analysis and Recognition (IJDAR).
[23] Colin Raffel,et al. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..
[24] Louis-Philippe Morency,et al. Integrating Multimodal Information in Large Pretrained Transformers , 2020, ACL.
[25] Evgeniy Bart,et al. Information extraction by finding repeated structure , 2010, DAS '10.
[26] Yasuto Ishitani. Model-based Information Extraction Method Tolerant of OCR Errors for Document Images , 2002, Int. J. Comput. Process. Orient. Lang..
[27] Ashish Vaswani,et al. Self-Attention with Relative Position Representations , 2018, NAACL.
[28] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.