暂无分享,去创建一个
Bhargava Urala Kota | Yusheng Xie | Srikar Appalaraju | Bhavan Jasani | R. Manmatha | B. Kota | Srikar Appalaraju | Yusheng Xie | R. Manmatha | Bhavan Jasani
[1] Seunghyun Park,et al. CORD: A Consolidated Receipt Dataset for Post-OCR Parsing , 2019 .
[2] Geoffrey E. Hinton,et al. Big Self-Supervised Models are Strong Semi-Supervised Learners , 2020, NeurIPS.
[3] Mark Chen,et al. Language Models are Few-Shot Learners , 2020, NeurIPS.
[4] Jianfeng Gao,et al. UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training , 2020, ICML.
[5] BROS: A PRE-TRAINED LANGUAGE MODEL , 2020 .
[6] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Furu Wei,et al. LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding , 2020, ACL.
[8] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[9] Ujjwal Bhattacharya,et al. Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).
[10] Andreas Dengel,et al. DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[11] Konstantinos G. Derpanis,et al. Evaluation of deep convolutional nets for document image classification and retrieval , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[12] Gabriela Csurka,et al. What is the right way to represent document images? , 2016, ArXiv.
[13] Sandeep Tata,et al. Representation Learning for Information Extraction from Form-like Documents , 2020, ACL.
[14] Scott Cohen,et al. Deep Visual Template-Free Form Parsing , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[15] Colin Raffel,et al. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..
[16] Kaiming He,et al. Improved Baselines with Momentum Contrastive Learning , 2020, ArXiv.
[17] Yu Cheng,et al. UNITER: UNiversal Image-TExt Representation Learning , 2019, ECCV.
[18] Furu Wei,et al. VL-BERT: Pre-training of Generic Visual-Linguistic Representations , 2019, ICLR.
[19] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[20] Yolande Belaïd,et al. An Invoice Reading System Using a Graph Convolutional Network , 2018, ACCV Workshops.
[21] Thomas Muller,et al. TaPas: Weakly Supervised Table Parsing via Pre-training , 2020, ACL.
[22] Thomas Wolf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.
[23] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[24] Przemyslaw Biecek,et al. Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout , 2020, ArXiv.
[25] R. Manmatha,et al. SCATTER: Selective Context Attentional Scene Text Recognizer , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Mohammad Mehdi Rashidi,et al. Modular Multimodal Architecture for Document Classification , 2019, ArXiv.
[27] 知秀 柴田. 5分で分かる!? 有名論文ナナメ読み:Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding , 2020 .
[28] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[29] Sergey Ioffe,et al. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.
[30] Antonio Jimeno-Yepes,et al. PubLayNet: Largest Dataset Ever for Document Layout Analysis , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[31] Ilya Sutskever,et al. Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.
[32] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.
[33] Vincent Poulain D'Andecy,et al. One-shot field spotting on colored forms using subgraph isomorphism , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[34] Shlomo Argamon,et al. Building a test collection for complex document information processing , 2006, SIGIR.
[35] Cho-Jui Hsieh,et al. VisualBERT: A Simple and Performant Baseline for Vision and Language , 2019, ArXiv.
[36] Serge J. Belongie,et al. Residual Networks Behave Like Ensembles of Relatively Shallow Networks , 2016, NIPS.
[37] Douwe Kiela,et al. Supervised Multimodal Bitransformers for Classifying Images and Text , 2019, ViGIL@NeurIPS.
[38] Sen Yoshida,et al. VisualMRC: Machine Reading Comprehension on Document Images , 2021, AAAI.
[39] Frank Hutter,et al. Decoupled Weight Decay Regularization , 2017, ICLR.
[40] Zheng Huang,et al. ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[41] Hongming Cai,et al. iRMP: From Printed Forms to Relational Data Model , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).
[42] Stefan Lee,et al. ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks , 2019, NeurIPS.
[43] Seunghyun Park,et al. Spatial Dependency Parsing for Semi-Structured Document Information Extraction , 2020, FINDINGS.
[44] Steffen Bickel,et al. Chargrid: Towards Understanding 2D Documents , 2018, EMNLP.
[45] Peter Henderson,et al. Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning , 2020, ArXiv.
[46] Chris Tensmeyer,et al. Analysis of Convolutional Neural Networks for Document Image Classification , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[47] Tomasz Dwojak,et al. Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer , 2021, ICDAR.
[48] Yusheng Xie,et al. Towards Good Practices in Self-supervised Representation Learning , 2020, ArXiv.
[49] Jean-Philippe Thiran,et al. FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents , 2019, 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW).
[50] Marcus Liwicki,et al. Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[51] Alicia Fornés,et al. Table Detection in Invoice Documents by Graph Neural Networks , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[52] Lukasz Garncarek,et al. LAMBERT: Layout-Aware language Modeling using BERT for information extraction , 2020, ArXiv.
[53] Arnab Nandi,et al. Deterministic Routing between Layout Abstractions for Multi-Scale Classification of Visually Rich Documents , 2019, IJCAI.
[54] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[55] Furu Wei,et al. LayoutLM: Pre-training of Text and Layout for Document Image Understanding , 2019, KDD.
[56] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.
[57] Christian Reisswig,et al. BERTgrid: Contextualized Embedding for 2D Document Representation and Understanding , 2019, ArXiv.