暂无分享,去创建一个
[1] Daniel Ferrés,et al. PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles , 2018, LREC.
[2] Glen M. E. Duerr. The Panama Papers , 2016 .
[3] Kevin Duh,et al. AMR Parsing as Sequence-to-Graph Transduction , 2019, ACL.
[4] Hannah Bast,et al. A Benchmark and Evaluation for Text Extraction from PDF , 2017, 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL).
[5] Daniel Gillick,et al. Sentence Boundary Detection and the Problem with the U.S. , 2009, NAACL.
[6] Leo Breiman,et al. Random Forests , 2001, Machine Learning.
[7] Timothy Dozat,et al. Deep Biaffine Attention for Neural Dependency Parsing , 2016, ICLR.
[8] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .
[9] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[10] Juyeon Kang,et al. FinSBD-2021: The 3rd Shared Task on Structure Boundary Detection in Unstructured Text in the Financial Domain , 2021, WWW.
[12] Mirella Lapata,et al. Automatic Paragraph Identification: A Study across Languages and Domains , 2004, EMNLP.
[13] Furu Wei,et al. LayoutLM: Pre-training of Text and Layout for Document Image Understanding , 2019, KDD.
[14] Yoichi Hatsutori,et al. Estimating Legal Document Structure by Considering Style Information and Table of Contents , 2016, JSAI-isAI Workshops.
[15] Shinjae Yoo,et al. Visual Detection with Context for Document Layout Analysis , 2019, EMNLP.
[16] Furu Wei,et al. LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding , 2020, ACL.
[17] Form 10-Q. SECURITIES AND EXCHANGE COMMISSION , 1985 .