Document Domain Randomization for Deep Learning Document Layout Extraction

Tobias Isenberg | Robert S. Laramee | Petra Isenberg | C. Lee Giles | Han-Wei Shen | Jian Wu | Michael Sedlmair | Jian Chen | Meng Ling | Torsten Möller

[1] Tobias Isenberg, et al. Three Benchmark Datasets for Scholarly Article Layout Analysis, 2021.

[2] Tobias Isenberg, et al. VIS30K: A Collection of Figures and Tables From IEEE Visualization Conference Publications, 2020, IEEE Transactions on Visualization and Computer Graphics.

[3] Venu Govindaraju, et al. Chart Mining: A Survey of Methods for Automated Chart Analysis, 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Matthias Bethge, et al. Five points to check when comparing visual perception in humans and machines, 2020, Journal of Vision.

[5] Jian Chen, et al. DeepPaperComposer: A Simple Solution for Training Data Preparation for Parsing Research Papers, 2020, SDP@EMNLP.

[6] F. Rossi, et al. The State of the Art in Enhancing Trust in Machine Learning Models with the Use of Visualizations, 2020, Comput. Graph. Forum.

[7] Furu Wei, et al. DocBank: A Benchmark Dataset for Document Layout Analysis, 2020, COLING.

[8] D. Prasad, et al. CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9] Daniel S. Weld, et al. S2ORC: The Semantic Scholar Open Research Corpus, 2020, ACL.

[10] Antonio Jimeno-Yepes, et al. PubLayNet: Largest Dataset Ever for Document Layout Analysis, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[11] Matthias Bethge, et al. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness, 2018, ICLR.

[12] Saman Arif, et al. Table Detection in Document Images using Foreground and Background Features, 2018, 2018 Digital Image Computing: Techniques and Applications (DICTA).

[13] Rui Li, et al. Toward A Deep Understanding of What Makes a Scientific Visualization Memorable, 2018, 2018 IEEE Scientific Visualization Conference (SciVis).

[14] Varun Jampani, et al. Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization, 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[15] Waleed Ammar, et al. Extracting Scientific Figures with Distantly Supervised Neural Networks, 2018, JCDL.

[16] Michael Stonebraker, et al. Beagle: Automated Extraction and Interpretation of Visualizations from the Web, 2017.

[17] Daniel Kifer, et al. Multi-Scale Multi-Task FCN for Semantic Page Segmentation and Table Detection, 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[18] Ersin Yumer, et al. Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks, 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Nir Shavit, et al. Deep Learning is Robust to Massive Label Noise, 2017, ArXiv.

[20] Wojciech Zaremba, et al. Domain randomization for transferring deep neural networks from simulation to the real world, 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21] Sergey Levine, et al. (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image, 2016, Robotics: Science and Systems.

[22] Ali Farhadi, et al. FigureSeer: Parsing Result-Figures in Research Papers, 2016, ECCV.

[23] Stephen James, et al. 3D Simulation for Robot Arm Control with Deep Q-Learning, 2016, ArXiv.

[24] Christopher Andreas Clark, et al. PDFFigures 2.0: Mining figures from research papers, 2016, 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL).

[25] Yuan Yu, et al. TensorFlow: A system for large-scale machine learning, 2016, OSDI.

[26] Michael S. Bernstein, et al. Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations, 2016, International Journal of Computer Vision.

[27] Thomas Brox, et al. A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] C. Lee Giles, et al. Automatic Extraction of Figures from Scholarly Documents, 2015, DocEng.

[29] Jianxiong Xiao, et al. SUN RGB-D: A RGB-D scene understanding benchmark suite, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Kaiming He, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31] Yang Song, et al. An Overview of Microsoft Academic Service (MAS) and Applications, 2015, WWW.

[32] Thomas Brox, et al. FlowNet: Learning Optical Flow with Convolutional Networks, 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33] Christopher Andreas Clark, et al. Looking Beyond Text: Extracting Figures, Tables and Captions from Computer Science Papers, 2015, AAAI Workshop: Scholarly Big Data.

[34] Wei Zhang, et al. Knowledge vault: a web-scale approach to probabilistic knowledge fusion, 2014, KDD.

[35] Cornelia Caragea, et al. CiteSeerX: A Scholarly Big Dataset, 2014, ECIR.

[36] Hanspeter Pfister, et al. What Makes a Visualization Memorable?, 2013, IEEE Transactions on Visualization and Computer Graphics.

[37] Javier Nogueras-Iso, et al. A Semantic Approach for the Annotation of Figures: Application to High-Energy Physics, 2013, MTSR.

[38] Patrice Lopez, et al. GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications, 2009, ECDL.

[39] C. Lee Giles, et al. CiteSeer: an automatic citation indexing system, 1998, DL '98.