Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks
暂无分享,去创建一个
Ersin Yumer | Daniel Kifer | Xiao Yang | C. Lee Giles | Paul Asente | Mike Kraley | Daniel Kifer | Ersin Yumer | Xiao Yang | Paul Asente | Mike Kraley | P. Asente
[1] H. Emptoz,et al. A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.
[2] Alan Conway,et al. Page grammars and page parsing. A syntactic approach to document layout recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).
[3] Mahesh Viswanathan,et al. Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals , 1993, IEEE Trans. Pattern Anal. Mach. Intell..
[4] Robert M. Haralick,et al. Document page decomposition by the bounding-box project , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.
[5] Robert M. Haralick,et al. Recursive X-Y cut using bounding boxes of connected components , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.
[6] Richard Rogers,et al. UW-ISL document image analysis toolbox: an experimental environment , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.
[7] A. Peter Johnson,et al. A Fast Algorithm for Bottom-Up Document Layout Analysis , 1997, IEEE Trans. Pattern Anal. Mach. Intell..
[8] Adnan Amin,et al. Page Segmentation and Classification Utilizing Bottom-Up Approach , 2001, Int. J. Image Graph..
[9] Azriel Rosenfeld,et al. Document structure analysis algorithms: a literature survey , 2003, IS&T/SPIE Electronic Imaging.
[10] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.
[11] Marcel Worring,et al. The UvA color document dataset , 2004, International Journal of Document Analysis and Recognition (IJDAR).
[12] Paul A. Viola,et al. Learning nongenerative grammatical models for document analysis , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.
[13] R. Smith,et al. An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).
[14] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.
[15] Guillermo Sapiro,et al. Supervised Dictionary Learning , 2008, NIPS.
[16] Apostolos Antonacopoulos,et al. A Realistic Dataset for Performance Evaluation of Document Layout Analysis , 2009, 2009 10th International Conference on Document Analysis and Recognition.
[17] Min-Yen Kan,et al. Logical Structure Recovery in Scholarly Articles with Rich Document Features , 2010, Int. J. Digit. Libr. Syst..
[18] Syed Saqib Bukhari,et al. Improved document image segmentation algorithm using multiresolution morphology , 2011, Electronic Imaging.
[19] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[20] Oriol Ramos Terrades,et al. Document segmentation using Relative Location Features , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).
[21] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.
[22] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[23] Marc'Aurelio Ranzato,et al. DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.
[24] Luc Vincent,et al. Document Image Applications , 2013 .
[25] Andrew Y. Ng,et al. Zero-Shot Learning Through Cross-Modal Transfer , 2013, NIPS.
[26] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[27] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[28] C. Clausner,et al. ICDAR2015 competition on recognition of documents with complex layouts - RDCL2015 , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[29] Wei Xu,et al. Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question , 2015, NIPS.
[30] Iasonas Kokkinos,et al. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.
[31] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[32] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.
[33] Sheng Zeng,et al. Weakly supervised semantic segmentation for social images , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Marcus Liwicki,et al. Page segmentation of historical document images with convolutional autoencoders , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[35] Mario Fritz,et al. Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[36] Yann LeCun,et al. Stacked What-Where Auto-encoders , 2015, ArXiv.
[37] Seunghoon Hong,et al. Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[38] Hal Daumé,et al. Deep Unordered Composition Rivals Syntactic Methods for Text Classification , 2015, ACL.
[39] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] George Papandreou,et al. Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation , 2015, ArXiv.
[41] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[43] Ronan Collobert,et al. Learning to Refine Object Segments , 2016, ECCV.
[44] Yu Qiao,et al. A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.
[45] Gueesang Lee,et al. Dense prediction for text line segmentation in handwritten document images , 2016, 2016 IEEE International Conference on Image Processing (ICIP).
[46] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.
[47] Wei-Lun Chao,et al. Synthesized Classifiers for Zero-Shot Learning , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[49] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[50] Zhiwu Lu,et al. Learning from Weak and Noisy Labels for Semantic Segmentation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[51] Tomas Mikolov,et al. Bag of Tricks for Efficient Text Classification , 2016, EACL.
[52] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.