Convolutional Attention Networks for Scene Text Recognition
Yongdong Zhang | Yan Li | Zheng-Jun Zha | Yating Yang | Hongtao Xie | Shancheng Fang
[1] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[2] Yongdong Zhang,et al. Deep Fusion of Multiple Semantic Cues for Complex Event Recognition , 2016, IEEE Transactions on Image Processing.
[3] Andrew Zisserman,et al. Deep Features for Text Spotting , 2014, ECCV.
[4] Chong-Wah Ngo,et al. Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues , 2014, Journal of Computer Science and Technology.
[5] Hartmut Neven,et al. PhotoOCR: Reading Text in Uncontrolled Conditions , 2013, 2013 IEEE International Conference on Computer Vision.
[6] Yongdong Zhang,et al. Coarse-to-Fine Description for Fine-Grained Visual Categorization , 2016, IEEE Transactions on Image Processing.
[7] Andrew Zisserman,et al. Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition , 2014, ArXiv.
[8] Yongdong Zhang,et al. Effective Uyghur Language Text Detection in Complex Background Images for Traffic Prompt Identification , 2018, IEEE Transactions on Intelligent Transportation Systems.
[9] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[10] Joelle Pineau,et al. End-to-End Text Recognition with Hybrid HMM Maxout Models , 2013, ICLR.
[11] Jiří Matas,et al. Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[12] Shijian Lu,et al. Accurate Scene Text Recognition Based on Recurrent Neural Network , 2014, ACCV.
[13] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[14] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[15] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[16] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[17] Bin Deng,et al. Name-face association with web facial image supervision , 2017, Multimedia Systems.
[18] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[19] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.
[20] José A. Rodríguez-Serrano,et al. Label embedding for text recognition , 2013, BMVC.
[21] Andrew Zisserman,et al. Reading Text in the Wild with Convolutional Neural Networks , 2014, International Journal of Computer Vision.
[22] Nicu Sebe,et al. Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.
[23] Simon Osindero,et al. Recursive Recurrent Nets with Attention Modeling for OCR in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[24] Tao Wang,et al. End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).
[25] Florent Perronnin,et al. Large-scale image retrieval with compressed Fisher vectors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[26] Myung J. Lee,et al. A synchronization algorithm for distributed multimedia environments , 2009, Multimedia Systems.
[27] C. V. Jawahar,et al. Scene Text Recognition using Higher Order Language Priors , 2009, BMVC.
[28] Ernest Valveny,et al. Word Spotting and Recognition with Embedded Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[29] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[30] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[31] Yann Dauphin,et al. Language Modeling with Gated Convolutional Networks , 2016, ICML.
[32] Ernest Valveny,et al. Visual Attention Models for Scene Text Recognition , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[33] Xiang Bai,et al. Robust Scene Text Recognition with Automatic Rectification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Yan Li,et al. Uyghur Text Matching in Graphic Images for Biomedical Semantic Analysis , 2017, Neuroinformatics.
[35] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[36] Kai Wang,et al. Word Spotting in the Wild , 2010, ECCV.
[37] Wenyu Liu,et al. Strokelets: A Learned Multi-scale Representation for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[38] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.
[39] Kevin Murphy,et al. Attention-Based Extraction of Structured Information from Street View Imagery , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).
[40] Yongdong Zhang,et al. Automated pulmonary nodule detection in CT images using deep convolutional neural networks , 2019, Pattern Recognit.
[41] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.
[42] Jon Almazán,et al. ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.
[43] Andrew Zisserman,et al. Deep Structured Output Learning for Unconstrained Text Recognition , 2014, ICLR.
[44] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[45] Pan He,et al. Reading Scene Text in Deep Convolutional Sequences , 2015, AAAI.
[46] Albert Gordo,et al. Supervised mid-level features for word image representation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.
[48] Feng Xia,et al. ShotVis: Smartphone-Based Visualization of OCR Information from Images , 2015, ACM Trans. Multim. Comput. Commun. Appl.
[49] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[50] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[51] Yongdong Zhang,et al. Supervised Hash Coding With Deep Neural Network for Environment Perception of Intelligent Vehicles , 2018, IEEE Transactions on Intelligent Transportation Systems.
[52] Albert Gordo,et al. Label Embedding: A Frugal Baseline for Text Recognition , 2015, International Journal of Computer Vision.
[53] Xiaoyan Gu,et al. Detecting Uyghur text in complex background images with convolutional neural network , 2017, Multimedia Tools and Applications.
[54] Xiang Bai,et al. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[55] Kai Wang,et al. End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.
[56] Simon M. Lucas,et al. ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
[57] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.