ASTER: An Attentional Scene Text Recognizer with Flexible Rectification
暂无分享,去创建一个
Xiang Bai | Xinggang Wang | Mingkun Yang | Baoguang Shi | Pengyuan Lyu | Cong Yao | Xinggang Wang | C. Yao | X. Bai | Mingkun Yang | Pengyuan Lyu | Baoguang Shi
[1] Xiang Bai,et al. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[2] Kai Wang,et al. End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.
[3] Shijian Lu,et al. Accurate Scene Text Recognition Based on Recurrent Neural Network , 2014, ACCV.
[4] Andrew Zisserman,et al. Reading Text in the Wild with Convolutional Neural Networks , 2014, International Journal of Computer Vision.
[5] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[6] Palaiahnakote Shivakumara,et al. A robust arbitrary text detection system for natural scene images , 2014, Expert Syst. Appl..
[7] Jitendra Malik,et al. Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.
[8] Palaiahnakote Shivakumara,et al. Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.
[9] Wenyu Liu,et al. TextBoxes: A Fast Text Detector with a Single Deep Neural Network , 2016, AAAI.
[10] Shijian Lu,et al. Perspective rectification of document images using fuzzy set and morphological operations , 2005, Image Vis. Comput..
[11] Jerod J. Weinman,et al. Toward Integrated Scene Text Reading , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[12] Wenyu Liu,et al. A Unified Framework for Multioriented Text Detection and Recognition , 2014, IEEE Transactions on Image Processing.
[13] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Robinson Piramuthu,et al. Region-Based Discriminative Feature Pooling for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[15] Hartmut Neven,et al. PhotoOCR: Reading Text in Uncontrolled Conditions , 2013, 2013 IEEE International Conference on Computer Vision.
[16] Simon Osindero,et al. Recursive Recurrent Nets with Attention Modeling for OCR in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[18] Ernest Valveny,et al. Word Spotting and Recognition with Embedded Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] Andrew Zisserman,et al. Deep Structured Output Learning for Unconstrained Text Recognition , 2014, ICLR.
[20] Xiang Bai,et al. Robust Scene Text Recognition with Automatic Rectification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Xiaolin Hu,et al. Gated Recurrent Convolution Neural Network for OCR , 2017, NIPS.
[22] C. V. Jawahar,et al. Enhancing energy minimization framework for scene text recognition with top-down cues , 2016, Comput. Vis. Image Underst..
[23] Jiri Matas,et al. Real-Time Lexicon-Free Scene Text Localization and Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[25] Jiřı́ Matas,et al. Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[26] Pan He,et al. Reading Scene Text in Deep Convolutional Sequences , 2015, AAAI.
[27] Albert Gordo,et al. Supervised mid-level features for word image representation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[29] Chunhua Shen,et al. Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[30] Shuigeng Zhou,et al. Focusing Attention: Towards Accurate Text Recognition in Natural Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[31] Shijian Lu,et al. Document Flattening through Grid Modeling and Regularization , 2006, 18th International Conference on Pattern Recognition (ICPR'06).
[32] Albert Gordo,et al. Label Embedding: A Frugal Baseline for Text Recognition , 2015, International Journal of Computer Vision.
[33] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..
[34] Christoph Meinel,et al. STN-OCR: A single Neural Network for Text Detection and Text Recognition , 2017, ArXiv.
[35] Yonatan Wexler,et al. Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[36] Christoph Meinel,et al. SEE: Towards Semi-Supervised End-to-End Scene Text Recognition , 2017, AAAI.
[37] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[38] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[39] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[40] Ankush Gupta,et al. Synthetic Data for Text Localisation in Natural Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Kai Wang,et al. Word Spotting in the Wild , 2010, ECCV.
[42] David S. Doermann,et al. Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[43] Tao Wang,et al. End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).
[44] Andrew Zisserman,et al. Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition , 2014, ArXiv.
[45] Zihan Zhou,et al. Learning to Read Irregular Text with Attention Mechanisms , 2017, IJCAI.
[46] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[47] C. V. Jawahar,et al. Top-down and bottom-up cues for scene text recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[48] Wenyu Liu,et al. Strokelets: A Learned Multi-scale Representation for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[49] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.
[50] Eric Lecolinet,et al. A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..
[51] Jiri Matas,et al. Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[52] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[53] Ernest Valveny,et al. ICDAR 2015 competition on Robust Reading , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[54] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[55] Andrew Zisserman,et al. Deep Features for Text Spotting , 2014, ECCV.
[56] Shuchang Zhou,et al. EAST: An Efficient and Accurate Scene Text Detector , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[57] Zhuowen Tu,et al. Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .
[58] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[59] Shijian Lu,et al. Accurate recognition of words in scenes without character segmentation using recurrent neural network , 2017, Pattern Recognit..
[60] S. Lucas,et al. ICDAR 2003 robust reading competitions: entries, results, and future directions , 2005, International Journal of Document Analysis and Recognition (IJDAR).
[61] Jiri Matas,et al. A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.
[62] Fred L. Bookstein,et al. Principal Warps: Thin-Plate Splines and the Decomposition of Deformations , 1989, IEEE Trans. Pattern Anal. Mach. Intell..
[63] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.
[64] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.
[65] C. V. Jawahar,et al. Perspective Correction Methods for Camera-Based Document Analysis , 2005 .
[66] Jon Almazán,et al. ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.
[67] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[68] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.