PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition
暂无分享,去创建一个
Yongdong Zhang | Shenggao Zhu | Hongtao Xie | Shancheng Fang | Jing Wang | Yuxin Wang | Mengting Xing
[1] Yongdong Zhang,et al. From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[2] Yongdong Zhang,et al. Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Ling Shao,et al. Multi-Stage Progressive Image Restoration , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Hanqing Lu,et al. Semi-Supervised Scene Text Recognition , 2021, IEEE Transactions on Image Processing.
[5] Xiaochun Cao,et al. SLOAN: Scale-Adaptive Orientation Attention Network for Scene Text Recognition , 2020, IEEE Transactions on Image Processing.
[6] Yi Jiang,et al. Sparse R-CNN: End-to-End Object Detection with Learnable Proposals , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Zhouhui Lian,et al. Exploring Font-independent Features for Scene Text Recognition , 2020, ACM Multimedia.
[8] Ankush Gupta,et al. Adaptive Text Recognition through Visual Matching , 2020, ECCV.
[9] Zhanghui Kuang,et al. RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition , 2020, ECCV.
[10] Nicolas Usunier,et al. End-to-End Object Detection with Transformers , 2020, ECCV.
[11] Weiping Wang,et al. SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Jiebo Luo,et al. On Vocabulary Reliance in Scene Text Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Lianwen Jin,et al. Text Recognition in the Wild , 2020, ACM Comput. Surv..
[14] Errui Ding,et al. Towards Accurate Scene Text Recognition With Semantic Reasoning Networks , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Xiang Bai,et al. TextScanner: Reading Characters in Order for Robust Scene Text Recognition , 2019, AAAI.
[16] Lianwen Jin,et al. Decoupled Attention Network for Text Recognition , 2019, AAAI.
[17] Xiang Bai,et al. ASTER: An Attentional Scene Text Recognizer with Flexible Rectification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[18] Xiangjian He,et al. ReELFA: A Scene Text Recognizer with Encoded Location and Focused Attention , 2019, 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW).
[19] Kai Zhou,et al. ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[20] Kaigui Bian,et al. Symmetry-Constrained Rectification Network for Scene Text Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[21] Xiaoyong Shen,et al. 2D Attentional Irregular Scene Text Recognizer , 2019, ArXiv.
[22] Tao Mei,et al. Destruction and Construction Learning for Fine-Grained Image Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Errui Ding,et al. Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[24] Qinghua Hu,et al. Progressive Image Deraining Networks: A Better and Simpler Baseline , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Shijian Lu,et al. ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Peng Wang,et al. Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition , 2018, AAAI.
[27] Yongdong Zhang,et al. Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling , 2018, ACM Multimedia.
[28] Jian Zhang,et al. Scene Text Recognition from Two-Dimensional Perspective , 2018, AAAI.
[29] Xiang Bai,et al. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[30] Bo Xu,et al. NRTR: A No-Recurrence Sequence-to-Sequence Model for Scene Text Recognition , 2018, 2019 International Conference on Document Analysis and Recognition (ICDAR).
[31] Ya Su,et al. A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[32] Shuigeng Zhou,et al. AON: Towards Arbitrarily-Oriented Text Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[33] Shuigeng Zhou,et al. Focusing Attention: Towards Accurate Text Recognition in Natural Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[34] Yi-Chao Wu,et al. Scene Text Recognition with Sliding Convolutional Character Models , 2017, ArXiv.
[35] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[36] Lianwen Jin,et al. Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[37] Xu-Cheng Yin,et al. Text Detection, Tracking and Recognition in Video: A Comprehensive Survey , 2016, IEEE Transactions on Image Processing.
[38] Wenyu Liu,et al. Strokelets: A Learned Multi-Scale Mid-Level Representation for Scene Text Recognition , 2016, IEEE Transactions on Image Processing.
[39] A. Vedaldi,et al. Synthetic Data for Text Localisation in Natural Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Simon Osindero,et al. Recursive Recurrent Nets with Attention Modeling for OCR in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Ernest Valveny,et al. ICDAR 2015 competition on Robust Reading , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).
[42] Xiang Bai,et al. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[43] Shuchang Zhou,et al. ICDAR 2015 Text Reading in the Wild Competition , 2015, ArXiv.
[44] Palaiahnakote Shivakumara,et al. A robust arbitrary text detection system for natural scene images , 2014, Expert Syst. Appl..
[45] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[46] Andrew Zisserman,et al. Deep Features for Text Spotting , 2014, ECCV.
[47] Wenyu Liu,et al. A Unified Framework for Multioriented Text Detection and Recognition , 2014, IEEE Transactions on Image Processing.
[48] Wenyu Liu,et al. Strokelets: A Learned Multi-scale Representation for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[49] Andrew Zisserman,et al. Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition , 2014, ArXiv.
[50] Yingli Tian,et al. Scene Text Recognition in Mobile Applications by Character Descriptor and Structure Configuration , 2014, IEEE Transactions on Image Processing.
[51] Palaiahnakote Shivakumara,et al. Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.
[52] Jon Almazán,et al. ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.
[53] Jiřı́ Matas,et al. Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[54] Kai Wang,et al. End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.
[55] C. V. Jawahar,et al. Scene Text Recognition using Higher Order Language Priors , 2009, BMVC.
[56] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[57] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.