Text Detection Based on Affine Transformation

Text detection and recognition plays an important roles in many computer vision based systems, since text can provide explicit content information. In natural scene, variations of scale, rotation and position are the main challenges for text detection and recognition algorithms. Thus, rectified text region is required for most text recognition algorithm. In this paper, we proposed a text detection method which can provide accurate text region. With the external quadrilateral of text region, the affine parameters can be estimated. Consequently, the distorted text region can be rectified according to the affine parameters. The proposed method can provide more accurate detection result for text region. In addition, it can enhance the performance of text recognition. The experiments show the effectiveness of the proposed method.

[1]  Anil K. Jain,et al.  FVC2000: Fingerprint Verification Competition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[3]  Xiaoyue Jiang,et al.  Fast Chinese character detection from complex scenes , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[4]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ming Zhao,et al.  Text detection in images using sparse representation with discriminative dictionaries , 2010, Image Vis. Comput..

[6]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, CVPR 2004.

[8]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[9]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..