A new method to extract Uighur text region in images

In view of Uighur text characteristics being stable and its background being changeable and complex in Video Frames, the thought of target attention is used to solve the problem of detecting Uighur text region in image. First, a method of color reduction is adopted to remove color gradient of text region. Based on this, suspected Uighur character regions are selected out of images with complex background by features of Uighur character region(focused goal 1), in the primaries, which are many small connected regions and are served as suspected samples. Second, a group of characters reflecting the relations (focused goal 2) between adjacent regions are extracted and one of characters is leading. Then every character' information is combined into the leading character and a progressive cluster algorithm based on leading character is proposed. Finally, Uighur text regions in images are obtained with the aid of layer features (focused goal 3) of Uighur text region. Experiments show that the region of Uighur's scene text and caption text on complex background can be segmented successfully. The accuracy rate is 98% and false alarm rate is lower than 14%.

[1]  Ying Zhang,et al.  An image-based automatic Arabic translation system , 2009, Pattern Recognit..

[2]  Chunheng Wang,et al.  Text detection in images based on unsupervised classification of edge-based features , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[3]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Jin Jian-ming Printed Uyghur Texts Segmentation , 2005 .

[5]  Alberto Del Bimbo,et al.  Automatic caption localization in videos using salient points , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[6]  Jonas Andersson,et al.  On-line Arabic handwriting recognition with templates , 2009, Pattern Recognit..

[7]  Michael Spann,et al.  Segmentation and recognition of Arabic characters by structural classification , 1997, Image Vis. Comput..

[8]  W. A. Coberly,et al.  Linear dimension reduction and Bayes classification with unknown population parameters , 1982, Pattern Recognit..

[9]  Gérard G. Medioni,et al.  Text segmentation in color images using tensor voting , 2007, Image Vis. Comput..

[10]  Jagath Samarabandu,et al.  Multiscale Edge-Based Text Extraction from Complex Images , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[11]  Huadong Ma,et al.  Automatic Detection and Localization of Natural Scene Text in Video , 2010, 2010 20th International Conference on Pattern Recognition.