Farsi/Arabic text extraction from video images by corner detection

Video text carries important information for semantic video analysis, indexing, and retrieval. In this paper, we propose a novel Farsi text detection approach based on the intrinsic characteristics of Farsi text lines, designed to be robust to complex backgrounds and varied font styles. First, an edge detection operator extracts candidate edges in the vertical, horizontal, 45°, and 135° directions. Next, to extract text strokes, morphological pre-processing such as dilation and erosion is applied according to the font size. A corner map is then built from the crossing points of the edges. Histogram analysis is used to discard non-text corners and to estimate the actual font size; the input image is then rescaled accordingly and a new corner map is extracted. Finally, the detected candidate text areas are filtered with empirical rules to identify text areas, and projection profile analysis is applied for verification and text line extraction. Experimental results demonstrate that the proposed method is robust to font size, font colour, and background complexity.
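
The core of the pipeline (directional edges, dilation, corner map from edge crossing points, projection profile) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not name the edge operator, so the Sobel-like kernels below, the threshold `edge_thresh`, the dilation `radius`, and the function names `corner_map` and `row_runs` are all assumptions made for illustration. Each edge pixel is assigned to its strongest orientation, each orientation map is dilated, and a pixel counts as a corner when at least two orientations overlap there.

```python
import numpy as np

# Assumed 3x3 kernels for the four orientations (Sobel-like; the paper's
# actual edge operator is not specified in the abstract).
KERNELS = [
    np.array([[-1, -2, -1], [0, 0, 0], [1, 2, 1]], dtype=float),   # horizontal edges
    np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float),   # vertical edges
    np.array([[0, 1, 2], [-1, 0, 1], [-2, -1, 0]], dtype=float),   # one diagonal
    np.array([[-2, -1, 0], [-1, 0, 1], [0, 1, 2]], dtype=float),   # other diagonal
]


def correlate3x3(img, kernel):
    """Correlate a 2-D image with a 3x3 kernel, replicating border pixels."""
    h, w = img.shape
    padded = np.pad(img.astype(float), 1, mode="edge")
    out = np.zeros((h, w), dtype=float)
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def dilate(mask, radius=1):
    """Binary dilation with a (2*radius+1) square structuring element."""
    h, w = mask.shape
    padded = np.pad(mask, radius, mode="constant", constant_values=False)
    out = np.zeros_like(mask)
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out |= padded[dy:dy + h, dx:dx + w]
    return out


def corner_map(img, edge_thresh=100.0, radius=1):
    """Mark pixels where dilated edges of at least two orientations meet."""
    responses = np.stack([np.abs(correlate3x3(img, k)) for k in KERNELS])
    strongest = responses.argmax(axis=0)           # dominant orientation per pixel
    is_edge = responses.max(axis=0) > edge_thresh  # keep only strong responses
    maps = [dilate(is_edge & (strongest == i), radius) for i in range(len(KERNELS))]
    return np.sum(maps, axis=0) >= 2


def row_runs(mask, min_count=1):
    """Horizontal projection profile: runs of rows with enough marked pixels."""
    profile = mask.sum(axis=1)
    runs, start = [], None
    for y, count in enumerate(profile):
        if count >= min_count and start is None:
            start = y
        elif count < min_count and start is not None:
            runs.append((start, y - 1))
            start = None
    if start is not None:
        runs.append((start, len(profile) - 1))
    return runs


if __name__ == "__main__":
    # Synthetic test image: a bright rectangle on a dark background.
    image = np.zeros((20, 20))
    image[5:15, 5:15] = 255
    corners = corner_map(image)
    print(corners[5, 5], corners[10, 5])  # rectangle corner vs. middle of an edge
    print(row_runs(corners))              # candidate row bands containing corners
```

In a fuller pipeline, the row bands returned by `row_runs` would be the candidate text lines passed on to rule-based filtering; the font-size histogram analysis and rescaling steps are not covered by this sketch.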
