Edge-Based Features for Localization of Artificial Urdu Text in Video Images

Content-based video indexing and retrieval has become an interesting research area with the tremendous growth in the amount of digital media. In addition to the audio-visual content, text appearing in videos can serve as a powerful tool for semantic indexing and retrieval of videos. This paper proposes a method based on edge-features for horizontally aligned artificial Urdu text detection from video images. The system exploits edge based segmentation to extract textual content from videos. We first find the vertical gradients in the input video image and average the gradient magnitude in a fixed neighborhood of each pixel. The resulting image is binarized and the horizontal run length smoothing algorithm (RLSA) is applied to merge possible text regions. An edge density filter is then applied to eliminate noisy non-text regions. Finally, the candidate regions satisfying certain geometrical constraints are accepted as text regions. The proposed approach evaluated on a data set of 150 video images exhibited promising results.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  S. Lucas,et al.  ICDAR 2003 robust reading competitions: entries, results, and future directions , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[3]  Edward M. Riseman,et al.  TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Simon M. Lucas,et al.  ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[5]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[6]  Yoshinobu Hotta,et al.  Text Detection in Images Based on Grayscale Decomposition and Stroke Extraction , 2009, 2009 Chinese Conference on Pattern Recognition.

[7]  Wen Gao,et al.  Coarse-to-fine video text detection , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[8]  Jean-Michel Jolion,et al.  Object count/area graphs for the evaluation of object detection and segmentation algorithms , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[9]  Chew Lim Tan,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[10]  Palaiahnakote Shivakumara,et al.  2009 10th International Conference on Document Analysis and Recognition A Gradient Difference based Technique for Video Text Detection , 2022 .

[11]  Rongrong Wang,et al.  A novel video caption detection approach using multi-frame integration , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[12]  Weiqiang Wang,et al.  Text Detection in Images Using Texture Feature from Strokes , 2006, PCM.

[13]  Ioannis Pratikakis,et al.  A Hybrid System for Text Detection in Video Frames , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[14]  Wayne Nilback An introduction to digital image processing , 1985 .

[15]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Yap-Peng Tan,et al.  Contrast adaptive binarization of low quality document images , 2004, IEICE Electron. Express.

[17]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[18]  S. Ranganath,et al.  Video-text extraction and recognition , 2004, 2004 IEEE Region 10 Conference TENCON 2004..

[19]  Jean-Michel Jolion,et al.  Text localization, enhancement and binarization in multimedia documents , 2002, Object recognition supported by user interaction for service robots.

[20]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.