Character Segmentation and Recognition

This chapter presents methods for character segmentation from text lines and recognition of video characters. It is noted that character segmentation from video text lines detected by video text detection method is not as easy as segmenting characters from scanned document images due to low resolution and complex background of video. This chapter presents a method for word segmentation based on the combination of Fourier and moments. Then, the segmented words are used for character segmentation using top and bottom profile features of the words. This chapter also presents a method which does not require words for character segmentation. Instead, it segments character from text lines directly by exploring gradient vector flow (GVF) for identifying the space between words. Further, this chapter introduces a recognition method without the use of an OCR engine. The method proposes structural features based on eight-directional sectors to facilitate character recognition y calculating representatives for each class of the characters.

[1]  Jean-Marc Odobez,et al.  Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[2]  Minoru Mori Video text recognition using feature compensation as category-dependent feature extraction , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[3]  Li Linlin,et al.  Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[4]  Jean-Marc Odobez,et al.  Video text recognition using sequential Monte Carlo and error voting methods , 2005, Pattern Recognit. Lett..

[5]  Jin Wang,et al.  Segmentation of merged characters by neural networks and shortest-path , 1993, SAC '93.

[6]  Z. Saidane,et al.  Robust Binarization for Video Text Recognition , 2007 .

[7]  Wonjun Kim,et al.  A New Approach for Overlay Text Detection and Extraction From Complex Video Scene , 2009, IEEE Transactions on Image Processing.

[8]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[9]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[10]  He Zhang,et al.  A new video text extraction approach , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[11]  M. Pauline Baker,et al.  Computer Graphics, C Version , 1996 .

[12]  Wen Gao,et al.  A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[13]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[14]  Jing Zhang,et al.  Extraction of Text Objects in Video Documents: Recent Progress , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[15]  Jerry L. Prince,et al.  Snakes, shapes, and gradient vector flow , 1998, IEEE Trans. Image Process..

[16]  Jin Hyung Kim,et al.  Complementary combination of holistic and component analysis for recognition of low-resolution video character images , 2008, Pattern Recognit. Lett..

[17]  Chew Lim Tan,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[18]  David S. Doermann,et al.  Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[19]  Jean-Michel Jolion,et al.  Extraction and recognition of artificial text in multimedia documents , 2003, Formal Pattern Analysis & Applications.

[20]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[21]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[22]  W. Effelsberg,et al.  Robust Character Recognition in Low-Resolution Images and Videos , 2005 .

[23]  Keechul Jung,et al.  Neural network-based text location in color images , 2001, Pattern Recognit. Lett..

[24]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[25]  Xinbo Gao,et al.  A spatial-temporal approach for video caption detection and recognition , 2002, IEEE Trans. Neural Networks.

[26]  Evangelos A. Yfantis,et al.  An OCR-independent character segmentation using shortest-path in grayscale document images , 2007, ICMLA 2007.

[27]  Deepu Rajan,et al.  Image classification: Are rule-based systems effective when classes are fixed and known? , 2008, 2008 19th International Conference on Pattern Recognition.

[28]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..