论文信息 - A novel ring radius transform for video character reconstruction

A novel ring radius transform for video character reconstruction

Character recognition in video is a challenging task because low resolution and complex background of video cause disconnections, loss of information, loss of shapes of the characters etc. In this paper, we introduce a novel ring radius transform (RRT) and the concept of medial pixels on characters with broken contours in the edge domain for reconstruction. For each pixel, the RRT assigns a value which is the distance to the nearest edge pixel. The medial pixels are those which have the maximum radius values in their neighborhood. We demonstrate the application of these concepts in the problem of character reconstruction to improve the character recognition rate in video images. With ring radius transform and medial pixels, our approach exploits the symmetry information between the inner and outer contours of a broken character to reconstruct the gaps. Experimental results and comparison with two existing methods show that the proposed method outperforms the existing methods in terms of measures such as relative error and character recognition rate.

Palaiahnakote Shivakumara | Umapada Pal | Chew Lim Tan | Trung Quy Phan | Souvik Bhowmick

[1] Hyung Jeong Yang,et al. Automatic detection and recognition of Korean text in outdoor signboard images , 2010, Pattern Recognit. Lett..

[2] Palaiahnakote Shivakumara,et al. A Gradient Vector Flow-Based Method for Video Character Segmentation , 2011, 2011 International Conference on Document Analysis and Recognition.

[3] Jean-Michel Jolion,et al. Extraction and recognition of artificial text in multimedia documents , 2003, Formal Pattern Analysis & Applications.

[4] Chew Lim Tan,et al. Restoration of Archival Documents Using a Wavelet Technique , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Nadia Bali,et al. Automatic accurate broken character restoration for patrimonial documents , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[6] Bernt Schiele,et al. Analyzing appearance and contour based methods for object categorization , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[7] Jean-Marc Odobez,et al. Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[8] Hong Yan,et al. Mending broken handwriting with a macrostructure analysis method to improve recognition , 1999, Pattern Recognit. Lett..

[9] Jin Hyung Kim,et al. Complementary combination of holistic and component analysis for recognition of low-resolution video character images , 2008, Pattern Recognit. Lett..

[10] Z. Saidane,et al. Robust Binarization for Video Text Recognition , 2007 .

[11] David S. Doermann,et al. Binarization of low quality text using a Markov random field model , 2002, Object recognition supported by user interaction for service robots.

[12] Jean-Marc Odobez,et al. Video text recognition using sequential Monte Carlo and error voting methods , 2005, Pattern Recognit. Lett..

[13] Chew Lim Tan,et al. Character Recognition under Severe Perspective Distortion , 2008, 2009 10th International Conference on Document Analysis and Recognition.

[14] Hong Yan,et al. Reconstruction of broken handwritten digits based on structural morphological features , 2001, Pattern Recognit..

[15] P. Nagabhushan,et al. Incremental circle transform and eigenvalue analysis for object recognition: an integrated approach , 2000, Pattern Recognit. Lett..

[16] Hubert Emptoz,et al. Degraded character image restoration using active contours: a first approach , 2002, DocEng '02.

[17] H. S. Nagendraswamy,et al. Symbolic representation of two-dimensional shapes , 2007, Pattern Recognit. Lett..

[18] Hua Huang,et al. Probabilistic contour extraction using hierarchical shape representation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[19] Xinbo Gao,et al. A spatial-temporal approach for video caption detection and recognition , 2002, IEEE Trans. Neural Networks.

[20] Tao Zhang,et al. Automatic Video Text Localization and Recognition , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[21] Cheng-Lin Liu,et al. A Robust System to Detect and Localize Texts in Natural Scene Images , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[22] Li Linlin,et al. Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[23] Xilin Chen,et al. Automatic detection and recognition of signs from natural scenes , 2004, IEEE Transactions on Image Processing.

[24] David S. Doermann,et al. Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[25] Frédéric Bouchara,et al. Document Image Binarisation Using Markov Field Model , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[26] Chew Lim Tan,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[27] Yonatan Wexler,et al. Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28] Pheng-Ann Heng,et al. A double-threshold image binarization method based on edge detector , 2008, Pattern Recognit..

[29] Nicolai Petkov,et al. Robustness of shape descriptors to incomplete contour representations , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Cheng-Lin Liu,et al. Text Localization in Natural Scene Images Based on Conditional Random Field , 2009, 2009 10th International Conference on Document Analysis and Recognition.