论文信息 - Character shape restoration system through medial axis points in video

Character shape restoration system through medial axis points in video

Shape restoration for characters in video is challenging because natural scene characters usually suffer from low resolution, complex background and perspective distortion. In this paper, we propose histogram gradient division and reverse gradient orientation in a new way to select Text Pixel Candidates (TPC) for a given input character. We apply a ring radius transform on TPC in different directions, namely, horizontal, vertical, principal and secondary diagonals in a TPC image to obtain respective radius maps, where each pixel is assigned a value that is the radius to the nearest TPC. This helps in finding Medial Axis Points (MAP) by searching for the maximum radius values from their neighborhoods in a radius image. The union of all the medial axis points obtained from the respective directions at each location is considered as Candidate Medial Axis Points (CMAP) of the character. Then color difference and k-means clustering are proposed to eliminate false CMAP, which outputs Potential Medial Axis Points (PMAP). We finally propose a novel way to restore the shape of the character from the PMAP. The method is tested on a video dataset and the benchmark ICDAR 2013 dataset to show its effectiveness for complex background and low resolution. Experimental results show that the proposed method is superior to the existing methods in terms of shape restoration error and recognition rate.

[1] Kyung-Joong Kim,et al. Design of a visual perception model with edge-adaptive Gabor filter and support vector machine for traffic sign detection , 2013, Expert Syst. Appl..

[2] Erik G. Learned-Miller,et al. Improving Open-Vocabulary Scene Text Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[3] Yonatan Wexler,et al. Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4] Palaiahnakote Shivakumara,et al. Scene Character Reconstruction through Medial Axis , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[5] Jean-Marc Odobez,et al. Video text recognition using sequential Monte Carlo and error voting methods , 2005, Pattern Recognit. Lett..

[6] Toru Wakahara,et al. Binarization and Recognition of Degraded Characters Using a Maximum Separability Axis in Color Space and GAT Correlation , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[7] Z. Saidane,et al. Robust Binarization for Video Text Recognition , 2007 .

[8] Li Linlin,et al. Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[9] Frédo Durand,et al. Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[10] Jin Wang,et al. Segmentation of merged characters by neural networks and shortest path , 1994, Pattern Recognit..

[11] Ioannis Pratikakis,et al. ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12] Xinbo Gao,et al. Detection and recognition of text superimposed in images base on layered method , 2014, Neurocomputing.

[13] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.

[14] Zhuowen Tu,et al. Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[15] Bernard Gosselin,et al. Color text extraction with selective metric-based clustering , 2007, Comput. Vis. Image Underst..

[16] Ioannis Pratikakis,et al. A combined approach for the binarization of handwritten document images , 2014, Pattern Recognit. Lett..

[17] David S. Doermann,et al. Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18] N. Otsu. A threshold selection method from gray level histograms , 1979 .

[19] Chang Hong Lin,et al. A robust video text detection approach using SVM , 2012, Expert Syst. Appl..

[20] Mohamed Cheriet,et al. A learning framework for the optimization and automation of document binarization methods , 2013, Comput. Vis. Image Underst..

[21] Shijian Lu,et al. Camera Text Recognition based on Perspective Invariants , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[22] Cheng-Lin Liu,et al. A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[23] Palaiahnakote Shivakumara,et al. Wavelet-gradient-fusion for video text binarization , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[24] Jun Guo,et al. Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[25] Khairuddin Omar,et al. An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows , 2011, Pattern Recognit. Lett..

[26] Anil K. Jain,et al. Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[27] Chew Lim Tan,et al. Character Recognition under Severe Perspective Distortion , 2008, 2009 10th International Conference on Document Analysis and Recognition.

[28] Palaiahnakote Shivakumara,et al. A Gradient Vector Flow-Based Method for Video Character Segmentation , 2011, 2011 International Conference on Document Analysis and Recognition.

[29] Alan L. Yuille,et al. Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[30] Toru Wakahara,et al. Binarization of Color Characters in Scene Images Using k-means Clustering and Support Vector Machines , 2010, 2010 20th International Conference on Pattern Recognition.

[31] Shijian Lu,et al. New Spatial-Gradient-Features for Video Script Identification , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[32] Jing Zhang,et al. Extraction of Text Objects in Video Documents: Recent Progress , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[33] Jon Almazán,et al. ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[34] Jiangtao Wen,et al. A new binarization method for non-uniform illuminated document images , 2013, Pattern Recognit..

[35] Mohamed Cheriet,et al. A multi-scale framework for adaptive binarization of degraded document images , 2010, Pattern Recognit..

[36] Palaiahnakote Shivakumara,et al. A robust arbitrary text detection system for natural scene images , 2014, Expert Syst. Appl..

[37] Jürgen Beyerer,et al. Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation , 2013, Expert Syst. Appl..

[38] Umapada Pal,et al. Recent Advances in Video Based Document Processing: A Review , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[39] Xinbo Gao,et al. A spatial-temporal approach for video caption detection and recognition , 2002, IEEE Trans. Neural Networks.

[40] Pheng-Ann Heng,et al. A double-threshold image binarization method based on edge detector , 2008, Pattern Recognit..

[41] Harald Sack,et al. A skeleton based binarization approach for video text recognition , 2012, 2012 13th International Workshop on Image Analysis for Multimedia Interactive Services.

[42] Shijian Lu,et al. Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[43] C. V. Jawahar,et al. An MRF Model for Binarization of Natural Scene Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[44] Wayne Niblack,et al. An introduction to digital image processing , 1986 .

[45] Simon M. Lucas,et al. ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[46] Rui Wang,et al. Scene Text Segmentation via Inverse Rendering , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[47] Mohamed Cheriet,et al. AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization , 2012, Pattern Recognit..

[48] Ioannis Pratikakis,et al. Adaptive degraded document image binarization , 2006, Pattern Recognit..

[49] Jean-Marc Odobez,et al. Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[50] Palaiahnakote Shivakumara,et al. A novel ring radius transform for video character reconstruction , 2013, Pattern Recognit..

[51] Lei Huang,et al. A Novel Method for Embedded Text Segmentation Based on Stroke and Color , 2011, 2011 International Conference on Document Analysis and Recognition.

[52] Palaiahnakote Shivakumara,et al. A new Iterative-Midpoint-Method for video character gap filling , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).