A new wavelet-Laplacian method for arbitrarily-oriented character segmentation in video text lines

Character segmentation is an important topic to improve the overall performance of text recognition methods due to low resolution, complex background and lots of visual variations in video. This paper presents a novel idea for segmenting characters from arbitrarily-oriented text lines based on wavelet and Laplacian combination. Firstly, we explore wavelet which decomposes a given input image into sub-levels like a pyramid structure for segmenting words based on the fact that as decomposition level increases, the gap between characters decreases due to the reduction in the size of the input image, which results in a single component for each word. Secondly, for each segmented word, we propose Laplacian wavelet combination in a new way to extract text candidates. Thirdly, we propose horizontal and vertical sampling for character segmentation from words. The proposed method is tested on curved, non-horizontal and horizontal text lines of video and the ICDAR 2005 natural scene dataset to evaluate its performance. A comparative study with an existing method shows that the proposed method outperforms it in terms of precision and f-measure.

[1]  Xian-Sheng Hua,et al.  An automatic performance evaluation protocol for video text detection algorithms , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Shijian Lu,et al.  Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Palaiahnakote Shivakumara,et al.  A New Method for Character Segmentation from Multi-oriented Video Words , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[5]  Chew Lim Tan,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[6]  Palaiahnakote Shivakumara,et al.  Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[7]  Chunheng Wang,et al.  Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[9]  Jing Zhang,et al.  A Novel Text Detection System Based on Character and Link Energies , 2014, IEEE Transactions on Image Processing.

[10]  Palaiahnakote Shivakumara,et al.  A Gradient Vector Flow-Based Method for Video Character Segmentation , 2011, 2011 International Conference on Document Analysis and Recognition.

[11]  Palaiahnakote Shivakumara,et al.  A New Laplacian Method for Arbitrarily-Oriented Word Segmentation in Video , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[12]  Shijian Lu,et al.  A New Fourier-Moments Based Video Word and Character Extraction Method for Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[13]  Wenyu Liu,et al.  A Unified Framework for Multioriented Text Detection and Recognition , 2014, IEEE Transactions on Image Processing.

[14]  C. V. Jawahar,et al.  Top-down and bottom-up cues for scene text recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Palaiahnakote Shivakumara,et al.  A New Method for Word Segmentation from Arbitrarily-Oriented Video Text Lines , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).