Text Detection Through Hidden Markov Random Field and EM-Algorithm

Text is a dominant source of semantic information about the content of an image or video. Humans often give more importance to text than to any other object in an image or video frame. Text detection is one of the primary stages of the text information extraction process. It is an active and emerging research area in pattern recognition and computer vision, made challenging by complex backgrounds, varying illumination, and arbitrary text orientation. In this paper, the Hidden Markov Random Field (HMRF) method and the Expectation-Maximization (EM) algorithm are employed to detect arbitrarily oriented multilingual text in an image or video frame. The proposed method calculates the max-min cluster to maximize the discrimination between textual and non-textual regions. The HMRF separates the textual region, and the EM algorithm estimates the model parameters by maximizing their likelihood. A Laplacian of Gaussian process is used to identify potential text information, and the double line structure concept is employed to extract the true text regions. The proposed method is evaluated on Hua’s dataset, an arbitrarily oriented dataset, and a horizontal dataset, using recall, precision, and f-measure as performance measures. The results show that the approach is promising.
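The three performance measures named above can be illustrated with a minimal sketch. It assumes detections have already been matched against ground truth and reduced to counts of true positives, false positives, and false negatives; the matching criteria used in the paper are not given here, and the function name `detection_scores` is hypothetical.

```python
def detection_scores(true_positives: int, false_positives: int, false_negatives: int):
    """Return (recall, precision, f_measure) from detection counts.

    Assumption: counts come from some prior matching of detected text
    blocks to ground-truth text blocks; that matching step is not shown.
    """
    detected = true_positives + false_positives   # everything the detector reported
    actual = true_positives + false_negatives     # everything that is really text

    # Guard against empty denominators so the function never divides by zero.
    precision = true_positives / detected if detected else 0.0
    recall = true_positives / actual if actual else 0.0

    # The f-measure is the harmonic mean of precision and recall.
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return recall, precision, f_measure
```

For example, `detection_scores(8, 2, 2)` gives recall, precision, and f-measure all equal to 0.8 (up to floating-point rounding), since 8 of 10 detections are correct and 8 of 10 true text blocks are found.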
