Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models

Currently, many researchers have paid more attention to identifying scene texts from the image with background interferences. This study aims to develop an App software system with text recognition on smartphones. Otsu edge detection is applied to binarize the image and to find the parameters (i.e. weights) in a K -cluster. The modified K -cluster algorithm is used to detect the text from an image. The noise in complex background is also filtered out. The detected text gradients are evaluated by histogram of gradient. Accordingly, the distribution of the detected text gradients is generated. Finally, the gradient distribution is utilized by hidden Markov models to recognize the text. The experimental results have shown that the proposed approach can successfully outperform other methods.

[1]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[3]  Jean-Marc Odobez,et al.  Robust video text segmentation and recognition with multiple hypotheses , 2002, Proceedings. International Conference on Image Processing.

[4]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[5]  Brian C. Lovell,et al.  Improved estimation of hidden Markov model parameters from multiple observation sequences , 2002, Object recognition supported by user interaction for service robots.

[6]  Paul M. Baggenstoss A modified Baum-Welch algorithm for hidden Markov models with multiple observation spaces , 2001, IEEE Trans. Speech Audio Process..

[7]  Wenjie Fan,et al.  Image Recognition Technology Based on Deep Learning , 2018, Wirel. Pers. Commun..

[8]  Shih-Fu Chang,et al.  A Bayesian framework for fusing multiple word knowledge models in videotext recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[10]  Sabu Emmanuel,et al.  Introduction to Machine Learning , 2019, Machine Learning Approaches in Cyber Security Analytics.

[11]  Rong Huang,et al.  Scene character detection and recognition based on multiple hypotheses framework , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[12]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, CVPR 2004.

[15]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[16]  Jong Yun Lee,et al.  Algorithm of a Perspective Transform-Based PDF417 Barcode Recognition , 2016, Wirel. Pers. Commun..

[17]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[18]  Eun-Mi Park,et al.  A Study on Financing Security for Smartphones Using Text Mining , 2018, Wirel. Pers. Commun..

[19]  Robert Sabourin,et al.  Recognition and verification of unconstrained handwritten words , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Chunheng Wang,et al.  Scene text recognition by learning co-occurrence of strokes based on spatiality embedded dictionary , 2015, IET Comput. Vis..

[22]  Kai Wang,et al.  End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[23]  Atul Negi,et al.  Two-stage hybrid binarization around fringe map based text line segmentation for document images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).