Font Acknowledgment and Character Extraction of Digital and Scanned Images

Font recognition and character extraction are of great importance because there are many scenarios in which data exist in a form that cannot be processed directly, such as an image or a hard copy. The procedure developed in this paper first identifies the font (Times New Roman, Arial, or Comic Sans MS) and then recovers the text using a simple correlation-based method in which binary templates are correlated with the text characters of the input image. The extraction is performed in the presence of a small amount of noise, since images may contain noisy patterns introduced by photocopying. The method is significant for extracting data from surveillance camera footage and similar sources. It is implemented in MATLAB; it takes an input image and writes the recovered text and font information to a text file.
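The correlation step described above can be illustrated with a minimal sketch. It is written in Python rather than MATLAB and is not the authors' implementation. It assumes the correlation measure is the 2-D Pearson correlation coefficient between a binarized character and each binary template, which the abstract does not specify. The font names `FontA` and `FontB`, the 3x5 templates, and the helpers `parse`, `correlation`, and `recognize` are all hypothetical and chosen only for illustration.

```python
from math import sqrt

def parse(rows):
    """Turn strings of '#' (ink) and '.' (background) into a binary grid."""
    return [[1 if c == "#" else 0 for c in row] for row in rows]

def correlation(a, b):
    """Pearson correlation coefficient between two equal-sized binary grids."""
    xs = [v for row in a for v in row]
    ys = [v for row in b for v in row]
    if len(xs) != len(ys):
        raise ValueError("glyph and template must have the same size")
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return num / den if den else 0.0  # constant grids carry no shape information

# Hypothetical template bank keyed by (font, character); a real system
# would hold one binary template per character for each supported font.
TEMPLATES = {
    ("FontA", "I"): parse(["###", ".#.", ".#.", ".#.", "###"]),
    ("FontB", "I"): parse([".#.", ".#.", ".#.", ".#.", ".#."]),
    ("FontA", "L"): parse(["#..", "#..", "#..", "#..", "###"]),
    ("FontB", "L"): parse(["#..", "#..", "#..", "#..", "##."]),
}

def recognize(glyph):
    """Return the best-matching (font, character) key and its correlation score."""
    best = max(TEMPLATES, key=lambda key: correlation(glyph, TEMPLATES[key]))
    return best, correlation(glyph, TEMPLATES[best])

if __name__ == "__main__":
    # A FontA "I" with one extra ink pixel, standing in for photocopy noise.
    noisy = parse(["###", ".#.", ".##", ".#.", "###"])
    print(recognize(noisy))
```

In this toy bank the noisy glyph still correlates most strongly with the `FontA` "I" template, which is the sense in which a correlation-based matcher tolerates a little noise. A practical version would also need segmentation of the image into characters and resizing of each character to the template size, neither of which is shown here.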
