A novel approach for Kannada text extraction

Popularity of the digital cameras is increasing rapidly day by day because of advanced applications and availability of digital cameras. The detection and extraction of text regions in an image is a well known problem in the computer vision. Text in images contains useful semantic information which can be used to fully understand the images. Proposed method aims at detecting and extracting Kannada text from government organization signboard images acquired by digital camera. Segmentation is performed using edge detection method and heuristic features are used to remove the non text regions. Kannada text identification is performed using the structural feature boundary length of the object strokes. Rule based method is employed to validate the objects as Kannada text. The proposed method is effective, efficient and encouraging results are obtained. It has the precision rate of 84.21%, recall rate of 83.16% and Kannada text identification accuracy of 75.77%. Hence proposed method is robust with font size, small orientation and alignment of text.

[1]  Rafael C. González,et al.  Digital image processing using MATLAB , 2006 .

[2]  Jin Hyung Kim,et al.  2009 10th International Conference on Document Analysis and Recognition Scene Text Extraction using Focus of Mobile Camera , 2022 .

[3]  Matthieu Cord,et al.  TEXT EXTRACTION FROM STREET LEVEL IMAGES , 2009 .

[4]  Ujjwal Bhattacharya,et al.  Devanagari and Bangla Text Extraction from Natural Scene Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5]  V. Kumar,et al.  FLD Based Unconstrained Handwritten Kannada Character Recognition , 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia.

[6]  Chew Lim Tan,et al.  Text extraction from gray scale document images using edge information , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[7]  Nobuo Ezaki,et al.  Improved text-detection methods for a camera-based text reading system for blind persons , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[8]  Lionel Prevost,et al.  2009 10th International Conference on Document Analysis and Recognition Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm , 2022 .

[9]  Jagath Samarabandu,et al.  Multiscale Edge-Based Text Extraction from Complex Images , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[10]  Jonghyun Park,et al.  Korean Text Detection and Binarization in Color Signboards , 2008, 2008 International Conference on Advanced Language Processing and Web Information Technology.

[11]  Songfeng Lu,et al.  A density-based approach for text extraction in images , 2008, 2008 19th International Conference on Pattern Recognition.

[12]  R. L. Pratap,et al.  Middle Zone Component Extraction and Recognition of Telugu Document Image , 2007 .

[13]  L. Pratap Reddy,et al.  Middle Zone Component Extraction and Recognition of Telugu Document Image , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[14]  S. Angadi,et al.  A Texture Based Methodology for Text Region Extraction from Low Resolution Natural Scene Images , 2009 .