Segmentation and Recognition of Handwritten Kannada Text Using Relevance Feedback and Histogram of Oriented Gradients – A Novel Approach

India is a multilingual country with 22 official languages and more than 1600 languages in existence. Kannada is one of the official languages and widely used in the state of Karnataka whose population is over 65 million. Kannada is one of the south Indian languages and it stands in the 33rd position among the list of widely spoken languages across the world. However, the survey reveals that much more effort is required to develop a complete Optical Character Recognition (OCR) system. In this direction the present research work throws light on the development of suitable methodology to achieve the goal of developing an OCR. It is noted that the overall accuracy of the OCR system largely depends on the accuracy of the segmentation phase. So it is desirable to have a robust and efficient segmentation method. In this paper, a method has been proposed for proper segmentation of the text to improve the performance of OCR at the later stages. In the proposed method, the segmentation has been done using horizontal projection profile and windowing. The result obtained is passed to the recognition module. The Histogram of Oriented Gradient (HoG) is used for the recognition in combination with the support vector machine (SVM). The result is taken as the feedback and fed to the segmentation module to improve the accuracy. The experimentation is delivered promising results.

[1]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  R. D. Sudhaker Samuel,et al.  SCRIPT INDEPENDENT HANDWRITTEN NUMERAL RECOGNITION , 2006 .

[3]  Sargur N. Srihari,et al.  A statistical approach to line segmentation in handwritten documents , 2007, Electronic Imaging.

[4]  S. Noushath,et al.  Robust Unconstrained Handwritten Digit Recognition using Radon Transform , 2007, 2007 International Conference on Signal Processing, Communications and Networking.

[5]  David Doermann,et al.  A New Algorithm for Detecting Text Line in Handwritten Documents , 2006 .

[6]  Atul Negi,et al.  Using Fringe Maps for Text Line Segmentation in Printed or Handwritten Document Images , 2010, 2010 Second Vaagdevi International Conference on Information Technology for Real World Problems.

[7]  R MamathaH,et al.  Morphological Operations and Projection Profiles based Segmentation of Handwritten Kannada Document , 2012 .

[8]  G. Lorette,et al.  Advances in Handwriting and Drawing: a multidisciplinary approach , 1994 .

[9]  V. N. Manjunath Aradhya,et al.  Text line segmentation of unconstrained handwritten Kannada script , 2011, ICCCS '11.

[10]  Srikantamurthy Krishnamurthy,et al.  Skew Detection, Correction and Segmentation of Handwritten Kannada Document , 2012 .

[11]  Alireza Alaei,et al.  A Benchmark Kannada Handwritten Document Dataset and Its Segmentation , 2011, 2011 International Conference on Document Analysis and Recognition.

[12]  C. V. Jawahar,et al.  Learning Segmentation of Documents with Complex Scripts , 2006, ICVGIP.

[13]  Karamjit Bhatia,et al.  Combined horizontal and vertical projection feature extraction technique for Gurmukhi handwritten character recognition , 2015, 2015 International Conference on Advances in Computer Engineering and Applications.

[14]  Naresh Kumar Garg,et al.  A New Method for Line Segmentation of Handwritten Hindi Text , 2010, 2010 Seventh International Conference on Information Technology: New Generations.

[15]  Carlos A. B. Mello,et al.  Segmentation of connected handwritten digits using Self-Organizing Maps , 2013, Expert Syst. Appl..

[16]  Noorzaily Mohamed Noor Off-line Handwriting Text Line Segmentation : A Review , 2008 .

[17]  Laurence Likforman-Sulem,et al.  A Hough based algorithm for extracting text lines in handwritten documents , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[18]  Amit Choudhary,et al.  A New Character Segmentation Approach for Off-Line Cursive Handwritten Words , 2013, ITQM.