Isolated Telugu Palm leaf character recognition using Radon Transform — A novel approach

This paper deals with the Palm leaf character recognition (PLCR) using Radon Transform applied to Telugu Characters. A large collection of these Palm leaf characters are available in the classical Indian languages like Sanskrit, Tamil, Pali etc as well as in more modern languages like Telugu. Manuscripts on Palm leaves in India are the most unique collection for Centuries pertaining to wisdom and knowledge containing religious texts and treaties on a host of subjects such as art, medicine, astronomy, astrology, mathematics, law and music in various traditional and modern languages. The palm leaves are natural organic products and are therefore very susceptible to deterioration due to climatic factors (relative humidity, temperature), light and insects. Hence, preservation and digitization of these palm leaves/manuscripts is important. These characters on the palm leaf have the additional properties like depth (which is proportional to the pen pressure applied by the scriber), an added feature which can be gainfully exploited in PLCR. This paper explores how these 3D features can be extracted and how they can be gainfully used in the recognition and classification process using Radon Transform and Nearest Neighborhood Classifier (NNC). The best percentage of accuracy obtained in the proposed method is 93%.

[1]  Zhixin Shi,et al.  Digital Image Enhancement using Normalization Techniques and their application to Palm Leaf Manuscripts , 2005 .

[2]  Bidyut Baran Chaudhuri,et al.  Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Hiroshi Murase,et al.  On-line handwriting recognition , 1992, Proc. IEEE.

[4]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  S. Noushath,et al.  Robust Unconstrained Handwritten Digit Recognition using Radon Transform , 2007, 2007 International Conference on Signal Processing, Communications and Networking.

[6]  J. Tsukumo,et al.  Classification of handprinted Chinese characters using nonlinear normalization and correlation methods , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[7]  Sargur N. Srihari,et al.  A System to Locate and Recognize ZIP Codes in Handwritten Addresses , 1989 .

[8]  G. Hemantha Kumar,et al.  Multilingual OCR system for South Indian scripts and English documents: An approach based on Fourier transform and principal component analysis , 2008, Eng. Appl. Artif. Intell..

[9]  T. Yorozu,et al.  Electron Spectroscopy Studies on Magneto-Optical Media and Plastic Substrate Interface , 1987, IEEE Translation Journal on Magnetics in Japan.

[10]  Hiromitsu Yamada,et al.  A nonlinear normalization method for handprinted kanji character recognition - line density equalization , 1990, Pattern Recognit..

[11]  Adnan Amin,et al.  Hand printed Arabic character recognition system , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[12]  Panyam Narahari Sastry,et al.  CLASSIFICATION AND IDENTIFICATION OF TELUGU HANDWRITTEN CHARACTERS EXTRACTED FROM PALM LEAVES USING DECISION TREE APPROACH , 2010 .