BESAC: Binary External Symmetry Axis Constellation for unconstrained handwritten character recognition

Abstract We propose a novel perception driven feature extraction called binary external symmetry axis constellation (BESAC) and a fast Boolean matching character recognition technique. A constellation model using a set of external symmetry axes which are perceptually significant can uniquely represent a handwritten character pattern. This model generates two histograms of orientations that are binary coded and concatenated to produce the proposed BESAC feature. A two stage classification strategy is adopted where a parallel Hamming Distance dissimilarity matching is performed on the extracted BESAC feature to achieve fast recognition along with perceptual closure part detection on look-alike characters. We adopt a 10-fold cross validation strategy to evaluate the performance of our algorithm on two major Indian languages, Bangla and Odia with four benchmark databases (ISI Kolkata Bangla numeral, ISI Kolkata Odia and IITBBS Odia numeral, and a newly created IITBBS Odia character database). The average time for classifying an unknown handwritten character is reported to be significantly less than the existing methods. The average recognition accuracy using the proposed approach is proved to outperform the state-of-the-art accuracy results on ISI Kolkata Odia numeral database (99.35%), IITBBS Odia numeral (98.9%), ISI Kolkata Bangla numeral database (99.48%) and IITBBS Odia character (95.01%) database.

[1]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[2]  Ujjwal Bhattacharya,et al.  An HMM Based Recognition Scheme for Handwritten Oriya Numerals , 2006, 9th International Conference on Information Technology (ICIT'06).

[3]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Donald D. Hoffman,et al.  Parts of recognition , 1984, Cognition.

[5]  Umapada Pal,et al.  Offline Recognition of Devanagari Script: A Survey , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  Ching Y. Suen,et al.  A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters , 2009, Pattern Recognit..

[7]  P. Nagabhushan,et al.  Modified region decomposition method and optimal depth decision tree in the recognition of non-uniform sized characters - An experimentation with Kannada characters , 1999, Pattern Recognit. Lett..

[8]  Sebastiano Impedovo,et al.  More than twenty years of advancements on Frontiers in handwriting recognition , 2014, Pattern Recognit..

[9]  Ganapati Panda,et al.  On extraction of features for handwritten Odia numeral recognition in transformed domain , 2015, 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR).

[10]  Binu P. Chacko,et al.  Handwritten character recognition using wavelet energy and extreme learning machine , 2012, Int. J. Mach. Learn. Cybern..

[11]  Nafiz Arica,et al.  An overview of character recognition focused on off-line handwriting , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[12]  Song Wang,et al.  Two perceptually motivated strategies for shape classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Ganapati Panda,et al.  Gestalt configural superiority effect: A complexity paradigm for handwritten numeral recognition , 2015, 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR).

[14]  Ching Y. Suen,et al.  Hierarchical attributed graph representation and recognition of handwritten chinese characters , 1991, Pattern Recognit..

[15]  Apurva A. Desai,et al.  Gujarati handwritten numeral optical character reorganization through neural network , 2010, Pattern Recognit..

[16]  Feng Tian,et al.  Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Jianmin Jiang,et al.  Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking , 2011, Pattern Recognit. Lett..

[18]  Swati Nigam,et al.  Multifont Oriya Character Recognition Using Curvelet Transform , 2011, ICIS 2011.

[19]  James R. Pomerantz,et al.  Perceptual Organization in Information Processing , 2017 .

[20]  Tetsushi Wakabayashi,et al.  A System for Off-Line Oriya Handwritten Character Recognition Using Curvature Feature , 2007 .

[21]  Hongyuan Wang,et al.  Skeleton growing and pruning with bending potential ratio , 2011, Pattern Recognit..

[22]  Fatos T. Yarman-Vural,et al.  BAS: a perceptual shape descriptor based on the beam angle statistics , 2003, Pattern Recognit. Lett..

[23]  Ganapati Panda,et al.  Handwritten numeral recognition using non-redundant Stockwell transform and bio-inspired optimal zoning , 2015, IET Image Process..

[24]  Fei Yin,et al.  Online and offline handwritten Chinese character recognition: Benchmarking on new databases , 2013, Pattern Recognit..

[25]  Longin Jan Latecki,et al.  Shape Similarity Measure Based on Correspondence of Visual Parts , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Bidyut Baran Chaudhuri,et al.  Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Bidyut Baran Chaudhuri,et al.  Databases for research on recognition of handwritten characters of Indian scripts , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[28]  SOUMEN BAG,et al.  A survey on optical character recognition for Bangla and Devanagari scripts , 2013, Sadhana.

[29]  Tetsushi Wakabayashi,et al.  Handwritten Character Recognition of Popular South Indian Scripts , 2006, SACH.

[30]  U Pal,et al.  A Complete System for Bangla Handwritten Numeral Recognition , 2006 .

[31]  Wenyu Liu,et al.  Skeleton Pruning by Contour Partitioning with Discrete Curve Evolution , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Yves Lecourtier,et al.  A structural/statistical feature based vector for handwritten character recognition , 1998, Pattern Recognit. Lett..

[33]  Tetsushi Wakabayashi,et al.  Handwritten Numeral Recognition of Six Popular Indian Scripts , 2007 .

[34]  Bhabatosh Chanda,et al.  Off-line Recognition of Hand-Written Bengali Numerals Using Morphological Features , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[35]  Ying Wen,et al.  A classifier for Bangla handwritten numeral recognition , 2012, Expert Syst. Appl..

[36]  John Daugman How iris recognition works , 2004 .

[37]  Subhadip Basu,et al.  A novel framework for automatic sorting of postal documents with multi-script address blocks , 2010, Pattern Recognit..

[38]  C. Chaudhuri,et al.  A Study on the Effect of Varying Training set Sizes on Recognition Performance with Handwritten Bangla Numerals , 2005, 2005 Annual IEEE India Conference - Indicon.

[39]  Donald D. Hoffman,et al.  Salience of visual parts , 1997, Cognition.

[40]  Abdel Belaïd,et al.  A System for Bangla Handwritten Numeral Recognition , 2002 .

[41]  N. Shanthi,et al.  A novel SVM-based handwritten Tamil character recognition system , 2010, Pattern Analysis and Applications.

[42]  Pengfei Shi,et al.  Handwritten Bangla numeral recognition system and its application to postal automation , 2007, Pattern Recognit..

[43]  Fumitaka Kimura,et al.  Oriya handwritten numeral recognition system , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).