Handwritten character recognition through two-stage foreground sub-sampling

In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two-stage classification scheme based on the level of granularity of the feature extraction method. Classes with high values in the confusion matrix are merged at a certain level and for each group of merged classes, granularity features from the level that best distinguishes them are employed. Two handwritten character databases (CEDAR and CIL) as well as two handwritten digit databases (MNIST and CEDAR) were used in order to demonstrate the effectiveness of the proposed technique. The recognition result achieved, in comparison to the ones reported in the literature, is the highest for the well-known CEDAR Character Database (94.73%) and among the best for the MNIST Database (99.03%)

[1]  Venu Govindaraju,et al.  OCR in a Hierarchical Feature Space , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Hanan Samet,et al.  The Design and Analysis of Spatial Data Structures , 1989 .

[3]  Jung-Hsien Chiang,et al.  Handwritten word recognition with character and inter-character neural networks , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[4]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[5]  Flávio Bortolozzi,et al.  Foreground and background information in an HMM-based method for recognition of isolated characters and numeral strings , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[6]  I. Pratikakis,et al.  Greek Handwritten Character Recognition , 2007 .

[7]  Jianchang Mao,et al.  A comparative study of different classifiers for handprinted character recognition , 1994 .

[8]  Hiroshi Yamada,et al.  Cursive handwritten word recognition using multiple segmentation determined by contour analysis , 1996 .

[9]  Louis Vuurpijl,et al.  Architectures for detecting and solving conflicts: two-stage classification and support vector classifiers , 2003, Document Analysis and Recognition.

[10]  Cheng-Lin Liu,et al.  Classification and Learning for Character Recognition: Comparison of Methods and Remaining Problems , 2005 .

[11]  M. Blumenstein,et al.  A modified direction feature for cursive character recognition , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[12]  Ching Y. Suen,et al.  HYBRID SCHEMES OF HOMOGENEOUS AND HETEROGENEOUS CLASSIFIERS FOR CURSIVE WORD RECOGNITION , 2004 .

[13]  Anandarup Roy,et al.  SVM-based hierarchical architectures for handwritten Bangla character recognition , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[14]  Character representation and recognition using quad tree-based fractal encoding scheme , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[15]  Alessandro L. Koerich Unconstrained handwritten character recognition using different classification strategies , 2003 .

[16]  Sameer Singh,et al.  Cursive digit and character recognition in CEDAR database , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[17]  Nafiz Arica,et al.  An overview of character recognition focused on off-line handwriting , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[18]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[19]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[20]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[21]  Anil K. Jain,et al.  Representation and Recognition of Handwritten Digits Using Deformable Templates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Luiz Eduardo Soares de Oliveira,et al.  Automatic Recognition of Handwritten Numerical Strings: A Recognition and Verification Strategy , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Basilios Gatos,et al.  An Efficient Feature Extraction and Dimensionality Reduction Scheme for Isolated Greek Handwritten Character Recognition , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[24]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .

[25]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Francesco Camastra,et al.  Combining neural gas and learning vector quantization for cursive character recognition , 2003, Neurocomputing.

[28]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[30]  Volker Sorge,et al.  Database-Driven Mathematical Character Recognition , 2005, GREC.

[31]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[32]  Y. Miyake,et al.  Machine and human recognition of segmented characters from handwritten words , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[33]  John A. Fitzgerald,et al.  Application of fuzzy logic to online recognition of handwritten symbols , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.