Segment confidence-based binary segmentation (SCBS) for cursive handwritten words

A novel segment confidence-based binary segmentation (SCBS) for cursive handwritten words is presented in this paper. SCBS is a character segmentation strategy for off-line cursive handwriting recognition. Unlike the approaches in the literature, SCBS is an unordered segmentation approach. SCBS is repetition of binary segmentation and fusion of segment confidence. Each repetition generates only one final segmentation point. The binary segmentation module is a contour tracing algorithm to find a segmentation path to divide a segment into two segments. A set of segments before binary segmentation is called pre-segments, and a set of segments after binary segmentation is called post-segments. SCBS uses over-segmentation technique to generate suspicious segmentation points on pre-segments. On each suspicious segmentation point, binary segmentation is performed and the highest fusion value is recorded. If the highest fusion value is greater than the one of pre-segments, the suspicious segmentation point becomes the final segmentation point for the iteration. If not, no more segmentation is required. Segment confidence is obtained by fusing mean character, lexical and shape confidences. The proposed approach has been evaluated on local and benchmark (CEDAR) databases.

[1]  María José Castro Bleda,et al.  Holistic cursive word recognition based on perceptual features , 2007, Pattern Recognit. Lett..

[2]  Michael S. Brown,et al.  A unified framework for document restoration using inpainting and shape-from-shading , 2009, Pattern Recognit..

[3]  Brijesh Verma,et al.  A novel multiple experts and fusion based segmentation algorithm for cursive handwriting recognition , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[4]  Umapada Pal,et al.  Handwriting segmentation of unconstrained Oriya text , 2006 .

[5]  Shuyan Zhao,et al.  Two-stage segmentation of unconstrained handwritten Chinese character , 2003, Pattern Recognit..

[6]  Nafiz Arica,et al.  An overview of character recognition focused on off-line handwriting , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[7]  Berrin A. Yanikoglu,et al.  Segmentation of off-line cursive handwriting using linear programming , 1998, Pattern Recognit..

[8]  Alessandro Vinciarelli,et al.  A survey on off-line Cursive Word Recognition , 2002, Pattern Recognit..

[9]  Ching Y. Suen,et al.  A genetic framework using contextual knowledge for segmentation and recognition of handwritten numeral strings , 2007, Pattern Recognit..

[10]  Fatos T. Yarman-Vural,et al.  Optical Character Recognition for Cursive Handwriting , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ashraf Elnagar,et al.  Segmentation of connected handwritten numeral strings , 2003, Pattern Recognit..

[12]  Venu Govindaraju,et al.  Holistic recognition of handwritten character pairs , 2000, Pattern Recognit..

[13]  Pengfei Shi,et al.  A metasynthetic approach for segmenting handwritten Chinese character strings , 2005, Pattern Recognit. Lett..

[14]  Cheng-Lin Liu,et al.  Handwritten digit recognition: benchmarking of state-of-the-art techniques , 2003, Pattern Recognit..

[15]  Luiz Eduardo Soares de Oliveira,et al.  Filtering segmentation cuts for digit string recognition , 2008, Pattern Recognit..

[16]  Jin Hyung Kim,et al.  Complementary combination of holistic and component analysis for recognition of low-resolution video character images , 2008, Pattern Recognit. Lett..

[17]  R Plamondon,et al.  Studying the variability of handwriting patterns using the Kinematic Theory. , 2009, Human movement science.

[18]  Jung-Hsien Chiang,et al.  A hybrid neural network model in handwritten word recognition , 1998, Neural Networks.

[19]  Mokhtar Sellami,et al.  Semi-continuous HMMs with explicit state duration for unconstrained Arabic word modeling and recognition , 2008, Pattern Recognit. Lett..

[20]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Alexander V. Shafarenko,et al.  Word segmentation of handwritten text using supervised classification techniques , 2007, Appl. Soft Comput..

[22]  Umapada Pal,et al.  Touching numeral segmentation using water reservoir concept , 2003, Pattern Recognit. Lett..

[23]  Pengfei Shi,et al.  A background-thinning-based approach for separating and recognizing connected handwritten digit strings , 1999, Pattern Recognit..

[24]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  S. Arumugam,et al.  Fuzzy technique based recognition of handwritten characters , 2003, Image Vis. Comput..

[26]  Zaher Al Aghbari,et al.  HAH manuscripts: A holistic paradigm for classifying and retrieving historical Arabic handwritten documents , 2009, Expert Syst. Appl..

[27]  Ashraf Elnagar,et al.  Multiagents to separating handwritten connected digits , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[28]  Brijesh Verma,et al.  Analysis of segmentation performance on the CEDAR benchmark database , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[29]  Shijian Lu,et al.  Retrieval of machine-printed Latin documents through Word Shape Coding , 2008, Pattern Recognit..

[30]  Hiroshi Kawakami,et al.  Morphological preprocessing method to thresholding degraded word images , 2009, Pattern Recognit. Lett..

[31]  Horst Bunke,et al.  HMM-based handwritten word recognition: on the optimization of the number of states, training iterations and Gaussian components , 2004, Pattern Recognit..

[32]  Hiromichi Fujisawa,et al.  Forty years of research in character and document recognition - an industrial perspective , 2008, Pattern Recognit..

[33]  Brijesh Verma A contour code feature based segmentation for handwriting recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[34]  Patrick Shen-Pei Wang,et al.  A Knowledge Based Segmentation Algorithm for Enhanced Recognition of Handwritten Courtesy Amounts , 2022 .

[35]  Husni Al-Muhtaseb,et al.  Recognition of off-line printed Arabic text using Hidden Markov Models , 2008, Signal Process..

[36]  Tianwen Zhang,et al.  Off-line recognition of realistic Chinese handwriting using segmentation-free strategy , 2009, Pattern Recognit..

[37]  Graham Leedham,et al.  Knowledge-based English cursive script segmentation , 2000, Pattern Recognit. Lett..

[38]  Misako Suwa Segmentation of connected handwritten numerals by graph representation , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[39]  C. Renaudin,et al.  A General Method of Segmentation-Recognition Collaboration Applied to Pairs of Touching and Overlapping Symbols , 2007 .

[40]  Rohini K. Srihari,et al.  Automatic scoring of short handwritten essays in reading comprehension tests , 2008, Artif. Intell..

[41]  Its'hak Dinstein,et al.  Adaptive shape prior for recognition and variational segmentation of degraded historical characters , 2009, Pattern Recognit..

[42]  Ching Y. Suen,et al.  Automatic segmentation and recognition system for handwritten dates on Canadian bank cheques , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[43]  Sameh M. Awaidah,et al.  A multiple feature/resolution scheme to Arabic (Indian) numerals recognition using hidden Markov models , 2009, Signal Process..

[44]  M. Taylan Das,et al.  Signature verification (SV) toolbox: Application of PSO-NN , 2009, Eng. Appl. Artif. Intell..

[45]  Amer Dawoud,et al.  Iterative Cross Section Sequence Graph for Handwritten Character Segmentation , 2007, IEEE Transactions on Image Processing.

[46]  Paul D. Gader,et al.  Fusion of multiple handwritten word recognition techniques , 2001, Pattern Recognit. Lett..

[47]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Pengfei Shi,et al.  Segmentation of Connected Handwritten Chinese Characters Based on Stroke Analysis and Background Thinning , 2000, PRICAI.