Ligature Modeling for Online Cursive Script Recognition

Online recognition of cursive words is a difficult task owing to variable letter shapes and ambiguous letter boundaries. The proposed approach is based on hidden Markov modeling of letters and of the inter-letter patterns, called ligatures, that occur in cursive script. We create one HMM for each letter and each ligature to model the temporal and spatial variability of handwriting. By interconnecting the two kinds of HMMs, we can design a network model for all words or composite characters. The network incorporates grammatical and structural constraints as knowledge sources so that it can better capture the characteristics of handwriting. Given the network, recognition is formulated as the problem of finding the most likely path from the start node to the end node. A dynamic programming-based search for the optimal input-network alignment performs character recognition and letter segmentation simultaneously and efficiently. Experiments on Korean characters showed correct recognition rates of up to 93.3% on unconstrained samples. The approach has also been compared with several other HMM-based recognition schemes to characterize its behavior.
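The path search described above can be illustrated with a minimal Viterbi sketch. It is not the authors' implementation: the toy network, the state names (`a`, `lig`, `b`), the direction-code observations (`D`, `R`, `U`), and all probabilities are hypothetical, and each letter or ligature is collapsed to a single state, whereas the paper uses one full HMM per letter and per ligature. The sketch only shows how a single dynamic programming pass can yield both the best path and the letter boundaries.

```python
import math
from itertools import groupby

NEG_INF = float("-inf")

def viterbi(obs, states, start, end, log_trans, log_emit):
    """Best path from `start` (first observation) to `end` (last observation)."""
    # delta[s]: best log score of any path that ends in state s at the current step
    delta = {s: NEG_INF for s in states}
    delta[start] = log_emit[start].get(obs[0], NEG_INF)
    back = []  # back[t][s]: best predecessor of s at step t + 1
    for o in obs[1:]:
        new_delta, pointers = {}, {}
        for s in states:
            best_prev, best_score = None, NEG_INF
            for p in states:
                score = delta[p] + log_trans.get((p, s), NEG_INF)
                if score > best_score:
                    best_prev, best_score = p, score
            new_delta[s] = best_score + log_emit[s].get(o, NEG_INF)
            pointers[s] = best_prev
        delta = new_delta
        back.append(pointers)
    if delta[end] == NEG_INF:
        return NEG_INF, []  # the end node cannot be reached
    path = [end]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return delta[end], path

def segment(path):
    """Group consecutive identical states into (label, first_index, end_index)."""
    segments, pos = [], 0
    for label, run in groupby(path):
        length = len(list(run))
        segments.append((label, pos, pos + length))
        pos += length
    return segments

# Hypothetical toy network: letter "a" -> ligature "lig" -> letter "b".
states = ["a", "lig", "b"]
log_trans = {
    ("a", "a"): math.log(0.5), ("a", "lig"): math.log(0.5),
    ("lig", "lig"): math.log(0.5), ("lig", "b"): math.log(0.5),
    ("b", "b"): math.log(0.5),
}
# Hypothetical emissions over pen-direction codes: D(own), R(ight), U(p).
log_emit = {
    "a": {"D": math.log(0.8), "R": math.log(0.1), "U": math.log(0.1)},
    "lig": {"D": math.log(0.1), "R": math.log(0.8), "U": math.log(0.1)},
    "b": {"D": math.log(0.1), "R": math.log(0.1), "U": math.log(0.8)},
}

obs = ["D", "D", "R", "U", "U"]
score, path = viterbi(obs, states, "a", "b", log_trans, log_emit)
segments = segment(path)
print(path)
print(segments)
```

Because the ligature is a state on the decoded path, the segment boundaries fall out of the same search that picks the letters; no separate segmentation step is needed in this sketch.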
