Unconstrained Farsi handwritten word recognition using fuzzy vector quantization and hidden Markov models

An unconstrained Farsi handwritten word recognition system based on fuzzy vector quantization (FVQ) and hidden Markov model (HMM) for reading city names in postal addresses is presented. Preprocessing techniques including binarization, noise removal, slope correction and baseline estimation are described. Each word image is represented by its contour information. The histogram of chain code slopes of the image strips (frames), scanned from right to left by a sliding window, is used as feature vectors. Fuzzy c-means (FCM) clustering is used for generating a fuzzy codebook. A separate HMM is trained by modified Baum‐Welch algorithm for each city name. A test image is recognized by finding the best match (likelihood) between the image and all of the HMM word models using forward algorithm. Experimental results show the advantages of using FVQ/HMM recognizer engine instead of conventional discrete HMMs. ” 2001 Elsevier Science B.V. All rights reserved.

[1]  D. Guillevic,et al.  HMM-KNN word recognition engine for bank cheque processing , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[2]  Eiichi Tsuboka,et al.  Mathematical considerations and improvement on the fuzzy vector quantization‐based hidden markov model , 1995 .

[3]  Hong Yan,et al.  Algorithm for stroke width compensation of handwritten characters , 1996 .

[4]  S. Srihari,et al.  Variable duration hidden markov model and morphological segmentation for handwritten word recognition , 1995, IEEE Transactions on Image Processing.

[5]  Edward A. Lee,et al.  Fuzzy vector quantazation applied to hidden Markov modeling , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Adnan Amin,et al.  Off-line Arabic character recognition: the state of the art , 1998, Pattern Recognit..

[7]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[8]  Gyeonghwan Kim,et al.  A Lexicon Driven Approach to Handwritten Word Recognition for Real-Time Applications , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[10]  Sargur N. Srihari,et al.  Variable duration hidden Markov model and morphological segmentation for handwritten word recognition , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.