论文信息 - Multi-font recognition of printed Arabic using the BBN BYBLOS speech recognition system

Multi-font recognition of printed Arabic using the BBN BYBLOS speech recognition system

We use a hidden Markov model (HMM) based continuous speech recognition system to perform off-line character recognition (OCR) of Arabic printed text. The HMM trainer and recognizer are used without change, however we modify the feature extraction stage to compute features relevant to OCR. Although we begin by segmenting the page into a collection of lines, no further segmentation is necessary for either recognition or training. Experiments on the ARPA Arabic data corpus yield a range of character error rates from under one percent for a single computer font to 2.8% for multiple-font recognition of a wide range of material from books, magazines and newspapers.

Christopher Raphael | Ying Zhao | Richard M. Schwartz | John Makhoul | Christopher LaPre

[1] Richard M. Schwartz,et al. Improved hidden Markov modeling of phonemes for continuous speech recognition , 1984, ICASSP.

[2] Ching Y. Suen,et al. Historical review of OCR research and development , 1992, Proc. IEEE.

[3] Qi Tian,et al. Survey: omnifont-printed character recognition , 1991, Other Conferences.

[4] Richard M. Schwartz,et al. On-line cursive handwriting recognition using speech recognition methods , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[5] J Makhoul,et al. State of the art in continuous speech recognition. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[6] Jerome R. Bellegarda,et al. Tied mixture continuous parameter models for large vocabulary isolated speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7] J. Makhoul,et al. Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[8] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.