论文信息 - A robust omnifont open-vocabulary Arabic OCR system using pseudo-2D-HMM

A robust omnifont open-vocabulary Arabic OCR system using pseudo-2D-HMM

Recognizing old documents is highly desirable since the demand for quickly searching millions of archived documents has recently increased. Using Hidden Markov Models (HMMs) has been proven to be a good solution to tackle the main problems of recognizing typewritten Arabic characters. These attempts however achieved a remarkable success for omnifont OCR under very favorable conditions, they didn't achieve the same performance in practical conditions, i.e. noisy documents. In this paper we present an omnifont, large-vocabulary Arabic OCR system using Pseudo Two Dimensional Hidden Markov Model (P2DHMM), which is a generalization of the HMM. P2DHMM offers a more efficient way to model the Arabic characters, such model offer both minimal dependency on the font size/style (omnifont), and high level of robustness against noise. The evaluation results of this system are very promising compared to a baseline HMM system and best OCRs available in the market (Sakhr and NovoDynamics). The recognition accuracy of the P2DHMM classifier is measured against the classic HMM classifier, the average word accuracy rates for P2DHMM and HMM classifiers are 79% and 66% respectively. The overall system accuracy is measured against Sakhr and NovoDynamics OCR systems, the average word accuracy rates for P2DHMM, NovoDynamics, and Sakhr are 74%, 71%, and 61% respectively.

Sherif Abdou | Mohsen Rashwan | Abdullah M. Rashwan | Ahmed Abdel-Hameed | Ahmed Husien Khalil

[1] Mohamed S. El-Mahallawy,et al. Histogram-Based Lines and Words Decomposition for Arabic Omni Font-Written OCR Systems; Enhancements and Evaluation , 2007, CAIP.

[2] Mohamed Attia,et al. Autonomously normalized horizontal differentials as features for HMM-based Omni font-written OCR systems for cursively scripted languages , 2009, 2009 IEEE International Conference on Signal and Image Processing Applications.

[3] Robert M. Gray,et al. An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[4] Tapas Kanungo,et al. OmniPage vs. Sakhr: paired model evaluation of two Arabic OCR products , 1999, Electronic Imaging.

[5] Chng Eng Siong,et al. Automatic Sports Video Genre Classification using Pseudo-2D-HMM , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[6] Mohammad S. Khorsheed,et al. Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK) , 2007, Pattern Recognit. Lett..

[7] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8] Richard M. Schwartz,et al. An Omnifont Open-Vocabulary OCR System for English and Arabic , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[9] M. Castellano,et al. Face Recognition by Observation-Sequence-Based Methods Based on Pseudo 2D HMM and Neural Networks , 2007, 2007 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications.