Design of an Embedded Arabic Optical Character Recognition

This work presents an embedded Arabic OCR system. The proposed system is compact and portable which make it useful for many applications such as blind assistance and language translation. OCR system consists of the sub-systems: image acquisition, pre-processing, segmentation, feature extraction, classification, and post- processing. For each sub-system there are several of algorithms and techniques to be implemented. Working with PCs gives the designer freedom to select the algorithms and techniques according to the required performance, reliability and reusability. However with the embedded systems we are facing many problems and challenges. Such challenges are associated with memory, speed, and computational power. FPGA is selected as the hardware platform for realizing that recognition task. An OCR system is designed and implemented on PC. Then this system is transferred to FPGA after a set of optimization procedures. Utilizing the features of FPGA technology, Hardware / Software co-design is accomplished on an FPGA board. In that design the systems is partitioned into software modules and hardware components to get the advantages of software flexibility and hardware speed. A database of 3000 Arabic characters is used to train and test the performance of the system. The effects of changing the number of features and classification parameters on accuracy, memory and speed are measured. Design points are selected in order to improve the memory required, speed and computation power without affecting the accuracy.

[1]  Yasuhiro Okada,et al.  Structural character recognition using simulated annealing , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[2]  E. Lecolinet,et al.  Strategies in character segmentation: a survey , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[3]  Neil W. Bergmann,et al.  An Arabic optical character recognition system using recognition-based segmentation , 2001, Pattern Recognit..

[4]  Adnan Amin,et al.  Off line Arabic character recognition: a survey , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[5]  Steven Pigeon,et al.  VIP: an FPGA-based processor for image processing and neural networks , 1996, Proceedings of Fifth International Conference on Microelectronics for Neural Networks.

[6]  Nafiz Arica,et al.  An overview of character recognition focused on off-line handwriting , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[7]  Nikos Fakotakis,et al.  Handwritten word recognition based on structural characteristics and lexical support , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[8]  Jamshid Shanbehzadeh,et al.  Fast Zernike wavelet moments for Farsi character recognition , 2007, Image Vis. Comput..

[9]  Hiroyuki Hase,et al.  A new robust quadratic discriminant function , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[10]  Venu Govindaraju,et al.  Offline Arabic handwriting recognition: a survey , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  R. K. Agrawal,et al.  Discriminant Function Revisited for Incremental Learning , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.

[12]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Adnan Amin,et al.  Recognition of printed arabic text based on global features and decision tree learning techniques , 2000, Pattern Recognit..

[14]  Teng Long,et al.  Building Compact MQDF Classifier for Off-line Handwritten Chinese Characters by Subspace Distribution Sharing , 2007 .

[15]  Leonardo Maria Reyneri A Simulink-based hybrid codesign tool for rapid prototyping of FPGA's in signal processing systems , 2004, Microprocess. Microsystems.

[16]  Richard E. Haskell,et al.  FPGA integrated co-design , 2001, Proceedings 2001 International Conference on Microelectronic Systems Education.

[17]  Lawrence D. Jackel,et al.  Application of the ANNA neural network chip to high-speed character recognition , 1992, IEEE Trans. Neural Networks.

[18]  Ernest Valveny,et al.  Learning of structural descriptions of graphic symbols using deformable template matching , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[19]  Toshio Kondo,et al.  Kanji Character Recognition Unit With Hand-Scanner Using SIMD Processor , 1988, Other Conferences.