论文信息 - Machine recognition and correction of printed Arabic text

Machine recognition and correction of printed Arabic text

A method for automatic recognition of a multifont Arabic text entered from a scanner of 300 dpi density is presented. The system is based on two components, one for character recognition and one for word recognition. Character recognition is further divided into three phases: the digitization process, segmentation of words into characters, and identification of characters. The word recognition component is based on the Viterbi algorithm and can handle some identification errors. Character recognition was achieved despite several impeding properties of the Arabic script, especially the connectivity of characters. The processing speed is close to three characters per second with a 90% recognition rate. All algorithms were written in Pascal and run on an IBM PC/AT. >

Adnan Amin | Jean F. Mari | A. Amin | Jean-François Mari

[1] L. D. Harmon,et al. Automatic recognition of print and script , 1972 .

[2] M. Berthod,et al. Automatic recognition of handprinted characters—The state of the art , 1980, Proceedings of the IEEE.

[3] Roy L. Hoffman,et al. Segmentation Methods for Recognition of Machine-Printed Characters , 1971, IBM J. Res. Dev..

[4] Theodosios Pavlidis,et al. On the Recognition of Printed Characters of Any Font and Size , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Patrick Shen-pei Wang,et al. An application of array grammars to clustering analysis for syntactic patterns , 1984, Pattern Recognit..

[6] Herbert Freeman,et al. On the Encoding of Arbitrary Geometric Configurations , 1961, IRE Trans. Electron. Comput..

[7] Jr. G. Forney,et al. The viterbi algorithm , 1973 .