Persian cursive script recognition

The main objective of this paper is to design a Persian text recognition system. As even typed Persian scripts are cursive, our system includes a segmentation stage in order to separate the constituent characters. This stage is also useful for highly declined or italic Latin texts. A new segmentation algorithm with two consecutive steps is introduced in this paper. The first step separates isolated and non overlapped characters as well as some overlapped ones. The second step segments not connected overlapped characters. The novel segmentation method has been tested on some real world script and has shown an accuracy rate of more than 99.7%. In the recognition stage which involves a statistical approach, two types of feature sets along with different classification methods are evaluated.

[1]  Talaat S. El-Sheikh,et al.  Computer recognition of arabic cursive scripts , 1988, Pattern Recognit..

[2]  Sherif Sami El-Dabi,et al.  Arabic character recognition system: A statistical approach for recognizing cursive typewritten text , 1990, Pattern Recognit..

[3]  Ghassan A. Kubba The Impact of Computers on Arabic Writing, Character Processing and teaching , 1980, IFIP Congress.

[4]  Nobuyasu Itoh,et al.  A spelling correction method and its application to an OCR system , 1990, Pattern Recognit..

[5]  Hussein Almuallim,et al.  A Method of Recognition of Arabic Cursive Handwriting , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Behrooz Parhami,et al.  Automatic recognition of printed Farsi texts , 1981, Pattern Recognit..

[7]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[8]  SATOSI WATANABE,et al.  Ungrammatical grammar in pattern recognition , 1971, Pattern Recognit..

[9]  S. S. Upda,et al.  Recognition of Arabic Characters , 1992, IEEE Trans. Pattern Anal. Mach. Intell..