A new methodology for gray-scale character segmentation and recognition

Generally speaking, through the binarization of gray-scale images, useful information for the segmentation of touching or overlapping characters may be lost. If we analyze gray-scale images, however, specific topographic features and the variation of intensity can be observed in the character boundaries. We believe that such kinds of clues obtained from gray-scale images should be useful for efficient character segmentation. In this paper, we propose a new methodology for character segmentation and recognition which makes the best use of the characteristics of gray-scale images. In the proposed methodology, the character segmentation regions are determined by using projection profiles and topographic features extracted form gray-scale images. Then the nonlinear character segmentation path in each character segmentation region is found by using multistage graph search algorithm. Finally, in order to confirm the character segmentation paths and recognition results, recognition based segmentation method is adopted.

[1]  Majid Ahmadi,et al.  Segmentation of touching characters in printed document recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[2]  Yi Lu,et al.  Machine printed character segmentation --; An overview , 1995, Pattern Recognit..

[3]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[4]  Theodosios Pavlidis,et al.  On the Recognition of Printed Characters of Any Font and Size , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Theodosios Pavlidis,et al.  Direct Gray-Scale Extraction of Features for Character Recognition , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Ulrich Kressel,et al.  Cut classification for segmentation , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[7]  Jin Wang,et al.  Segmentation of merged characters by neural networks and shortest-path , 1993, SAC '93.

[8]  Ellis Horowitz,et al.  Fundamentals of Computer Algorithms , 1978 .

[9]  Young-Joon Kim,et al.  Direct Extraction of Topographic Features for Gray Scale Character Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Theodosios Pavlidis,et al.  A solution to the problem of touching and broken characters , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).