Simple sequentially designed rule-based alphanumerics recognition algorithm for OCR document processing using a thinning process

A simple method to recognize the printed alphanumerics is discussed. The proposed method is a simple rule-based structural method to recognize printed alphanumerics of image scanner data based on the thinning operation. This paper also presents major achievement made toward the development of a fast hierarchical recognition scheme for the printed and handwritten facsimile data. The conventional thinning techniques give good results for high-resolution image scanner data, but they suffer drawbacks for low-resolution data. Our scheme recognizes 55 characters per second on the IBM PC/386 environment and the recognition rate is 98%.

[1]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Bharat K. Bhargava,et al.  Tree Systems for Syntactic Pattern Recognition , 1973, IEEE Transactions on Computers.