Binarization and Recognition of Degraded Characters Using a Maximum Separability Axis in Color Space and GAT Correlation

This paper proposes a new technique of binarization and recognition of characters in color with a wide variety of image degradations and complex backgrounds. The key ideas are twofold. One is to automatically select one axis in the RGB color space that maximizes the between-class separability by a suitably chosen threshold for segmentation of character and background or binarization. The other is affine-invariant or distortion-tolerant grayscale character recognition using global affine transformation (GAT) correlation that yields the maximum correlation value between input and template images. In experiments, we use a total of 698 test images extracted from the public ICDAR 2003 robust OCR dataset containing a variety of single-character images in natural scenes. In advance, we classify those images into seven groups according to the degree of image degradations and/or background complexity. On the other hand, we only prepare a single-font set of 62 alphanumerics for templates. Experimental results show an average recognition rate of 81.4%, ranging from 94.5% for clear images to 39.3% for seriously distorted images

[1]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[2]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[3]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[4]  Yann LeCun,et al.  Efficient Pattern Recognition Using a New Transformation Distance , 1992, NIPS.

[5]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Atsushi Sato A learning method for definite canonicalization based on minimum classification error , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[7]  Andrea Miene,et al.  Extracting textual inserts from digital videos , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[8]  Akira Tomono,et al.  Affine-Invariant Recognition of Gray-Scale Characters Using Global Affine Transformation Correlation , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Jean-Michel Jolion,et al.  Text localization, enhancement and binarization in multimedia documents , 2002, Object recognition supported by user interaction for service robots.

[10]  David S. Doermann,et al.  Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[11]  Adnan Amin,et al.  Automatic thresholding of gray-level using multistage approach , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[12]  Simon M. Lucas,et al.  ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..