Character Recognition under Severe Perspective Distortion

Perspective deformation is one of the main issues needed to be addressed in real-scene character recognition. An effective recognition approach, which is able to handle severe perspective deformation, is to employ Cross Ratio Spectrum and Dynamic Time Warping techniques. However, this solution suffers from a time complexity of O(n4). In this paper, a clustering based indexing method is proposed to index cross ratio spectra and thus expedite the recognition. Cross ratio spectra of all templates are clustered. A query is compared with the centroid of each cluster instead of spectra of all templates. Our method is 40 times faster than the previous method, and has archived about 15-time speed up while preserving almost the same recognition accuracy in the real scene character recognition experiment.

[1]  Shijian Lu,et al.  Camera Text Recognition based on Perspective Invariants , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[2]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[3]  Maurizio Pilu,et al.  Extraction of illusory linear clues in perspectively skewed documents , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Klaus Gollmer,et al.  Detection of distorted pattern using dynamic time warping algorithm and application for supervision , 1995 .

[5]  Chew Lim Tan,et al.  Character Recognition under Severe Perspective Distortion , 2008, 2009 10th International Conference on Document Analysis and Recognition.

[6]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[7]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[8]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, CVPR 2004.

[9]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[10]  Shijian Lu,et al.  Perspective rectification of document images using fuzzy set and morphological operations , 2005, Image Vis. Comput..

[11]  Majid Mirmehdi,et al.  Recognising text in real scenes , 2002, International Journal on Document Analysis and Recognition.

[12]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[13]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[14]  Robert C. Bolles,et al.  Rectification and recognition of text in 3-D scenes , 2004, International Journal of Document Analysis and Recognition (IJDAR).