Location and recovery of text on oriented surfaces

We present a method for extracting text from images in which the text plane is not necessarily fronto-parallel to the camera. We first locate local image features such as borders and page edges. We then apply perceptual grouping to these features to find rectangular regions in the scene, which are hypothesized to be pages or planes that may contain text. Edge distributions are then used to assess these candidate regions, providing a measure of confidence. We show that the text can then be transformed to a fronto-parallel view suitable, for example, for an OCR system or other higher-level recognition. The proposed method is independent of the scale (size) of the text. We illustrate the algorithm with several examples.
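The final step, transforming a detected region to a fronto-parallel view, can be illustrated with a plane-to-plane homography estimated from four corner correspondences. The following is a minimal sketch under stated assumptions, not the implementation used in this work: it assumes the grouping stage has already produced the four corners of a candidate region, it fixes the last homography entry to 1, and the function names (`solve_linear`, `homography`, `apply_h`), the corner coordinates, and the target rectangle size are all hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a*x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations act on both sides at once.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate corner configuration")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[List[float]]:
    """3x3 matrix H mapping four src corners to four dst corners (H[2][2] = 1)."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        # Each correspondence contributes two linear equations in 8 unknowns.
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def apply_h(h: List[List[float]], p: Point) -> Point:
    """Map an image point through H, dividing out the projective scale."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


# Hypothetical corners of a detected page (image coordinates, clockwise
# from top-left) and the fronto-parallel rectangle they should map to.
quad = [(120.0, 80.0), (420.0, 110.0), (400.0, 380.0), (100.0, 300.0)]
page = [(0.0, 0.0), (300.0, 0.0), (300.0, 200.0), (0.0, 200.0)]
H = homography(quad, page)
```

To resample the region, one would typically invert this mapping (or estimate it from `page` to `quad`) and look up a source pixel for every pixel of the output rectangle. The aspect ratio of the target rectangle is an assumption here; the sketch does not recover it from the image.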
