Building cameras for capturing documents

Abstract. This paper explores those aspects of document capture that are specific to cameras. Each of them must be addressed in order to close the gap between taking a photograph of a document and capturing the document itself. We present results in five areas: (1) framing documents using structured light, (2) robustly dealing with ambient illumination when capturing glossy documents, (3) improving text quality when using mosaiced color sensors, (4) robustly and passively recovering perspective and image plane skew using text flow, and (5) measuring and undoing page curl using structured light and an applicable surface model. The ultimate success of subsequent document recognition will be heavily dependent on the successful completion of these tasks.
