Document Flattening through Grid Modeling and Regularization

For document images captured by a digital camera, perspective and geometric distortions make it hard to recognize the document content properly. In this paper, we propose an integrated document restoration technique, which is capable of removing perspective and geometric distortions, and producing a flattened and fronto-parallel text image that is friendly to the generic OCR systems. The proposed document restoration is accomplished through grid modeling, which divides camera images into multiple quadrilateral grids using vertical text directions and the x lines and base lines. The global distortions are then removed through grid regularization that transforms the quadrilateral grids together with the pixel contents to the regular square grids. Experimental results show the proposed method is fast and easy for implementation

[1]  Michael S. Brown,et al.  Geometric and shading correction for images of printed materials: a unified approach using boundary , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[2]  Shijian Lu,et al.  Perspective rectification of document images using fuzzy set and morphological operations , 2005, Image Vis. Comput..

[3]  David S. Doermann,et al.  Flattening curved documents in images , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[5]  Christoph H. Lampert,et al.  Document image dewarping using robust estimation of curled text lines , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[6]  W. Brent Seales,et al.  Image restoration of arbitrarily warped documents , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Gady Agam,et al.  Document Image De-warping for Text/Graphics Recognition , 2002, SSPR/SPR.

[8]  A.W.M. Smeulders,et al.  An introduction to image processing , 1991 .

[9]  Majid Mirmehdi,et al.  Recognising text in real scenes , 2002, International Journal on Document Analysis and Recognition.

[10]  Roberto Cipolla,et al.  Using frontier points to recover shape, reflectance and illumination , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[11]  Changsong Liu,et al.  A cylindrical surface model to rectify the bound document image , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  André Marion,et al.  Introduction to Image Processing , 1990, Springer US.

[13]  Maurizio Pilu Undoing paper curl distortion using applicable surfaces , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.