Exploiting Vector Fields for Geometric Rectification of Distorted Document Images

This paper proposes a segment-free method for geometric rectification of a distorted document image captured by a hand-held camera. The method can recover the 3D page shape by exploiting the intrinsic vector fields of the image. Based on the assumption that the curled page shape is a general cylindrical surface, we estimate the parameters related to the camera and the 3D shape model through weighted majority voting on the vector fields. Then the spatial directrix of the surface is recovered by solving an ordinary differential equation (ODE) through the Euler method. Finally, the geometric distortions in images can be rectified by flattening the estimated 3D page surface onto a plane. Our method can exploit diverse types of visual cues available in a distorted document image to estimate its vector fields for 3D page shape recovery. In comparison to the state-of-the-art methods, the great advantage is that it is a segment-free method and does not have to extract curved text lines or textual blocks, which is still a very challenging problem especially for a distorted document image. Our method can therefore be freely applied to document images with extremely complicated page layouts and severe image quality degradation. Extensive experiments are implemented to demonstrate the effectiveness of the proposed method.

[1]  Michael S. Brown,et al.  Geometric and shading correction for images of printed materials using boundary , 2006, IEEE Transactions on Image Processing.

[2]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[3]  L. M. Mestetskiy,et al.  Usage of continuous skeletal image representation for document images de-warping , 2007 .

[4]  Gaofeng Meng,et al.  Active Flattening of Curved Document Images via Two Structured Beams , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Ioannis Pratikakis,et al.  Segmentation Based Recovery of Arbitrarily Warped Document Images , 2007 .

[6]  Michael S. Brown,et al.  Geometric and shading correction for images of printed materials: a unified approach using boundary , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[7]  Gaofeng Meng,et al.  Metric Rectification of Curved Document Images , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Chew Lim Tan,et al.  Straightening warped text lines using polynomial regression , 2002, Proceedings. International Conference on Image Processing.

[9]  Syed Saqib Bukhari,et al.  Dewarping of Document Images using Coupled-Snakes , 2009 .

[10]  Raúl Rojas,et al.  Robust Document Warping with Interpolated Vector Fields , 2007 .

[11]  Yuandong Tian,et al.  Rectification and 3D reconstruction of curved document images , 2011, CVPR 2011.

[12]  Katsushi Ikeuchi,et al.  Multiview Rectification of Folded Documents , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Ioannis Pratikakis,et al.  Goal-Oriented Rectification of Camera-Based Document Images , 2011, IEEE Transactions on Image Processing.

[14]  Nam Ik Cho,et al.  Document dewarping via text-line based optimization , 2015, Pattern Recognit..

[15]  David S. Doermann,et al.  Flattening curved documents in images , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Wenxin Li,et al.  A Model-based Book Dewarping Method Using Text Line Detection , 2007 .

[17]  Mila Nikolova,et al.  Analysis of Half-Quadratic Minimization Methods for Signal and Image Recovery , 2005, SIAM J. Sci. Comput..

[18]  Changsong Liu,et al.  A cylindrical surface model to rectify the bound document image , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[19]  Christoph H. Lampert,et al.  Document image dewarping using robust estimation of curled text lines , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[20]  Nanning Zheng,et al.  Active Rectification of Curved Document Images Using Structured Beams , 2016, International Journal of Computer Vision.

[21]  Nam Ik Cho,et al.  Composition of a Dewarped and Enhanced Document Image From Two View Images , 2009, IEEE Transactions on Image Processing.

[22]  Faisal Shafait Document Image Dewarping Contest , 2007 .

[23]  Chew Lim Tan,et al.  Correcting document image warping based on regression of curved text lines , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[24]  W. B. Seales,et al.  Restoring 2D Content from Distorted Documents , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Gaofeng Meng,et al.  Extraction of Virtual Baselines from Distorted Document Images Using Curvilinear Projection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Michael S. Brown,et al.  A unified framework for document restoration using inpainting and shape-from-shading , 2009, Pattern Recognit..

[27]  George Nagy,et al.  Twenty Years of Document Image Analysis in PAMI , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  David S. Doermann,et al.  Geometric Rectification of Camera-Captured Document Images , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Chew Lim Tan,et al.  Restoring Warped Document Images through 3D Shape Modeling , 2006, IEEE Trans. Pattern Anal. Mach. Intell..