Robust stereo correspondence for documents by matching connected components of text-lines with dynamic programming

In this paper we present a novel method for robust stereo matching on document image pairs. The matching itself is performed using an affine-invariant similarity measurement to compensate for perspective distortions, where affine invariance is achieved by normalization using second-order statistics, to finally allow a simple pixel-wise comparison. To handle the inherent high self-similarity of the page content we apply a dynamic programming approach on text-line pairs. We quantitatively show that the proposed method performs better in comparison to standard approaches using SURF - whether with or without incorporating text-line information.

[1]  Syed Saqib Bukhari,et al.  Performance evaluation of curled textline segmentation algorithms on CBDAR 2007 dewarping contest dataset , 2010, 2010 IEEE International Conference on Image Processing.

[2]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[3]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[4]  Hermann Ney,et al.  Local context in non-linear deformation models for handwritten character recognition , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[5]  Syed Saqib Bukhari,et al.  Improvements to Uncalibrated Feature-Based Stereo Matching for Document Images by Using Text-Line Segmentation , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[6]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  T. Tuytelaars,et al.  Speeded-UpRobustFeatures(SURF) , 2008 .

[8]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[9]  Thabo Beeler,et al.  High-quality single-shot capture of facial geometry , 2010, ACM Trans. Graph..

[10]  Syed Saqib Bukhari,et al.  Text-Line Extraction Using a Convolution of Isotropic Gaussian Filter with a Set of Line Filters , 2011, 2011 International Conference on Document Analysis and Recognition.

[11]  Atsushi Yamashita,et al.  Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[12]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[13]  Christoph H. Lampert,et al.  Document capture using stereo vision , 2004, DocEng '04.

[14]  Syed Saqib Bukhari,et al.  Performance Evaluation of Curled Textlines Segmentation Algorithms , 2010 .