论文信息 - Real-time embedded skew detection and frame removal

Real-time embedded skew detection and frame removal

It is common to observe document skew and frame artifacts while photocopying and scanning documents. The motivation of this work is to embed skew correction and frame removal in the copy pipeline of a device to achieve ‘one touch’ cleanup. The two challenges that this poses are the need for: (a) substantially reducing computation and memory requirements and (b) minimizing the false positives. Peripheral document features, such as, page/content edges are low-complexity document skew predictors, and content-based approaches are of relatively higher complexity skew predictors. But state-of-the-art page edge detection methods fail on low-contrast document images, or for similar scanbed/document background. To minimize false positives required in embedded implementations, we propose: (1) a robust page edge detection algorithm that is a multiplicative combination of gradients and line based page edge detectors, (2) a robust skew detection algorithm that is a linear combination of page/content edge and content based predictors, and (3) a pipeline for skew correction and frame removal that uses these algorithms and has near-100% accuracy over a wide range of document images.

[1] Ioannis Pratikakis,et al. A segmentation-free approach for keyword search in historical typewritten documents , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[2] Alan W. Paeth,et al. A fast algorithm for general raster rotation , 1986 .

[3] Rafael Dueire Lins,et al. BigBatch: a document processing platform for clusters and grids , 2008, SAC '08.

[4] Jonathan J. Hull. Document Image skew Detection: Survey and Annotated Bibliography , 1996, DAS.

[5] Prasenjit Dey,et al. e-PCP: A robust skew detection method for scanned document images , 2010, Pattern Recognit..

[6] Dalong Li,et al. Shape Retrieval Based on Distance Ratio Distribution , 2002 .

[7] Anders Krogh,et al. A General Method for Combining Predictors Tested on Protein Secondary Structure Prediction , 2000, ANNIMAB.

[8] Serene Banerjee,et al. Real-time optimal-memory image rotation for embedded systems , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[9] Hua Jin,et al. The Optimal Linear Combination of Multiple Predictors Under the Generalized Linear Models. , 2009, Statistics & probability letters.