Analysis of Compressed Document Images for Dominant Skew, Multiple Skew, and Logotype Detection
暂无分享,去创建一个
Among the most commonly used compression algorithms for document images are those defined by the Consultative Committee for International Telephone and Telegraph (CCITT). CCITT Group III compression is used in all facsimile transmission by modem over analog telephone lines. CCITT Group IV is used in digital transmission and storage of document images. Sufficient readily interpretable spatial information exists in these compressed document images to enable their characterization. In particular, it is possible to locate the positions of the bottoms of both black and white structures. Using the bottoms of black structures we can determine the peak strength of their alignment in order to determine the dominant skew angle of the image. This method can be expanded, by finding minor peaks, to identify multiple skew angles in single images. The angular distributions of the peak alignments of both white and black structures are assembled to form an alignment signature. Logotypes can be designed which generate distinct alignment signatures that are detectable in the compressed representation.
[1] R. Hunter,et al. International digital facsimile coding standards , 1980, Proceedings of the IEEE.
[2] Andrew D. Bagdanov,et al. Evaluation of document image skew estimation techniques , 1996, Electronic Imaging.
[3] Henry S. Baird,et al. The skew angle of printed documents , 1995 .
[4] J. C. Stoffel. Data Compression Ratios Versus Sample Resolution , 1980, Optics & Photonics.