Multiresolution Morphological Approach to Document Image Analysis

An image-based approach to document image analysis is presented. The methods are motivated by a merged view of shape and textural image properties at multiple scales. The principal binary image operations are morphological and multiresolution. The generalized opening is introduced for extraction of both shape and texture from an image. Threshold reduction operations are introduced for performing efficient and controllable shape and texture transformations between resolution levels. Some problems, such as halftone or dark area segmentation, can be in large part solved by a sequence of threshold reductions. Aspects of the approach are illustrated by the problem of identifying italic and bold words in text, using word-level extraction at lowered resolution. The computational costs of the basic operations are given, so that algorithm efficienc y can be estimated, and the importance of operating at the lowest feasable resolution is demonstrated. For example, word segmentation and halftone extraction proceed in excess of 1.5x10 image pixels/second on a Sun Sparcstation2 .

[1]  Dan S. Bloomberg Image analysis using threshold reduction , 1991, Optics & Photonics.

[2]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[3]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[4]  Sargur N. Srihari,et al.  Classification of newspaper image blocks using texture analysis , 1989, Comput. Vis. Graph. Image Process..

[5]  Petros Maragos,et al.  Generalized hit-miss operators , 1990, Optics & Photonics.

[6]  Philip J. Bones,et al.  Segmentation of document images , 1990, Other Conferences.

[7]  Xinhua Zhuang,et al.  Binary morphology: working in the sampled domain , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  P. J. Burt,et al.  The Pyramid as a Structure for Efficient Computation , 1984 .