Document text segmentation using multiband disc model

This paper proposes a multi-band disc model for document page segmentation that separates text blocks from graphic images. We first introduce the basic disc model and then discuss the improved multi-band version. The disc model takes a bottom-up approach to segmentation: it establishes the local neighborhood of each object on a page and then traces the propagation of these neighborhoods until all objects in a text block are reached. The significance of the disc model lies in the link it establishes between the sizes of objects and their positional, and thus logical, relationships. Furthermore, the disc model is rotationally symmetric. It can therefore be applied to text with mixed typefaces and arbitrary outline shapes, and it is tolerant of skew or misalignment of the objects in the input image.
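The bottom-up idea described above can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify how the disc radius is derived, so the sketch assumes each object (for example, a connected component) is summarized as a hypothetical `(cx, cy, size)` tuple and that its disc radius is `scale * size`, where `scale` is an illustrative parameter. Two objects are treated as neighbors when their discs overlap, and groups are formed by propagating that relation breadth-first. The multi-band refinement is not modeled here.

```python
from collections import deque
from math import hypot


def disc_neighbors(a, b, scale=1.5):
    """Return True if the discs around objects a and b overlap.

    Each object is (cx, cy, size). The radius scale * size is an
    assumption made for illustration, not taken from the paper.
    """
    (ax, ay, a_size), (bx, by, b_size) = a, b
    return hypot(ax - bx, ay - by) <= scale * (a_size + b_size)


def group_objects(objects, scale=1.5):
    """Group objects by propagating disc neighborhoods breadth-first."""
    unvisited = set(range(len(objects)))
    groups = []
    while unvisited:
        seed = min(unvisited)
        unvisited.remove(seed)
        group, queue = [seed], deque([seed])
        while queue:
            i = queue.popleft()
            # Collect neighbors first, then update the set, so the set
            # is never modified while it is being iterated.
            reached = [j for j in unvisited
                       if disc_neighbors(objects[i], objects[j], scale)]
            for j in reached:
                unvisited.remove(j)
                group.append(j)
                queue.append(j)
        groups.append(sorted(group))
    return groups


# Three evenly spaced "characters" and one distant object.
objects = [(0, 0, 10), (25, 0, 10), (50, 0, 10), (300, 300, 10)]
groups = group_objects(objects)
```

Because the neighbor test depends only on the distance between centers and on object sizes, rotating the whole page leaves the grouping unchanged, which is the rotational symmetry the abstract refers to. Tying the radius to object size also means larger glyphs reach proportionally farther, so text set in different sizes can be chained with the same rule.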
