Text block segmentation using pyramid structure

Text block segmentation is necessary in document layout analysis. An algorithm and its implementation that segregates text block by block (a block is either a title or a paragraph) from the provided document, e.g. newspaper image, based on pyramid structure is described in this paper. The pyramid structure, which is amenable for parallel processing on output, is a multi-resolution image representation. The pyramid structure also simulates what the human eyes see the document from afar visualizing the block structure of the document, the block segmentation can identify the titles, and distinguish different paragraphs based on the indentation between them. Our implementation will be used in a news articles retrieval project.

[1]  George Nagy,et al.  DOCUMENT ANALYSIS WITH AN EXPERT SYSTEM , 1986 .

[2]  Masahiko Hase,et al.  Segmentation method of document images by two-dimensional fourier transformation , 1985, Systems and Computers in Japan.

[3]  Chew Lim Tan,et al.  Agent-Based Text Extraction from Pyramid Images , 1999 .

[4]  S. Tanimoto Pictorial feature distortion in a pyramid , 1976 .

[5]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[6]  Andreas Dengel,et al.  Document Description and Analysis by Cuts , 1988, RIAO.

[7]  Sargur N. Srihari,et al.  Recognizing Address Blocks on Mail Pieces: Specialized Tools and Problem-Solving Architecture , 1987, AI Mag..

[8]  Anil K. Jain,et al.  Page segmentation using tecture analysis , 1996, Pattern Recognit..

[9]  Walter G. Kropatsch Properties of Pyramidal Representations , 1994, Theoretical Foundations of Computer Vision.

[10]  Yuan Yan Tang,et al.  Automatic Analysis and Understanding of Documents , 1993, Handbook of Pattern Recognition and Computer Vision.

[11]  J Patrick Bixler Tracking text in mixed-mode documents , 2000, DOCPROCS '88.

[12]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[13]  Yasuaki Nakano,et al.  A Top-Down Approach to the Analysis of Document Images , 1992 .

[14]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..