Document segmentation using texture variance and low resolution images

This paper describes a document segmentation method based on segmentation by texture using low resolution gray level images. The method is derived from the human vision perception theory. The concepts used from this theory are, global to local processing and low resolution information. If a document is viewed at a certain distance far from a person, the person sees a blurred image of the document, but is still able to detect the different blocks of the document. Detection is possible since each block has a specific texture pattern. These patterns correspond to regions of text, regions of graphics and regions of pictures. Thus the theory to prove is that a document image can be segmented into regions of text, and regions of graphics and/or pictures using the texture of low resolution images. The method presented in this paper, despite its simplicity, has shown to be effective and robust. It was designed to work with free format documents, text in background other than white, skew greater than 10 degrees. It requires less computation than the segmentation methods using texture described in other papers.

[1]  Anil K. Jain,et al.  Automatic filter design for texture discrimination , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[2]  M. Yamada,et al.  Document image processing based on enhanced border following algorithm , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[3]  B Julesz,et al.  Experiments in the visual perception of texture. , 1975, Scientific American.

[4]  T. Pavlidis,et al.  Page segmentation without rectangle assumption , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[5]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Lawrence O'Gorman,et al.  Document Image Analysis Systems - Guest Editors' Introduction to the Special Issue , 1992, Computer.

[7]  D. Navon Forest before trees: The precedence of global features in visual perception , 1977, Cognitive Psychology.

[8]  Rama Chellappa,et al.  Page segmentation using decision integration and wavelet packets , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[9]  T. John Stonham,et al.  Document segmentation using texture analysis , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[10]  Lawrence O'Gorman,et al.  Document Image Analysis , 1996 .

[11]  Yoshitake Tsuji,et al.  Document Image Analysis For Reading Books , 1987, Other Conferences.

[12]  C T Scialfa,et al.  Preferential processing of target features in texture segmentation , 1995, Perception & psychophysics.

[13]  T. John Stonham,et al.  Unsupervised/supervised texture segmentation and its application to real-world data , 1992, Other Conferences.

[14]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[15]  Anil K. Jain,et al.  Text segmentation using gabor filters for automatic document processing , 1992, Machine Vision and Applications.