Logarithmic spiral grid and gaze control for the development of strategies of visual segmentation on a document

The paper presents a page segmentation method which is based on perception phenomena and displays the unequal importance of information in the visual field. The access of information is directly linked to the search of attractive areas. This search is based on the idea of freeing oneself from an unbending physical structure and from a uniform vertical and horizontal scanning of the document, so as to classify the data in order of importance and interest. Using a space variant geometry for block selection, the page image, instead of being represented by a bitmap format, can be abstractly represented by the block format. This space variant geometry lays a sound basis for elaborating the kinetics of the ocular shifting on a document, which provides not only a meaningless document representation in blocks, but shows a unified view corresponding to the integration of time variant representations of the same visual field.

[1]  E. V. Krishnamurthy,et al.  On the compactness of subsets of digital pictures , 1978 .

[2]  Stewart W. Wilson On the Retino-Cortical Mapping , 1983, Int. J. Man Mach. Stud..

[3]  Richard M. Stern,et al.  Fast Computation of the Difference of Low-Pass Transform , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Lawrence W. Stark,et al.  Visual perception and sequences of eye movement fixations: a stochastic modeling approach , 1992, IEEE Trans. Syst. Man Cybern..

[5]  Shin-Ywan Wang,et al.  Block selection: a method for segmenting a page image of various editing styles , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[6]  HIROYUKI YAMAMOTO,et al.  An Active Foveated Vision System: Attentional Mechanisms and Scan Path Covergence Measures , 1996, Comput. Vis. Image Underst..