Web site: a structured document

A web site is a set of web pages and hypertexts links.The contribution of this paper is a description of websites also called web documents as a structured objectwith a logical and a physical structure. Indeed, it willshow that the web document can be divided intogeometrical blocks (physical structure) and furthermoreeach page of a site has an individual function, each canbe classified within a meaning (logical function).An application of this object is also described using aneye-tracking experiment. For this experiment, some webdocuments was studied with the same methodology thanthat used for paper documents. Two eye-tracking resultsare developed and for each the usual meaning fordocument and the signification in the context of webdocuments will be described. The results show the strongrelationship between the structure and the userperception of web document. Two eye-tracking data werederived from the results: the number of fixation per webdocument and the number of blocks (pages) seen are bothcorrelated with structure descriptors.

[1]  Apostolos Antonacopoulos,et al.  Text Extraction from Web Images Based on Human Perception and Fuzzy Inference , 2001 .

[2]  Eli Upfal,et al.  The Web as a graph , 2000, PODS.

[3]  Joseph H. Goldberg,et al.  Eye tracking in web search tasks: design implications , 2002, ETRA.

[4]  Gerd Maderlechner,et al.  Extraction of relevant information from document images using measures of visual attention , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[5]  Antoine Gagneux,et al.  Quality Approach of Web Documents by an Evaluation of Structure Relevance , 2001 .

[6]  Mohamed Cheriet,et al.  Documents analysis and understanding : a brief survey , 1991 .

[7]  Jacques Labiche,et al.  Image sorting and image classification: a global approach , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[8]  Mayer D. Schwartz,et al.  The Dexter Hypertext Reference Model , 1994, CACM.

[9]  Proceedings Seventh International Conference on Document Analysis and Recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[10]  Véronique Eglin,et al.  Logarithmic spiral grid and gaze control for the development of strategies of visual segmentation on a document , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[11]  Jakob Nielsen,et al.  Designing Web Usability: The Practice of Simplicity , 1999 .

[12]  Ching Y. Suen,et al.  Document structures: A survey , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).