Higher-order rank analysis for web structure

In this paper, we propose a method for the structural analysis of Web sites.The Web has become one of the most widely used media for electronic information because of its great flexibility. However, this flexibility has led to complicated structures. A structure that differs from the typical structures in a Web site might confuse readers, thus reducing the effectiveness of the site. A method for detecting unusual structures would be useful for identifying such structures so that their impact can be studied and ways to improve Web site effectiveness developed.We viewed the Web as a directed graph, and introduced a higher-order rank based on the non-well-founded set theory. We then developed higher-order rank analysis for detecting irregularities, defined as structures which differ from the typical structure of a target site. To test the effectiveness of our method, we applied it to several Web sites in actual use, and succeeded in identifying irregular structures in the sites.

[1]  Robert E. Tarjan,et al.  Three Partition Refinement Algorithms , 1987, SIAM J. Comput..

[2]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[3]  Chaomei Chen Structuring and visualising the WWW by generalised similarity analysis , 1997, HYPERTEXT '97.

[4]  Ke Wang,et al.  Discovering typical structures of documents: a road map approach , 1998, SIGIR '98.

[5]  Peter Aczel,et al.  Non-well-founded sets , 1988, CSLI lecture notes series.

[6]  Kazunori Yamaguchi,et al.  Structural analysis for web documentation using the non-well-founded set , 2004, HYPERTEXT '04.

[7]  K. Devlin The joy of sets : fundamentals of contemporary set theory , 1993 .

[8]  Sougata Mukherjea,et al.  Visualizing the World-Wide Web with the Navigational View Builder , 1995, Comput. Networks ISDN Syst..

[9]  John E. McEneaney Visualizing and assessing navigation in hypertext , 1999, Hypertext.

[10]  Sandra K. Parker,et al.  Web Style Guide: Basic Design Principles for Creating Web Sites , 1999 .

[11]  John E. McEneaney Graphic and numerical methods to assess navigation in hypertext , 2001, Int. J. Hum. Comput. Stud..

[12]  Ben Shneiderman,et al.  Identifying aggregates in hypertext structures , 1991, HYPERTEXT '91.

[13]  E. James Whitehead Uniform comparison of data models using containment modeling , 2002, HYPERTEXT '02.

[14]  David Carmel,et al.  The connectivity sonar: detecting site functionality by structural patterns , 2003, HYPERTEXT '03.

[15]  Ben Shneiderman,et al.  Structural analysis of hypertexts: identifying hierarchies and useful metrics , 1992, TOIS.