An approach and an Eclipse-based environment for enhancing the navigation structure of Web sites

This paper presents an approach based on information retrieval and clustering techniques for automatically enhancing the navigation structure of a Web site for improving navigability. The approach increments the set of navigation links provided in each page of the site with a semantic navigation map, i.e., a set of links enabling navigating from a given page to other pages of the site showing similar or related content. The approach uses Latent Semantic Indexing to compute a dissimilarity measure between the pages of the site and a graph-theoretic clustering algorithm to group pages showing similar or related content according to the calculated dissimilarity measure. AJAX code is finally used to extend each Web page with an associated semantic navigation map. The paper also presents a prototype of a tool developed to support the approach and the results from a case study conducted to assess the validity and feasibility of the proposal.

[1]  Dave Crane,et al.  Ajax in Action , 2005 .

[2]  Paolo Tonella,et al.  Understanding and Restructuring Web Sites with ReWeb , 2001, IEEE Multim..

[3]  Stéphane Ducasse,et al.  Enriching reverse engineering with semantic clustering , 2005, 12th Working Conference on Reverse Engineering (WCRE'05).

[4]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[5]  Paolo Tonella,et al.  Using clustering to support the migration from static to dynamic web pages , 2003, 11th IEEE International Workshop on Program Comprehension, 2003..

[6]  David B. Lowe,et al.  NavOptim Coding: Supporting Website Navigation Optimisation using Effort Minimisation , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[7]  Cornelia Boldyreff,et al.  Web Site Evolution , 2004, J. Softw. Maintenance Res. Pract..

[8]  Franca Garzotto,et al.  On the Acceptability of Conceptual Design Models for Web Applications , 2003, ER.

[9]  Andrian Marcus,et al.  Supporting program comprehension using semantic and structural information , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[10]  J. Michael Pearson,et al.  An Exploratory Study into Determining the Relative Importance of Key Criteria in Web Usability: A Multi-Criteria Approach , 2008, J. Comput. Inf. Syst..

[11]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[12]  Tomas Klos,et al.  Knowledge discovery in virtual community texts: Clustering virtual communities , 2003, J. Intell. Fuzzy Syst..

[13]  Preslav Nakov,et al.  Latent Semantic Analysis for German Literature Investigation , 2001, Fuzzy Days.

[14]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[15]  Giuliano Antoniol,et al.  An approach for reverse engineering of web-based applications , 2001, Proceedings Eighth Working Conference on Reverse Engineering.

[16]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[17]  Mario Luca Bernardi,et al.  Reverse engineering of Web Applications to abstract user-centered conceptual models , 2008, 2008 10th International Symposium on Web Site Evolution.

[18]  G. Rossi,et al.  An Object Oriented Approach to Web-Based Application Design , 1998 .

[19]  Gustaf Neumann,et al.  Parameters driving effectiveness of automated essay scoring with LSA , 2005 .

[20]  Massimiliano Di Penta,et al.  An approach to identify duplicated web pages , 2002, Proceedings 26th Annual International Computer Software and Applications.

[21]  Donna K. Harman,et al.  Ranking Algorithms , 1992, Information Retrieval: Data Structures & Algorithms.

[22]  Paolo Tonella,et al.  Improving Web site understanding with keyword-based clustering , 2008 .

[23]  Giuseppe Scanniello,et al.  Identifying similar pages in Web applications using a competitive clustering algorithm , 2007, J. Softw. Maintenance Res. Pract..

[24]  Claes Wohlin,et al.  Experimentation in software engineering: an introduction , 2000 .

[25]  L. Guttman Some necessary conditions for common-factor analysis , 1954 .

[26]  Cornelia Boldyreff,et al.  Reverse engineering to achieve maintainable WWW sites , 2001, Proceedings Eighth Working Conference on Reverse Engineering.

[27]  Gustavo Rossi,et al.  Model Refactoring in Web Applications , 2007, 2007 9th IEEE International Workshop on Web Site Evolution.

[28]  Gerti Kappel,et al.  Web engineering : the discipline of systematic development of web applications , 2006 .

[29]  Giannis Tsakonas,et al.  Exploring usefulness and usability in the evaluation of open access digital libraries , 2008, Inf. Process. Manag..

[30]  Gustavo Rossi,et al.  A comprehensive design model for integrating business processes in web applications , 2007, Int. J. Web Eng. Technol..

[31]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[32]  Paolo Tonella,et al.  Restructuring multilingual web sites , 2002, International Conference on Software Maintenance, 2002. Proceedings..

[33]  Stefano Ceri,et al.  Web Modeling Language (WebML): a modeling language for designing Web sites , 2000, Comput. Networks.

[34]  Giuseppe Scanniello,et al.  Using Semantic clustering to enhance the navigation structure of Web sites , 2008, 2008 10th International Symposium on Web Site Evolution.

[35]  Giuseppe Scanniello,et al.  Clustering Algorithms and Latent Semantic Indexing to Identify Similar Pages in Web Applications , 2007, 2007 9th IEEE International Workshop on Web Site Evolution.

[36]  Gustavo Rossi,et al.  An Object Oriented Approach to Web-Based Applications Design , 1998, Theory Pract. Object Syst..

[37]  Scott R. Tilley Ten years of Web Site Evolution , 2008, 2008 10th International Symposium on Web Site Evolution.

[38]  Giuseppe Scanniello,et al.  Identifying Cloned Navigational Patterns in Web Applications , 2006, J. Web Eng..

[39]  Jordi Cabot,et al.  A Catalogue of Refactorings for Navigation Models , 2008, 2008 Eighth International Conference on Web Engineering.

[40]  Nora Koch,et al.  The Authoring Process of the UML-based Web Engineering Approach , 2000 .