UBB mining: finding unexpected browsing behaviour in clickstream data to improve a Web site's design

This paper describes a novel Web usage mining approach to discover patterns in the navigation of Web sites known as unexpected browsing behaviours (UBBs). By reviewing these UBBs, a Web site designer can choose to modify the design of their Web site or redesign the site completely. UBB mining is based on the continuous common subsequence (CCS), a special instance of common subsequence (CS), which is used to define a set of expected routes. The predefined expected routes are then treated as rules and stored in a rule base. By using the predefined route and the UBB mining algorithm, interesting browsing behaviours can be discovered. This paper introduces the format of the expected route and describes the UBB algorithms. The paper also describes a series of experiments designed to evaluate how well UBB mining algorithms work.

[1]  Graham Storrs,et al.  Characterizing user navigation through complex data structures , 1985 .

[2]  Olfa Nasraoui,et al.  Mining Evolving Web Clickstreams with Explicit Retrieval Similarity Measures , 2004, WebDyn@WWW.

[3]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[4]  Vipin Kumar,et al.  Discovery of Web Robot Sessions Based on their Navigational Patterns , 2004, Data Mining and Knowledge Discovery.

[5]  Chris Kimble,et al.  A Pattern Restore Method for Restoring Missing Patterns in Server Side Clickstream Data , 2005, APWeb.

[6]  Tao Luo,et al.  Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization , 2004, Data Mining and Knowledge Discovery.

[7]  Mark Levene,et al.  Data Mining of User Navigation Patterns , 1999, WEBKDD.

[8]  Chih-Hung Liu,et al.  Intelligent agent-based systems for personalized recommendations in Internet commerce , 2002, Expert Syst. Appl..

[9]  Chunhua Ju,et al.  Reorganizing web sites based on user access patterns , 2001, CIKM '01.

[10]  Myra Spiliopoulou,et al.  Web Usage Analysis and User Profiling: International WEBKDD'99 Workshop San Diego, CA, USA, August 15, 1999 Revised Papers , 2000 .

[11]  Ravi Kothari,et al.  On using Page Cooccurrences for Computing Clickstream Similarity , 2003, SDM.

[12]  Jaideep Srivastava,et al.  Discovery of Interesting Usage Patterns from Web Data , 1999, WEBKDD.

[13]  Oren Etzioni,et al.  Adaptive Web sites , 2000, CACM.

[14]  Chris Kimble,et al.  Visualizing and Classifying the Pattern of User’s Browsing Behavior for Website Design Recommendation , 2004 .

[15]  Giles Hooker,et al.  Sequential Analysis for Learning Modes of Browsing , 2004 .

[16]  Arindam Banerjee,et al.  Clickstream clustering using weighted longest common subsequences , 2001 .

[17]  Peter Dömel,et al.  WebMap: A Graphical Hypertext Navigation Tool , 1995, Comput. Networks ISDN Syst..