Using Ontology and Sequence Information for Extracting Behavior Patterns from Web Navigation Logs

Many e-commerce web sites such as online book retailers or specialized information hubs such as online movie databases make use of recommendation systems where users are directed to items of interests based on past user interactions. While keyword based approaches are naive and do not take content or context into account, collaborative filtering and content-based techniques su er from biased ratings, first item and first-rater problems. Recent approaches try to incorporate underlying semantic properties of data by employing ontology based usage mining. This work aims to design a Web navigation behavior extraction approach that makes use of page semantics and navigation sequence information where web pages are seen as objects and sessions as sequence of objects. Instead of relying on users’ content ratings, user sessions are clustered on a semantic level to capture di erent behavioral groups. Since semantic information and sequence are used for the clustering distance function, each cluster represents a behavior group instead of simple data groups. In this work, we use the recommendation results as a means for measuring the e ectiveness of the clusters we have generated. The e ect of integrating semantic information and page access sequence into the patterns are evaluated with a set of experiments.

[1]  Osmar R. Zaïane,et al.  Clustering Web sessions by sequence alignment , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.

[2]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[3]  Ronald L. Rivest,et al.  Introduction to Algorithms, third edition , 2009 .

[4]  Jaideep Srivastava,et al.  Creating adaptive Web sites through usage-based clustering of URLs , 1999, Proceedings 1999 Workshop on Knowledge and Data Engineering Exchange (KDEX'99) (Cat. No.PR00453).

[5]  Christie I. Ezeife,et al.  Using domain ontology for semantic web usage mining and next page prediction , 2009, CIKM.

[6]  Hichem Frigui,et al.  Clustering by competitive agglomeration , 1997, Pattern Recognit..

[7]  Alexander Maedche,et al.  Clustering Ontology-Based Metadata in the Semantic Web , 2002, PKDD.

[8]  Ajith Abraham,et al.  Web usage mining using artificial ant colony clustering and linear genetic programming , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[9]  Vipin Kumar,et al.  Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results , 1998, IEEE Data Eng. Bull..

[10]  Yuhui Qiu,et al.  Concept Similarity Matching Based on Semantic Distance , 2008, 2008 Fourth International Conference on Semantics, Knowledge and Grid.

[11]  Pinar Senkul,et al.  Using semantic information for web usage mining based recommendation , 2009, 2009 24th International Symposium on Computer and Information Sciences.

[12]  Bamshad Mobasher,et al.  Impact of Site Characteristics on Recommendation Models Based On Association Rules and Sequential Patterns , 2003 .

[13]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[14]  Andreas Hotho,et al.  Conceptual User Tracking , 2003, AWIC.

[15]  Tao Luo,et al.  Integrating Web Usage and Content Mining for More Effective Personalization , 2000, EC-Web.

[16]  Bamshad Mobasher,et al.  Using Ontologies to Discover Domain-Level Web Usage Profiles , 2002 .

[17]  Anupam,et al.  Mining Web Access Logs Using Relational Competitive Fuzzy Clustering , 1999 .

[18]  Cyrus Shahabi,et al.  Knowledge discovery from users Web-page navigation , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[19]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[20]  Pedro Pina,et al.  Self-Organized Data and Image Retrieval as a Consequence of Inter-Dynamic Synergistic Relationships in Artificial Ant Colonies , 2002, HIS.

[21]  Sungjune Park,et al.  Sequence-based clustering for Web usage mining: A new experimental framework and ANN-enhanced K-means algorithm , 2008, Data Knowl. Eng..

[22]  Yanchun Zhang,et al.  A Latent Usage Approach for Clustering Web Transaction and Building User Profile , 2005, ADMA.

[23]  S. Taherizadeh,et al.  Integrating Web Content Mining into Web Usage Mining for Finding Patterns and Predicting Users’ Behaviors , 2012 .

[24]  Chabane Djeraba,et al.  Toward Recommendation Based on Ontology-Powered Web-Usage Mining , 2007, IEEE Internet Computing.

[25]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[26]  Andreas Hotho,et al.  Towards Semantic Web Mining , 2002, SEMWEB.

[27]  Ramón García-Martínez,et al.  Web Usage Mining Using Self Organized Maps , 2007 .