José Borges Semantically Enriched Web Usage Mining for Personalization T

The continuous growth in the size of the World Wide Web has resulted in intricate Web sites, demanding enhanced user skills and more sophisticated tools to help the Web user to find the desired information. In order to make Web more user friendly, it is necessary to provide personalized services and recommendations to the Web user. For discovering interesting and frequent navigation patterns from Web server logs many Web usage mining techniques have been applied. The recommendation accuracy of usage based techniques can be improved by integrating Web site content and site structure in the personalization process. Herein, we propose semantically enriched Web Usage Mining method for Personalization (SWUMP), an extension to solely usage based technique. This approach is a combination of the fields of Web Usage Mining and Semantic Web. In the proposed method, we envisage enriching the undirected graph derived from usage data with rich semantic information extracted from the Web pages and the Web site structure. The experimental results show that the SWUMP generates accurate recommendations and is able to achieve 10-20% better accuracy than the solely usage based model. The SWUMP addresses the new item problem inherent to solely usage based techniques. Keywords—Prediction, Recommendation, Semantic Web Usage Mining, Web Usage Mining.

[1]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[2]  Fabrizio Silvestri,et al.  An Online Recommender System for Large Web Sites , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[3]  Peter Hofgesang,et al.  On Modelling and Synthetically Generating Web Usage Data , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[4]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[5]  Martin Junghans,et al.  Enabling Semantic Analysis of User Browsing Patterns in the Web of Data , 2012, ArXiv.

[6]  Pinar Senkul,et al.  Improving pattern quality in web usage mining by using semantic information , 2012, Knowledge and Information Systems.

[7]  Umeshwar Dayal,et al.  From User Access Patterns to Dynamic Hypertext Linking , 1996, Comput. Networks.

[8]  Cong Wang,et al.  Web user clustering and Web prefetching using Random Indexing with weight functions , 2011, Knowledge and Information Systems.

[9]  Alun D. Preece,et al.  Instance Based Clustering of Semantic Web Resources , 2008, ESWC.

[10]  Stuart E. Middleton,et al.  Ontological user profiling in recommender systems , 2004, TOIS.

[11]  Tao Luo,et al.  Integrating Web Usage and Content Mining for More Effective Personalization , 2000, EC-Web.

[12]  Alberto Apostolico,et al.  String Editing and Longest Common Subsequences , 1997, Handbook of Formal Languages.

[13]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[14]  Jie Lu,et al.  Ontology-style Web usage model for semantic Web applications , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[15]  Bamshad Mobasher,et al.  A Unified Approach to Personalization Based on Probabilistic Latent Semantic Models of Web Usage and Content , 2004 .

[16]  Jaideep Srivastava,et al.  Creating adaptive Web sites through usage-based clustering of URLs , 1999, Proceedings 1999 Workshop on Knowledge and Data Engineering Exchange (KDEX'99) (Cat. No.PR00453).

[17]  Ali Mamat,et al.  WebPUM: A Web-based recommendation system to predict user future movements , 2010, Expert Syst. Appl..

[18]  Iraklis Varlamis,et al.  SEWeP: using site semantics and a taxonomy to enhance the Web personalization process , 2003, KDD '03.

[19]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[20]  Mark Levene,et al.  Evaluating Variable-Length Markov Chain Models for Analysis of User Web Navigation Sessions , 2007, IEEE Transactions on Knowledge and Data Engineering.

[21]  Sungjune Park,et al.  Sequence-based clustering for Web usage mining: A new experimental framework and ANN-enhanced K-means algorithm , 2008, Data Knowl. Eng..

[22]  Chabane Djeraba,et al.  A framework for mining meaningful usage patterns within a semantically enhanced web portal , 2010, C3S2E '10.

[23]  Florent Masseglia,et al.  WebTool: An Integrated Framework for Data Mining , 1999, DEXA.

[24]  Giovanna Castellano,et al.  NEWER: A system for NEuro-fuzzy WEb Recommendation , 2011, Appl. Soft Comput..

[25]  Michalis Vazirgiannis,et al.  Introducing Semantics in Web Personalization: The Role of Ontologies , 2005, EWMF/KDO.

[26]  Vlado Keselj,et al.  n-Gram-based classification and unsupervised hierarchical clustering of genome sequences , 2006, Comput. Methods Programs Biomed..

[27]  Dimitrios Pierrakos KOINOTITES: A Web Usage Mining Tool for Personalization , 2001 .

[28]  Juan D. Velásquez,et al.  Extracting significant Website Key Objects: A Semantic Web mining approach , 2011, Eng. Appl. Artif. Intell..

[29]  Haibin Liu,et al.  Combined mining of Web server logs and web contents for classifying user navigation patterns and predicting users' future requests , 2007, Data Knowl. Eng..

[30]  Christie I. Ezeife,et al.  Semantic-Rich Markov Models for Web Prefetching , 2009, 2009 IEEE International Conference on Data Mining Workshops.