Web Usage Mining: Discovering Usage Patterns for Web Applications

The heterogeneous nature of the Web, combined with the rapid diffusion of Web-based applications, has made Web browsing an intricate activity for users. This has given rise to an urgent need for systems capable of assisting and guiding users as they navigate the Web. Web Usage Mining (WUM) refers to the application of Data Mining techniques to the automatic discovery of meaningful usage patterns that characterize users' browsing behavior, starting from access data collected from users' interactions with sites. The discovered patterns can then be exploited to implement functionalities that assist users. This chapter provides an overview of the stages involved in a general WUM process. As an example, it presents a WUM approach that uses fuzzy clustering to discover user categories from usage patterns.
