Integrating recommendation models for improved web page prediction accuracy

Recent research initiatives have addressed the need for improved performance of Web page prediction accuracy that would profit many applications, e-business in particular. Different Web usage mining frameworks have been implemented for this purpose specifically Association rules, clustering, and Markov model. Each of these frameworks has its own strengths and weaknesses and it has been proved that using each of these frameworks individually does not provide a suitable solution that answers today's Web page prediction needs. This paper endeavors to provide an improved Web page prediction accuracy by using a novel approach that involves integrating clustering, association rules and Markov models according to some constraints. Experimental results prove that this integration provides better prediction accuracy than using each technique individually.

[1]  Dimitrios Skoutas,et al.  STAVIES: a system for information extraction from unknown Web data sources through automatic Web wrapper generation using clustering techniques , 2005, IEEE Transactions on Knowledge and Data Engineering.

[2]  Joydeep Ghosh,et al.  A Unified Framework for Model-based Clustering , 2003, J. Mach. Learn. Res..

[3]  Giuliano Casale,et al.  Combining queueing networks and web usage mining techniques for web performance analysis , 2005, SAC '05.

[4]  Tzyy-Ching Yang,et al.  A group-based inference approach to customized marketing on the Web integrating clustering and association rules techniques , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[5]  Michalis Vazirgiannis,et al.  Web path recommendations based on page ranking and Markov models , 2005, WIDM '05.

[6]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[7]  Christoph F. Eick,et al.  Supervised clustering - algorithms and benefits , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[8]  Peter Pirolli,et al.  Mining Longest Repeating Subsequences to Predict World Wide Web Surfing , 1999, USENIX Symposium on Internet Technologies and Systems.

[9]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[10]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[11]  Ke Wang,et al.  Building Association-Rule Based Sequential Classifiers for Web-Document Prediction , 2004, Data Mining and Knowledge Discovery.

[12]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[13]  Lin Lu,et al.  Mining Significant Usage Patterns from Clickstream Data , 2005, WEBKDD.

[14]  Jaswinder Pal Singh,et al.  Predicting category accesses for a user in a structured information space , 2002, SIGIR '02.

[15]  Qing Wang,et al.  Characterizing customer groups for an e-commerce website , 2004, EC '04.

[16]  Zhang Yang Mining Sequential Association Rule for Improving Web Document Prediction , 2006 .

[17]  Vipul Mathur,et al.  An overhead and resource contention aware analytical model for overloaded web servers , 2007, WOSP '07.

[18]  Sourav S. Bhowmick,et al.  WAM-Miner: in the search of web access motifs from historical web log data , 2005, CIKM '05.

[19]  Michael Bieber,et al.  A clickstream-based collaborative filtering personalization model: towards a better performance , 2004, WIDM '04.

[20]  Tao Luo,et al.  Effective personalization based on association rule discovery from web usage data , 2001, WIDM '01.

[21]  Giannis Tzimas,et al.  A method for personalized clustering in data intensive web applications , 2006, APS '06.

[22]  Lin Lu,et al.  Discovery of Significant Usage Patterns from Clusters of Clickstream Data , 2005 .

[23]  Alexander P. Pons Object prefetching using semantic links , 2006, DATB.

[24]  Padhraic Smyth,et al.  Model-Based Clustering and Visualization of Navigation Patterns on a Web Site , 2003, Data Mining and Knowledge Discovery.

[25]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[26]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[27]  Christos Bouras,et al.  Predictive Prefetching on the Web and Its Potential Impact in the Wide Area , 2004, World Wide Web.

[28]  Jun Hong,et al.  Using Markov models for web site link prediction , 2002, HYPERTEXT '02.

[29]  George Karypis,et al.  Selective Markov models for predicting Web page accesses , 2004, TOIT.

[30]  Songfeng Lu,et al.  Mining association rules using clustering , 2001, Intell. Data Anal..

[31]  Iraklis Varlamis,et al.  THESUS: Organizing Web document collections based on link semantics , 2003, The VLDB Journal.

[32]  Diego Sona,et al.  Clustering documents in a web directory , 2003, WIDM '03.