Usage-based PageRank for Web personalization

Recommendation algorithms aim at proposing "next" pages to a user based on her current visit and the past users' navigational patterns. In the vast majority of related algorithms, only the usage data are used to produce recommendations, whereas the structural properties of the Web graph are ignored. We claim that taking also into account the Web structure and using link analysis algorithms ameliorates the quality of recommendations. In this paper we present UPR, a novel personalization algorithm which combines usage data and link analysis techniques for ranking and recommending Web pages to the end user. Using the Web site's structure and its usage data we produce personalized navigational graph synopsis (prNG) to be used for applying UPR and produce personalized recommendations. Experimental results show that the accuracy of the recommendations is superior to pure usage-based approaches.

[1]  Mark Levene,et al.  Data Mining of User Navigation Patterns , 1999, WEBKDD.

[2]  Neoklis Polyzotis,et al.  Structure and Value Synopses for XML Data Graphs , 2002, VLDB.

[3]  M. Kendall,et al.  Rank Correlation Methods , 1949 .

[4]  George Karypis,et al.  Selective Markov models for predicting Web page accesses , 2004, TOIT.

[5]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[6]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[7]  C. Lee Giles,et al.  Probabilistic user behavior models , 2003, Third IEEE International Conference on Data Mining.

[8]  Iraklis Varlamis,et al.  SEWeP: using site semantics and a taxonomy to enhance the Web personalization process , 2003, KDD '03.

[9]  Maguelonne Teisseire,et al.  Using data mining techniques on Web access logs to dynamically improve hypertext structure , 1999, LINK.

[10]  Oren Etzioni,et al.  Towards adaptive Web sites: Conceptual framework and case study , 1999, Artif. Intell..

[11]  Mark Levene,et al.  Computing the Entropy of User Navigation in the Web , 2003, Int. J. Inf. Technol. Decis. Mak..

[12]  Maurice Mulvenna,et al.  Navigation Pattern Discovery from Internet Data , 1999 .

[13]  WebUsersMyra Spiliopoulou,et al.  A Data Miner analyzing the Navigational Behaviour of , 1999 .

[14]  Neoklis Polyzotis,et al.  Approximate XML query answers , 2004, SIGMOD '04.

[15]  Russ Bubley,et al.  Randomized algorithms , 1995, CSUR.

[16]  Padhraic Smyth,et al.  Visualization of navigation patterns on a Web site using model-based clustering , 2000, KDD '00.

[17]  Fabrizio Silvestri,et al.  An Online Recommender System for Large Web Sites , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[18]  Padhraic Smyth,et al.  A general probabilistic framework for clustering individuals and objects , 2000, KDD '00.

[19]  MAGDALINI EIRINAKI,et al.  Web mining for web personalization , 2003, TOIT.

[20]  Matthew Richardson,et al.  The Intelligent surfer: Probabilistic Combination of Link and Content Information in PageRank , 2001, NIPS.

[21]  Myra Spiliopoulou,et al.  Revised Papers from the International Workshop on Web Usage Analysis and User Profiling , 1999 .

[22]  Carolina Ruiz,et al.  FS-Miner: efficient and incremental mining of frequent sequence patterns in web logs , 2004, WIDM '04.

[23]  F. Menczer,et al.  Personalizing PageRank Based on Domain Profiles , 2004 .

[24]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[25]  Mark Hansen,et al.  Predicting Web Users' Next Access Based on Log Data , 2003 .

[26]  R. Forthofer,et al.  Rank Correlation Methods , 1981 .

[27]  Sourav S. Bhowmick,et al.  Mining History of Changes to Web Access Patterns , 2004, PKDD.