Cold-start news recommendation with domain-dependent browse graph

Online social networks and mash-up services create opportunities to connect different web services otherwise isolated. Specifically in the case of news, users are very much exposed to news articles while performing other activities, such as social networking or web searching. Browsing behavior aimed at the consumption of news, especially in relation to the visits coming from other domains, has been mainly overlooked in previous work. To address that, we build a BrowseGraph out of the collective browsing traces extracted from a large viewlog of Yahoo News (0.5B entries), and we define the ReferrerGraph as its subgraph induced by the sessions with the same referrer domain. The structural and temporal properties of the graph show that browsing behavior in news is highly dependent on the referrer URL of the session, in terms of type of content consumed and time of consumption. We build on this observation and propose a news recommender that addresses the cold-start problem: given a user landing on a page of the site for the first time, we aim to predict the page she will visit next. We compare 24 flavors of recommenders belonging to the families of content-based, popularity-based, and browsing-based models. We show that the browsing-based recommender that takes into account the referrer URL is the best performing, achieving a prediction accuracy of 48% in conditions of heavy data sparsity.

[1]  Yue Xu,et al.  Using Association Rules to Solve the Cold-Start Problem in Recommender Systems , 2010, PAKDD.

[2]  Tie-Yan Liu,et al.  BrowseRank: letting web users vote for page importance , 2008, SIGIR '08.

[3]  Chen Lin,et al.  PRemiSE: personalized news recommendation via implicit social experts , 2012, CIKM.

[4]  Igor Trajkovski,et al.  Pagerank-Like Algorithm for Ranking News Stories and News Portals , 2013, ICT Innovations.

[5]  Zhaohui Zheng,et al.  Learning to model relatedness for news recommendation , 2011, WWW.

[6]  Alan Said,et al.  News Recommendation in the Wild: CWI's Recommendation Algorithms in the NRS Challenge , 2013 .

[7]  Ravi Kumar,et al.  A characterization of online browsing behavior , 2010, WWW '10.

[8]  Luca Chiarandini,et al.  Discovering Social Photo Navigation Patterns , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[9]  Flavio Figueiredo,et al.  The tube over time: characterizing popularity growth of youtube videos , 2011, WSDM '11.

[10]  Deepak Agarwal,et al.  fLDA: matrix factorization through latent dirichlet allocation , 2010, WSDM '10.

[11]  Amanda Spink,et al.  Multitasking during Web search sessions , 2006, Inf. Process. Manag..

[12]  Hua Li,et al.  Demographic prediction based on user's browsing behavior , 2007, WWW '07.

[13]  John Riedl,et al.  Learning preferences of new users in recommender systems: an information theoretic approach , 2008, SKDD.

[14]  Min Zhang,et al.  Automatic online news topic ranking using media focus and user attention based on aging theory , 2008, CIKM '08.

[15]  Kannan Srinivasan,et al.  Modeling Online Browsing and Path Analysis Using Clickstream Data , 2004 .

[16]  Roi Blanco,et al.  Language intent models for inferring user browsing behavior , 2012, SIGIR '12.

[17]  M. de Rijke,et al.  Linking online news and social media , 2011, WSDM '11.

[18]  Abhinandan Das,et al.  Google news personalization: scalable online collaborative filtering , 2007, WWW '07.

[19]  Jürgen Pfeffer,et al.  Characterizing the life cycle of online news stories using social media reactions , 2013, CSCW.

[20]  Jiahui Liu,et al.  Personalized news recommendation based on click behavior , 2010, IUI '10.

[21]  Balaji Padmanabhan,et al.  SCENE: a scalable two-stage personalized news recommendation system , 2011, SIGIR.

[22]  H. Sobhanam,et al.  Addressing cold start problem in recommender systems using association rules and clustering technique , 2013, 2013 International Conference on Computer Communication and Informatics.

[23]  Francesco Romani,et al.  Ranking a stream of news , 2005, WWW '05.

[24]  Minghai Liu,et al.  User browsing behavior-driven web crawling , 2011, CIKM '11.

[25]  Craig MacDonald,et al.  News article ranking: leveraging the wisdom of bloggers , 2010, RIAO.

[26]  Yang Guo,et al.  Bayesian-Inference-Based Recommendation in Online Social Networks , 2011, IEEE Transactions on Parallel and Distributed Systems.

[27]  Luca Chiarandini,et al.  Leveraging Browsing Patterns for Topic Discovery and Photostream Recommendation , 2013, ICWSM.

[28]  Wiebke Wagner,et al.  Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit , 2010, Lang. Resour. Evaluation.

[29]  Yiqun Liu,et al.  User Browsing Graph: Structure, Evolution and Application , 2009, WSDM.