BiRank: Towards Ranking on Bipartite Graphs

The bipartite graph is a ubiquitous data structure that can model the relationship between two entity types: for instance, users and items, queries and webpages. In this paper, we study the problem of ranking vertices of a bipartite graph, based on the graph's link structure as well as prior information about vertices (which we term a query vector). We present a new solution, BiRank, which iteratively assigns scores to vertices and finally converges to a unique stationary ranking. In contrast to the traditional random walk-based methods, BiRank iterates towards optimizing a regularization function, which smooths the graph under the guidance of the query vector. Importantly, we establish how BiRank relates to the Bayesian methodology, enabling the future extension in a probabilistic way. To show the rationale and extendability of the ranking methodology, we further extend it to rank for the more generic n-partite graphs. BiRank's generic modeling of both the graph structure and vertex features enables it to model various ranking hypotheses flexibly. To illustrate its functionality, we apply the BiRank and TriRank (ranking for tripartite graphs) algorithms to two real-world applications: a general ranking scenario that predicts the future popularity of items, and a personalized ranking scenario that recommends items of interest to users. Extensive experiments on both synthetic and real-world datasets demonstrate BiRank's soundness (fast convergence), efficiency (linear in the number of graph edges), and effectiveness (achieving state-of-the-art in the two real-world tasks).

[1]  Jimmy J. Lin,et al.  WTF: the who to follow service at Twitter , 2013, WWW.

[2]  Tat-Seng Chua,et al.  Fast Matrix Factorization for Online Recommendation with Implicit Feedback , 2016, SIGIR.

[3]  Wei-Ying Ma,et al.  Bipartite graph reinforcement model for web image annotation , 2007, ACM Multimedia.

[4]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[5]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[6]  Jimeng Sun,et al.  Relevance search and anomaly detection in bipartite graphs , 2005, SKDD.

[7]  Huanbo Luan,et al.  Discrete Collaborative Filtering , 2016, SIGIR.

[8]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[9]  Jussara M. Almeida,et al.  Using early view patterns to predict the popularity of youtube videos , 2013, WSDM.

[10]  Bernhard Schölkopf,et al.  Regularization on Discrete Spaces , 2005, DAGM-Symposium.

[11]  Michael R. Lyu,et al.  A generalized Co-HITS algorithm and its application to bipartite graphs , 2009, KDD.

[12]  Shankar Kumar,et al.  Video suggestion and discovery for youtube: taking random walks through the view graph , 2008, WWW.

[13]  Yiqun Liu,et al.  Predicting the popularity of web 2.0 items based on user comments , 2014, SIGIR.

[14]  Min-Yen Kan,et al.  Comment-based multi-view clustering of web 2.0 items , 2014, WWW.

[15]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[16]  Michael I. Jordan,et al.  Stable algorithms for link analysis , 2001, SIGIR '01.

[17]  Rynson W. H. Lau,et al.  Knowledge and Data Engineering for e-Learning Special Issue of IEEE Transactions on Knowledge and Data Engineering , 2008 .

[18]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[19]  Yiqun Liu,et al.  Do users rate or review?: boost phrase-level sentiment labeling with review-level sentiment classification , 2014, SIGIR.

[20]  Steffen Rendle,et al.  Learning recommender systems with adaptive regularization , 2012, WSDM '12.

[21]  Ming Gao,et al.  R-energy for evaluating robustness of dynamic networks , 2013, WebSci.

[22]  Alexander J. Smola,et al.  Kernels and Regularization on Graphs , 2003, COLT.

[23]  Yiqun Liu,et al.  Search engine click spam detection based on bipartite graph propagation , 2014, WSDM.

[24]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[25]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[26]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[27]  Bernhard Schölkopf,et al.  Learning from labeled and unlabeled data on a directed graph , 2005, ICML.

[28]  Tie-Yan Liu,et al.  BrowseRank: letting web users vote for page importance , 2008, SIGIR '08.

[29]  Lei Cao,et al.  Bipartite Graph Based Entity Ranking for Related Entity Finding , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[30]  Shlomo Moran,et al.  The stochastic approach for link-structure analysis (SALSA) and the TKC effect , 2000, Comput. Networks.

[31]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[32]  Tie-Yan Liu,et al.  Semi-supervised ranking on very large graphs with rich metadata , 2011, KDD.

[33]  Chris H. Q. Ding,et al.  PageRank, HITS and a unified framework for link analysis , 2002, SIGIR '02.

[34]  Daraksha Parveen,et al.  Multi-document Summarization Using Bipartite Graphs , 2014, TextGraphs@EMNLP.

[35]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[36]  Shivani Agarwal,et al.  Ranking on graph data , 2006, ICML.

[37]  Sangkeun Lee,et al.  Random walk based entity ranking on graph for multidimensional recommendation , 2011, RecSys '11.

[38]  Tao Chen,et al.  TriRank: Review-aware Explainable Recommendation by Modeling Aspects , 2015, CIKM.

[39]  Daniel Dajun Zeng,et al.  A Random Walk Model for Item Recommendation in Social Tagging Systems , 2013, TMIS.