Random surfing on multipartite graphs

Equipping an imaginary Random Surfer of the Web with the ability to teleport, was Page et al.'s creative way to justify a mathematical necessity; the Teleportation Matrix. Despite being essential for ensuring the ergodicity of the underlying Markov chain, the standard definition of this matrix treats the nodes of the graph in a simplistic and “leveling” way that can prove counterintuitive — especially for applications of the method on graphs of heterogeneous data. In this work, we focus on such graphs and we propose a novel alternative teleportation model that yields a well-defined ranking vector, while being as easy to handle as the traditional teleportation. We explore the theoretical implications of our model, and we reveal a wealth of nice properties that result to direct computational advantages over PageRank. We conduct a set of experiments using real-world datasets and we verify both the useful computational characteristics of our model and its favorable qualitative performance. Our promising findings suggest there remain more to be explored, and maybe much to be gained, by revisiting the teleportation model; a neglected part of PageRank that is typically taken for granted by the majority of applications in the literature.

[1]  François Fouss,et al.  An experimental investigation of kernels on graphs for collaborative recommendation and semisupervised classification , 2012, Neural Networks.

[2]  John D. Garofalakis,et al.  NCDawareRank: a novel ranking method that exploits the decomposable structure of the web , 2013, WSDM.

[3]  John D. Garofalakis,et al.  NCDREC: A Decomposability Inspired Framework for Top-N Recommendation , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[4]  Carl Dean Meyer,et al.  Stochastic Data Clustering , 2010, SIAM J. Matrix Anal. Appl..

[5]  Sebastiano Vigna,et al.  PageRank: Functional dependencies , 2009, TOIS.

[6]  Philip S. Yu,et al.  Unsupervised learning on k-partite graphs , 2006, KDD '06.

[7]  Takuya Akiba,et al.  Computing Personalized PageRank Quickly by Exploiting Graph Structures , 2014, Proc. VLDB Endow..

[8]  Ryan A. Rossi,et al.  A Dynamical System for PageRank with Time-Dependent Teleportation , 2012, Internet Math..

[9]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[10]  Chun Chen,et al.  Document recommendation in social tagging services , 2010, WWW '10.

[11]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[12]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[13]  William J. Stewart,et al.  Introduction to the numerical solution of Markov Chains , 1994 .

[14]  John D. Garofalakis,et al.  Random Surfing Without Teleportation , 2015, Algorithms, Probability, Networks, and Games.

[15]  Qiang Yang,et al.  Exploiting the hierarchical structure for link analysis , 2005, SIGIR '05.

[16]  Herbert A. Simon,et al.  Aggregation of Variables in Dynamic Systems , 1961 .

[17]  J. A. Bondy,et al.  Graph Theory , 2008, Graduate Texts in Mathematics.

[18]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.

[19]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[20]  Giorgios Kollias,et al.  Surfing the Network for Ranking by Multidamping , 2014, IEEE Transactions on Knowledge and Data Engineering.

[21]  Christos Faloutsos,et al.  Fast Random Walk with Restart and Its Applications , 2006, Sixth International Conference on Data Mining (ICDM'06).

[22]  Peter Bühlmann,et al.  Protein and gene model inference based on statistical modeling in k-partite graphs , 2010, Proceedings of the National Academy of Sciences.

[23]  Kevin S. McCurley,et al.  Ranking the web frontier , 2004, WWW '04.

[24]  G. Grimmett,et al.  Probability and random processes , 2002 .

[25]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[26]  Ricardo A. Baeza-Yates,et al.  Generic Damping Functions for Propagating Importance in Link-Based Ranking , 2006, Internet Math..

[27]  P.-J. Courtois,et al.  Decomposability: Queueing and Computer System Applications , 2014 .

[28]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).