Random walk-based ranking in signed social networks: model and algorithms

How can we rank nodes in signed social networks? Relationships between nodes in a signed network are represented as positive (trust) or negative (distrust) edges. Many social networks have adopted signed networks to express trust between users. Consequently, ranking friends or enemies in signed networks has received much attention from the data mining community. The ranking problem, however, is challenging because it is difficult to interpret negative edges. Traditional random walk-based methods such as PageRank and random walk with restart cannot provide effective rankings in signed networks since they assume only positive edges. Although several methods have been proposed by modifying traditional ranking models, they also fail to account for proper rankings due to the lack of ability to consider complex edge relations. In this paper, we propose Signed Random Walk with Restart ( SRWR ), a novel model for personalized ranking in signed networks. We introduce a signed random surfer so that she considers negative edges by changing her sign for walking. Our model provides proper rankings considering signed edges based on the signed random walk. We develop two methods for computing SRWR scores: SRWR-Iter and SRWR-Pre which are iterative and preprocessing methods, respectively. SRWR-Iter naturally follows the definition of SRWR , and iteratively updates SRWR scores until convergence. SRWR-Pre enables fast ranking computation which is important for the performance of applications of SRWR. Through extensive experiments, we demonstrate that SRWR achieves the best accuracy for link prediction, predicts trolls $$4\times $$ 4 × more accurately, and shows a satisfactory performance for inferring missing signs of edges compared to other competitors. In terms of efficiency, SRWR-Pre preprocesses a signed network $$4.5 \times $$ 4.5 × faster and requires $$11 \times $$ 11 × less memory space than other preprocessing methods; furthermore, SRWR-Pre computes SRWR scores up to $$14 \times $$ 14 × faster than other methods in the query phase.

[1]  F. Heider Attitudes and cognitive organization. , 1946, The Journal of psychology.

[2]  James A. Davis Clustering and Structural Balance in Graphs , 1977 .

[3]  Dongjin Song,et al.  Recommending Positive Links in Signed Social Networks by Optimizing a Generalized AUC , 2015, AAAI.

[4]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[5]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[6]  Scott P. Robertson,et al.  Proceedings of the SIGCHI Conference on Human Factors in Computing Systems , 1991 .

[7]  Jinhong Jung,et al.  Supervised and extended restart in random walks for ranking and link prediction in networks , 2017, PloS one.

[8]  Michael Szell,et al.  Multirelational organization of large-scale social networks in an online world , 2010, Proceedings of the National Academy of Sciences.

[9]  Christos Faloutsos,et al.  SlashBurn: Graph Compression and Mining beyond Caveman Communities , 2014, IEEE Transactions on Knowledge and Data Engineering.

[10]  Pablo Fernández,et al.  Google’s pagerank and beyond: The science of search engine rankings , 2008 .

[11]  John G. Lewis,et al.  Sparse matrix test problems , 1982, SGNM.

[12]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[13]  Mahdi Jalili,et al.  Ranking Nodes in Signed Social Networks , 2014, Social Network Analysis and Mining.

[14]  Lee Sael,et al.  BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs , 2015, SIGMOD Conference.

[15]  M. Rao Measure Theory and Integration , 2018 .

[16]  Shlomo Moran,et al.  SALSA: the stochastic approach for link-structure analysis , 2001, TOIS.

[17]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[18]  Christos Faloutsos,et al.  Fast best-effort pattern matching in large attributed graphs , 2007, KDD '07.

[19]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[20]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[21]  Jiming Liu,et al.  Community Mining from Signed Social Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[22]  David F. Gleich,et al.  Vertex neighborhoods, low conductance cuts, and good seeds for local community methods , 2012, KDD.

[23]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[24]  Arnab Bhattacharya,et al.  Finding the bias and prestige of nodes in networks based on trust scores , 2011, WWW.

[25]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[26]  Jon M Kleinberg,et al.  Hubs, authorities, and communities , 1999, CSUR.

[27]  Minji Yoon,et al.  Fast and Accurate Random Walk with Restart on Dynamic Graphs with Guarantees , 2017, WWW.

[28]  Michael I. Jordan,et al.  Stable algorithms for link analysis , 2001, SIGIR '01.

[29]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[30]  James Hendler,et al.  Google’s PageRank and Beyond: The Science of Search Engine Rankings , 2007 .

[31]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[32]  Lee Sael,et al.  Personalized Ranking in Signed Networks Using Signed Random Walk with Restart , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[33]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[34]  Jimeng Sun,et al.  Fast Random Walk Graph Kernel , 2012, SDM.

[35]  Minji Yoon,et al.  TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs , 2017, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[36]  Charu C. Aggarwal,et al.  The Troll-Trust Model for Ranking in Signed Networks , 2016, WSDM.

[37]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[38]  Lee Sael,et al.  Random Walk with Restart on Large Graphs Using Block Elimination , 2016, ACM Trans. Database Syst..

[39]  Ashish Goel,et al.  Fast Incremental and Personalized PageRank , 2010, Proc. VLDB Endow..

[40]  Christian Bauckhage,et al.  The slashdot zoo: mining a social network with negative edges , 2009, WWW.

[41]  Yasuhiro Fujiwara,et al.  Fast and Exact Top-k Search for Random Walk with Restart , 2012, Proc. VLDB Endow..

[42]  LiuJiming,et al.  Community Mining from Signed Social Networks , 2007 .

[43]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[44]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[45]  Christos Faloutsos,et al.  Beyond 'Caveman Communities': Hubs and Spokes for Graph Compression and Mining , 2011, 2011 IEEE 11th International Conference on Data Mining.

[46]  Lee Sael,et al.  BePI: Fast and Memory-Efficient Method for Billion-Scale Random Walk with Restart , 2017, SIGMOD Conference.

[47]  Christos Faloutsos,et al.  Random walk with restart: fast solutions and applications , 2008, Knowledge and Information Systems.