On the Troll-Trust Model for Edge Sign Prediction in Social Networks

In the problem of edge sign prediction, we are given a directed graph (representing a social network), and our task is to predict the binary labels of the edges (i.e., the positive or negative nature of the social relationships). Many successful heuristics for this problem are based on the troll-trust features, estimating at each node the fraction of outgoing and incoming positive/negative edges. We show that these heuristics can be understood, and rigorously analyzed, as approximators to the Bayes optimal classifier for a simple probabilistic model of the edge labels. We then show that the maximum likelihood estimator for this model approximately corresponds to the predictions of a Label Propagation algorithm run on a transformed version of the original social graph. Extensive experiments on a number of real-world datasets show that this algorithm is competitive against state-of-the-art classifiers in terms of both accuracy and scalability. Finally, we show that troll-trust features can also be used to derive online learning algorithms which have theoretical guarantees even when edges are adversarially labeled.

[1]  Mahdi Jalili,et al.  Ranking Nodes in Signed Social Networks , 2014, Social Network Analysis and Mining.

[2]  Manfred K. Warmuth,et al.  The Weighted Majority Algorithm , 1994, Inf. Comput..

[3]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[4]  David B. Skillicorn,et al.  Spectral Embedding of Signed Networks , 2015, SDM.

[5]  Dongjin Song,et al.  Link sign prediction and ranking in signed directed social networks , 2015, Social Network Analysis and Mining.

[6]  Minghua Chen,et al.  Predicting positive and negative links in signed social networks by transfer learning , 2013, WWW.

[7]  James A. Davis Clustering and Structural Balance in Graphs , 1977 .

[8]  Paul Van Dooren,et al.  The PageTrust Algorithm: How to rank web pages when negative links are allowed? , 2008, SDM.

[9]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[10]  Huan Liu,et al.  Exploiting homophily effect for trust prediction , 2013, WSDM.

[11]  Lauren Wood 技術解説 IEEE Internet Computing , 1999 .

[12]  Sibel Adali,et al.  Foundations of Trust and Distrust in Networks: Extended Structural Balance Theory , 2014, TWEB.

[13]  Charu C. Aggarwal,et al.  The Troll-Trust Model for Ranking in Signed Networks , 2016, WSDM.

[14]  Jure Leskovec,et al.  Antisocial Behavior in Online Discussion Communities , 2015, ICWSM.

[15]  Claudio Gentile,et al.  A Correlation Clustering Approach to Link Classification in Signed Networks , 2012, COLT.

[16]  Anna Monreale,et al.  Classifying Trust/Distrust Relationships in Online Social Networks , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[17]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[18]  Alexander Zien,et al.  Label Propagation and Quadratic Criterion , 2006 .

[19]  Christian Bauckhage,et al.  The slashdot zoo: mining a social network with negative edges , 2009, WWW.

[20]  Vincent A. Traag,et al.  Exponential Ranking: Taking into Account Negative Links , 2010, SocInfo.

[21]  Mark Herbster,et al.  Prediction on a Graph with a Perceptron , 2006, NIPS.

[22]  Claudio Gentile,et al.  A Linear Time Active Learning Algorithm for Link Classification , 2012, NIPS.

[23]  Claudio Gentile,et al.  Adaptive and Self-Confident On-Line Learning Algorithms , 2000, J. Comput. Syst. Sci..

[24]  Guy Lever,et al.  Online Prediction on Large Diameter Graphs , 2008, NIPS.

[25]  P. Holland,et al.  An Exponential Family of Probability Distributions for Directed Graphs , 1981 .

[26]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[27]  Arnab Bhattacharya,et al.  Finding the bias and prestige of nodes in networks based on trust scores , 2011, WWW.

[28]  Charu C. Aggarwal,et al.  A Survey of Signed Network Mining in Social Media , 2015, ACM Comput. Surv..

[29]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[30]  David Haussler,et al.  How to use expert advice , 1993, STOC.

[31]  Talel Abdessalem,et al.  Casting a web of trust over Wikipedia: an interaction-based approach , 2011, WWW.

[32]  Nagarajan Natarajan,et al.  Prediction and clustering in signed networks: a local to global perspective , 2013, J. Mach. Learn. Res..

[33]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[34]  Alessandro Panconesi,et al.  Concentration of Measure for the Analysis of Randomized Algorithms , 2009 .

[35]  F. Heider The psychology of interpersonal relations , 1958 .

[36]  Theodora A. Varvarigou,et al.  Trust inference in online social networks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[37]  Claudio Gentile,et al.  Random Spanning Trees and the Prediction of Weighted Graphs , 2010, ICML.

[38]  Kiyana Zolfaghar,et al.  Mining trust and distrust relationships in social Web applications , 2010, Proceedings of the 2010 IEEE 6th International Conference on Intelligent Computer Communication and Processing.

[39]  Konstantinos Tserpes,et al.  Predicting Edge Signs in Social Networks Using Frequent Subgraph Discovery , 2014, IEEE Internet Computing.

[40]  Avrim Blum,et al.  Learning from Labeled and Unlabeled Data using Graph Mincuts , 2001, ICML.

[41]  Srijan Kumar,et al.  Structure and Dynamics of Signed Citation Networks , 2016, WWW.