BalanSiNG: Fast and Scalable Generation of Realistic Signed Networks

How can we e ciently generate large-scale signed networks following real-world properties? Due to its rich modeling capability of representing trust relations as positive and negative edges, signed networks have spurred much interests with various applications. Despite its importance, however, existing models for generating signed networks do not correctly re ect properties of real-world signed networks. In this paper, we propose BalanSiNG, a novel, scalable, and fully parallelizable method for generating large-scale signed networks following realistic properties. We identify a self-similar balanced structure observed from a real-world signed network, and simulate the self-similarity via Kronecker product. Then, we exploit noise and careful weighting of signs such that our resulting network obeys various properties of real-world signed networks. BalanSiNG is easily parallelizable, and we implement it using Spark. Extensive experiments show that BalanSiNG efciently generates the most realistic signed networks satisfying various desired properties.

[1]  J. Pei,et al.  Finding Gangs in War from Signed Networks , 2016, KDD.

[2]  Manfred Schroeder,et al.  Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise , 1992 .

[3]  Matt Thomas,et al.  Get out the vote: Determining support or opposition from Congressional floor-debate transcripts , 2006, EMNLP.

[4]  Lee Sael,et al.  Random Walk with Restart on Large Graphs Using Block Elimination , 2016, ACM Trans. Database Syst..

[5]  Lee Sael,et al.  Personalized Ranking in Signed Networks Using Signed Random Walk with Restart , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[6]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[7]  Michael Szell,et al.  Multirelational organization of large-scale social networks in an online world , 2010, Proceedings of the National Academy of Sciences.

[8]  Jennifer Neville,et al.  Fast Generation of Large Scale Social Networks While Incorporating Transitive Closures , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[9]  Christos Faloutsos,et al.  Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication , 2005, PKDD.

[10]  Charu C. Aggarwal,et al.  Signed Network Modeling Based on Structural Balance Theory , 2017, CIKM.

[11]  Christos Faloutsos,et al.  Graph mining: Laws, generators, and algorithms , 2006, CSUR.

[12]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[13]  Charu C. Aggarwal,et al.  A Survey of Signed Network Mining in Social Media , 2015, ACM Comput. Surv..

[14]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[15]  Himchan Park,et al.  TrillionG: A Trillion-scale Synthetic Graph Generator using a Recursive Vector Model , 2017, SIGMOD Conference.

[16]  Jiming Liu,et al.  Community Mining from Signed Social Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[17]  Peter Abell,et al.  An evolutionary model of social networks , 2007 .

[18]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[19]  Sung-Hyon Myaeng,et al.  Partition Aware Connected Component Computation in Distributed Systems , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[20]  U Kang,et al.  TeGViz: Distributed Tera-Scale Graph Generation and Visualization , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[21]  Christos Faloutsos,et al.  Edge Weight Prediction in Weighted Signed Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[22]  Jinhong Jung,et al.  Supervised and extended restart in random walks for ranking and link prediction in networks , 2017, PloS one.

[23]  Charu C. Aggarwal,et al.  Node Classification in Signed Social Networks , 2016, SDM.

[24]  Jia Wu,et al.  Link Prediction with Signed Latent Factors in Signed Social Networks , 2019, KDD.

[25]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[26]  Dacheng Tao,et al.  Efficient Latent Link Recommendation in Signed Networks , 2015, KDD.

[27]  Jinhong Jung,et al.  Random walk-based ranking in signed social networks: model and algorithms , 2019, Knowledge and Information Systems.

[28]  U Kang,et al.  PegasusN: A Scalable and Versatile Graph Mining System , 2018, AAAI.

[29]  Charu C. Aggarwal,et al.  Signed Network Embedding in Social Media , 2017, SDM.

[30]  P. Holland,et al.  TRANSITIVITY IN STRUCTURAL MODELS OF SMALL GROUPS , 1977 .

[31]  Tamara G. Kolda,et al.  An in-depth analysis of stochastic Kronecker graphs , 2011, JACM.

[32]  Hui Fang,et al.  Supervised User Ranking in Signed Social Networks , 2019, AAAI.

[33]  V. S. Subrahmanian,et al.  Accurately detecting trolls in Slashdot Zoo via decluttering , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[34]  Christos Faloutsos,et al.  PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[35]  Riste Skrekovski,et al.  Modeling acquaintance networks based on balance theory , 2014, Int. J. Appl. Math. Comput. Sci..

[36]  Brian W. Barrett,et al.  Introducing the Graph 500 , 2010 .

[37]  G. Cecchi,et al.  Scale-free brain functional networks. , 2003, Physical review letters.

[38]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[39]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[40]  Sung-Hyon Myaeng,et al.  PTE: Enumerating Trillion Triangles On Distributed Systems , 2016, KDD.

[41]  Tamara G. Kolda,et al.  An In-depth Study of Stochastic Kronecker Graphs , 2011, 2011 IEEE 11th International Conference on Data Mining.

[42]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[43]  Christos Faloutsos,et al.  R-MAT: A Recursive Model for Graph Mining , 2004, SDM.

[44]  Lee Sael,et al.  BePI: Fast and Memory-Efficient Method for Billion-Scale Random Walk with Restart , 2017, SIGMOD Conference.

[45]  Christos Faloutsos,et al.  Kronecker Graphs: An Approach to Modeling Networks , 2008, J. Mach. Learn. Res..

[46]  Lee Sael,et al.  BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs , 2015, SIGMOD Conference.

[47]  Junghwan Kim,et al.  SIDE: Representation Learning in Signed Directed Networks , 2018, WWW.