Speeding-up node influence computation for huge social networks

We address the problem of efficiently estimating the influence degree for all the nodes simultaneously in the network under the SIR setting. The proposed approach is a further improvement over the existing work of the bond percolation process which was demonstrated to be very effective, i.e., three orders of magnitude faster than direct Monte Carlo simulation, in approximately solving the influence maximization problem. We introduce two pruning techniques which improve computational efficiency by an order of magnitude. This approach is generic and can be instantiated to any specific diffusion model. It does not require any approximations or assumptions to the model that were needed in the existing approaches. We demonstrate its effectiveness by extensive experiments on two large real social networks. Main finding includes that different network structures have different epidemic thresholds and the node influence can identify influential nodes that the existing centrality measures cannot. We analyze how the performance changes when the network structure is systematically changed using synthetically generated networks and identify important factors that affect the performance.

[1]  A. Vázquez Growing network with local rules: preferential attachment, clustering hierarchy, and degree correlations. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[3]  Silvio Lattanzi,et al.  Efficient Algorithms for Public-Private Social Networks , 2015, KDD.

[4]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[5]  Christos Faloutsos,et al.  Patterns of Cascading Behavior in Large Blog Graphs , 2007, SDM.

[6]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[7]  Laks V. S. Lakshmanan,et al.  A Data-Based Approach to Social Influence Maximization , 2011, Proc. VLDB Endow..

[8]  Hai Zhuge,et al.  Topological centrality and its e-Science applications , 2010, J. Assoc. Inf. Sci. Technol..

[9]  Masahiro Kimura,et al.  Efficient analysis of node influence based on SIR model over huge complex networks , 2014, 2014 International Conference on Data Science and Advanced Analytics (DSAA).

[10]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[11]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[12]  Masahiro Kimura,et al.  Extracting influential nodes on a social network for information diffusion , 2009, Data Mining and Knowledge Discovery.

[13]  Sebastiano Vigna,et al.  In-Core Computation of Geometric Centralities with HyperBall: A Hundred Billion Nodes and Beyond , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[14]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[15]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[16]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[17]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Yifei Yuan,et al.  Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[19]  Yu Wang,et al.  Influence Maximization on Large-Scale Mobile Social Network: A Divide-and-Conquer Method , 2015, IEEE Transactions on Parallel and Distributed Systems.

[20]  Masahiro Kimura,et al.  Resampling-Based Framework for Estimating Node Centrality of Large Social Network , 2014, Discovery Science.

[21]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[22]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[23]  Enhong Chen,et al.  On Approximation of Real-World Influence Spread , 2012, ECML/PKDD.

[24]  Laks V. S. Lakshmanan,et al.  Information and Influence Propagation in Social Networks , 2013, Synthesis Lectures on Data Management.

[25]  Li Guo,et al.  On the Upper Bounds of Spread for Greedy Algorithms in Social Network Influence Maximization , 2015, IEEE Transactions on Knowledge and Data Engineering.

[26]  Masahiro Kimura,et al.  Efficient Estimation of Influence Functions for SIS Model on Social Networks , 2009, IJCAI.

[27]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[28]  Masahiro Kimura,et al.  Identifying Super-Mediators of Information Diffusion in Social Networks , 2013, Discovery Science.

[29]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[30]  Rong Zheng,et al.  Influence Spread in Large-Scale Social Networks - A Belief Propagation Approach , 2012, ECML/PKDD.

[31]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[32]  Masahiro Kimura,et al.  Extracting Influential Nodes for Information Diffusion on a Social Network , 2007, AAAI.