UserGreedy: Exploiting the Activation Set to Solve Influence Maximization Problem

Influence Maximization is the problem of selecting a small set of seed users in a social network to maximize the spread of influence. Traditional solutions are mainly divided into two directions. The one is greedy-based methods and the other is heuristics-based methods. The greedy-based methods can effectively estimate influence spread using thousands of Monte-Carlo simulations. However, the computational cost of simulation is extremely expensive so that they are not scalable to large networks. The heuristics-based methods, estimating influence spread according to heuristic strategies, have low computational cost but without theoretical guarantees. In order to improve both performance and effectiveness, in this paper we propose a greedy-based algorithm, named UserGreedy. In UserGreedy, we first propose a novel concept called Activation Set, which is defined as a set of users that can be activated by a seed user with a certain probability under the most standard and popular independent cascade (IC) model. Based on the computation of such probabilities, we can directly estimate the influence spread without the expensive simulation process. We then design an influence spread function based on the the Activation Set and mathematically prove that it has the property of monotonicity and submodularity, which provides theoretical guarantee for the UserGreedy algorithm. Besides, we also propose an efficient method to obtain the Activation Set and hence implement the UserGreedy algorithm. Experiments on real-world social networks demonstrate that our algorithm is much faster than existing greedy-based algorithms and outperforms the state-of-art heuristics-based algorithms in terms of effectiveness.

[1]  Bernhard Schölkopf,et al.  Influence Maximization in Continuous Time Diffusion Networks , 2012, ICML.

[2]  Yifeng Zeng,et al.  Influence Maximization with Novelty Decay in Social Networks , 2014, AAAI.

[3]  Xueqi Cheng,et al.  StaticGreedy: solving the scalability-accuracy dilemma in influence maximization , 2012, CIKM.

[4]  Wei Chen,et al.  Influence Blocking Maximization in Social Networks under the Competitive Linear Threshold Model , 2011, SDM.

[5]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[6]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[7]  Ning Zhang,et al.  Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process , 2012, AAAI.

[8]  Shishir Bharathi,et al.  Competitive Influence Maximization in Social Networks , 2007, WINE.

[9]  Sourav S. Bhowmick,et al.  CINEMA: conformity-aware greedy algorithm for influence maximization in online social networks , 2013, EDBT '13.

[10]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[11]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[12]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[13]  Kyomin Jung,et al.  IRIE: Scalable and Robust Influence Maximization in Social Networks , 2011, 2012 IEEE 12th International Conference on Data Mining.

[14]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[15]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[16]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.