CoFIM: A community-based framework for influence maximization on large-scale networks

Abstract Influence maximization is a classic optimization problem studied in the area of social network analysis and viral marketing. Given a network, it is defined as the problem of finding k seed nodes so that the influence spread of the network can be optimized. Kempe et al. have proved that this problem is NP hard and the objective function is submodular, based on which a greedy algorithm was proposed to give a near-optimal solution. However, this simple greedy algorithm is time consuming, which limits its application on large-scale networks. Heuristic algorithms generally cannot provide any performance guarantee. To solve this problem, in this paper we propose CoFIM, a community-based framework for influence maximization on large-scale networks. In our framework the influence propagation process is divided into two phases: (i) seeds expansion; and (ii) intra-community propagation. The first phase is the expansion of seed nodes among different communities at the beginning of diffusion. The second phase is the influence propagation within communities which are independent of each other. Based on the framework, we derive a simple evaluation form of the total influence spread which is submodular and can be efficiently computed. Then we further propose a fast algorithm to select the seed nodes. Experimental results on synthetic and nine real-world large datasets including networks with millions of nodes and hundreds of millions of edges show that our algorithm achieves competitive results in influence spread as compared with state-of-the-art algorithms and it is much more efficient in terms of both time and memory usage.

[1]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[2]  Xiaokui Xiao,et al.  Influence maximization: near-optimal time complexity meets practical efficiency , 2014, SIGMOD Conference.

[3]  Cheng Wu,et al.  Targeted revision: A learning-based approach for incremental community detection in dynamic networks , 2016 .

[4]  Vijay Mahajan,et al.  New Product Diffusion Models in Marketing: A Review and Directions for Research: , 1990 .

[5]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[6]  Asaf Shapira,et al.  A note on maximizing the spread of influence in social networks , 2011, Inf. Process. Lett..

[7]  C. A. Murthy,et al.  A New Centrality Measure for Influence Maximization in Social Networks , 2011, PReMI.

[8]  Jie Tang,et al.  Influence Maximization in Dynamic Social Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[9]  Zonghua Liu,et al.  How community structure influences epidemic spread in social networks , 2008 .

[10]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Song Wang,et al.  OASNET: an optimal allocation approach to influence maximization in modular social networks , 2010, SAC '10.

[12]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[13]  Han Zhao,et al.  Identifying influential nodes in complex networks with community structure , 2013, Knowl. Based Syst..

[14]  Xin Zhang,et al.  IncOrder: Incremental density-based community detection in dynamic networks , 2014, Knowl. Based Syst..

[15]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Chengcui Zhang,et al.  A fast parallel modularity optimization algorithm (FPMQA) for community detection in online social network , 2013, Knowl. Based Syst..

[17]  Yitong Wang,et al.  A Potential-Based Node Selection Strategy for Influence Maximization in a Social Network , 2009, ADMA.

[18]  Matthew Richardson,et al.  Trust Management for the Semantic Web , 2003, SEMWEB.

[19]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[20]  Xiaokui Xiao,et al.  Influence Maximization in Near-Linear Time: A Martingale Approach , 2015, SIGMOD Conference.

[21]  Marco Pellegrini,et al.  Extraction and classification of dense communities in the web , 2007, WWW '07.

[22]  Aristides Gionis,et al.  Sparsification of influence networks , 2011, KDD.

[23]  Hemank Lamba,et al.  A Novel and Model Independent Approach for Efficient Influence Maximization in Social Networks , 2013, WISE.

[24]  Masahiro Kimura,et al.  Tractable Models for Information Diffusion in Social Networks , 2006, PKDD.

[25]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[26]  Feng Xie,et al.  Epidemic spreading on complex networks with overlapping and non-overlapping community structure , 2015 .

[27]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[28]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[29]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[30]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[31]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[33]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[34]  Alessandro Vespignani,et al.  Epidemic spreading in scale-free networks. , 2000, Physical review letters.

[35]  Cheng Wu,et al.  A Real-Time Detecting Algorithm for Tracking Community Structure of Dynamic Networks , 2014, ArXiv.

[36]  Bambi Hu,et al.  Epidemic spreading in community networks , 2005 .

[37]  Peter H. Reingen,et al.  Social Ties and Word-of-Mouth Referral Behavior , 1987 .

[38]  L. Takac DATA ANALYSIS IN PUBLIC SOCIAL NETWORKS , 2012 .

[39]  Jinha Kim,et al.  Scalable and parallelizable processing of influence maximization for large-scale social networks? , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[40]  Laks V. S. Lakshmanan,et al.  CELF++: optimizing the greedy algorithm for influence maximization in social networks , 2011, WWW.

[41]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[42]  Cheng Wu,et al.  How Overlapping Community Structure Affects Epidemic Spreading in Complex Networks , 2014, 2014 IEEE 38th International Computer Software and Applications Conference Workshops.

[43]  Suh-Yin Lee,et al.  CIM: Community-Based Influence Maximization in Social Networks , 2014, TIST.

[44]  Yifei Yuan,et al.  Influence Maximization in Social Networks When Negative Opinions May Emerge and Propagate , 2011, SDM.

[45]  Maxim Sviridenko,et al.  A note on maximizing a submodular set function subject to a knapsack constraint , 2004, Oper. Res. Lett..

[46]  Wei Chen,et al.  Influence Blocking Maximization in Social Networks under the Competitive Linear Threshold Model , 2011, SDM.

[47]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Wonyeol Lee,et al.  CT-IC: Continuously activated and Time-restricted Independent Cascade model for viral marketing , 2014, Knowl. Based Syst..

[49]  Xiaodong Liu,et al.  IMGPU: GPU-Accelerated Influence Maximization in Large-Scale Social Networks , 2014, IEEE Transactions on Parallel and Distributed Systems.

[50]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[51]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[52]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[53]  Jiangtao Cui,et al.  Conformity-aware influence maximization in online social networks , 2014, The VLDB Journal.