Efficient presolving methods for influence maximization problem in social networks

The influence maximization problem (IMP) is a hot topic, which asks for identifying a limited number of key individuals to spread influence in a social network such that the expected number of influenced individuals is maximized. The stochastic maximal covering location problem (SMCLP) formulation is a mixed integer programming formulation that approximates the IMP by the Monte-Carlo sampling. However, the SMCLP formulation cannot be solved efficiently using existing exact algorithms due to its large problem size. In this paper, we concentrate on deriving presolving methods to reduce the problem size and hence enhance the capability of employing exact algorithms in solving the IMP. In particular, we propose two effective presolving methods, called the strongly connected nodes aggregation (SCNA) and the isomorphic nodes aggregation (INA). The SCNA enables us to build a new SMCLP formulation that is much more compact than the existing one. For the INA, an analysis is given on the one-way bipartite social network. Specifically, we show that under certain conditions, the problem size of the reduced SMCLP formulation depends only on the size of the given network but not on the number of samplings and this reduced formulation is strongly polynomialtime solvable. Finally, we integrate the proposed presolving methods SCNA and INA into the Benders decomposition algorithm, which is recognized as one of the state-of-the-art exact algorithms for solving the IMP. Numerical results demonstrate that with the SCNA and INA, the Benders decomposition algorithm is much more effective in solving the IMP in terms of solution time.

[1]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[2]  Yifei Yuan,et al.  Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[3]  Peng Zhang,et al.  Minimizing seed set selection with probabilistic coverage guarantee in a social network , 2014, KDD.

[4]  Rui Zhang,et al.  A Branch-and-Cut Approach for the Weighted Target Set Selection Problem on Social Networks , 2019 .

[5]  Charles A. Sutton,et al.  GEMSEC: Graph Embedding with Self Clustering , 2018, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[6]  R. Borndörfer,et al.  Aspects of Set Packing, Partitioning, and Covering , 1998 .

[7]  Xiaokui Xiao,et al.  Influence Maximization in Near-Linear Time: A Martingale Approach , 2015, SIGMOD Conference.

[8]  Simge Küçükyavuz,et al.  A two-stage stochastic programming approach for influence maximization in social networks , 2018, Comput. Optim. Appl..

[9]  Robert E. Bixby,et al.  Presolve Reductions in Mixed Integer Programming , 2020, INFORMS J. Comput..

[10]  Rui Zhang,et al.  Least-Cost Influence Maximization on Social Networks , 2020, INFORMS J. Comput..

[11]  Xueqi Cheng,et al.  StaticGreedy: solving the scalability-accuracy dilemma in influence maximization , 2012, CIKM.

[12]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[13]  Yuchen Li,et al.  Influence Maximization on Social Graphs: A Survey , 2018, IEEE Transactions on Knowledge and Data Engineering.

[14]  Evren Güney,et al.  An efficient linear programming based method for the influence maximization problem in social networks , 2019, Inf. Sci..

[15]  Markus Leitner,et al.  Benders decomposition for competitive influence maximization in (social) networks , 2020, Omega.

[16]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[17]  Xiang Li,et al.  TipTop: (Almost) Exact Solutions for Influence Maximization in Billion-Scale Networks , 2017, IEEE/ACM Transactions on Networking.

[18]  Kathleen M. Carley,et al.  Patterns and dynamics of users' behavior and interaction: Network analysis of an online community , 2009, J. Assoc. Inf. Sci. Technol..

[19]  Shourya Roy,et al.  Holistic Influence Maximization: Combining Scalability and Efficiency with Opinion-Aware Models , 2016, SIGMOD Conference.

[20]  G. Ergun Human Sexual Contact Network as a Bipartite Graph , 2001, cond-mat/0111323.

[21]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[22]  Matteo Fischetti,et al.  Least cost influence propagation in (social) networks , 2018, Math. Program..

[23]  Ian T. Foster,et al.  Mapping the Gnutella Network: Macroscopic Properties of Large-Scale Peer-to-Peer Systems , 2002, IPTPS.

[24]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[25]  Markus Leitner,et al.  Large-scale influence maximization via maximal covering location , 2021, Eur. J. Oper. Res..

[26]  Juan José Salazar González,et al.  Exact approaches to the single‐source network loading problem , 2012, Networks.

[27]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[28]  M. Sharir,et al.  A strong-connectivity algorithm and its applications in data flow analysis. , 2018 .

[29]  Simge Küçükyavuz,et al.  Probabilistic Partial Set Covering with an Oracle for Chance Constraints , 2019, SIAM J. Optim..

[30]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[31]  J. Christopher Beck,et al.  Using dual presolving reductions to reformulate cumulative constraints , 2013, Constraints.

[32]  Xiaokui Xiao,et al.  Influence maximization: near-optimal time complexity meets practical efficiency , 2014, SIGMOD Conference.

[33]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[34]  Matteo Fischetti,et al.  Exploiting Erraticism in Search , 2014, Oper. Res..

[35]  Andrea Lodi,et al.  Performance Variability in Mixed-Integer Programming , 2013 .

[36]  Masahiro Kimura,et al.  Tractable Models for Information Diffusion in Social Networks , 2006, PKDD.