Extracting influential nodes on a social network for information diffusion

We address the combinatorial optimization problem of finding the most influential nodes on a large-scale social network for two widely-used fundamental stochastic diffusion models. The past study showed that a greedy strategy can give a good approximate solution to the problem. However, a conventional greedy method faces a computational problem. We propose a method of efficiently finding a good approximate solution to the problem under the greedy algorithm on the basis of bond percolation and graph theory, and compare the proposed method with the conventional method in terms of computational complexity in order to theoretically evaluate its effectiveness. The results show that the proposed method is expected to achieve a great reduction in computational cost. We further experimentally demonstrate that the proposed method is much more efficient than the conventional method using large-scale real-world networks including blog networks.

[1]  Pedro M. Domingos Mining Social Networks for Viral Marketing , 2022 .

[2]  Masahiro Kimura,et al.  Effective Visualization of Information Diffusion Process over Complex Networks , 2008, ECML/PKDD.

[3]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[4]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks , 2005, IJCAI.

[5]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[6]  P. Grassberger On the critical behavior of the general epidemic process and dynamical percolation , 1983 .

[7]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[9]  Bruce A. Reed,et al.  The Size of the Giant Component of a Random Graph with a Given Degree Sequence , 1998, Combinatorics, Probability and Computing.

[10]  Éva Tardos,et al.  Influential Nodes in a Diffusion Model for Social Networks , 2005, ICALP.

[11]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[12]  F. Chung,et al.  Connected Components in Random Graphs with Given Expected Degree Sequences , 2002 .

[13]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[14]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[15]  M. Newman,et al.  Why social networks are different from other types of networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[17]  D S Callaway,et al.  Network robustness and fragility: percolation on random graphs. , 2000, Physical review letters.

[18]  Eyal Even-Dar,et al.  A note on maximizing the spread of influence in social networks , 2007, Inf. Process. Lett..

[19]  Steffen Staab,et al.  Social Networks Applied , 2005, IEEE Intell. Syst..

[20]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[21]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[22]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.