Modeling influence diffusion in networks for community detection, resilience analysis and viral marketing

The past decades have seen a fast-growing and dynamic trend of network science and its applications. From the Internet to Facebook, from telecommunications to power grids, from protein interactions to paper citations, networks are everywhere and the network paradigm is pervasive. Network analysis and mining has become an important tool for scientific research and industrial applications to diverse domains. For example, finding communities within social networks enables us to identify groups of densely connected customers who may share similar interests and behaviors and thus generate more effective recommender systems; investigating the supply-network topological structure and growth model improves the resilience of supply networks against disruptions; and modeling influence diffusion in social networks provides insights into viral marketing strategies. However, none of these tasks is trivial. In fact, community detection, resilience analysis, and influence-diffusion modeling are all important challenges in complex networks. My PhD research contributes to these endeavors by exploring the implicit knowledge of connectivity and proximity encoded in the network graph topology. Our research originated from an attempt to find communities in networks. After carefully examining real-life communities and the features and limitations of a set of widely-used centrality measures, we develop a simple but powerful reachabilitybased influence-diffusion model. Based upon this model, we propose a new influence centrality and a novel shared-influence-neighbor (SIN) similarity. The former differentiates the comprehensive influence significance more precisely, and the latter gives rise to a refined vertex-pair closeness metric. Then we develop an influence-guided spherical K-means (IGSK) algorithm for community detection. Further, we propose two novel influence-guided label propagation (IGLP) algorithms for finding hierarchical communities in complex networks. Experiments on both real-life networks and synthetic benchmarks demonstrate superior performance of our algorithms in both

[1]  Sergio Gómez,et al.  Size reduction of complex networks preserving modularity , 2007, ArXiv.

[2]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[3]  Wei Chen,et al.  Influence Blocking Maximization in Social Networks under the Competitive Linear Threshold Model , 2011, SDM.

[4]  Samuel Schmidt,et al.  The political network in Mexico , 1996 .

[5]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[6]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Lada A. Adamic,et al.  Tracking information epidemics in blogspace , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[9]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[10]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[11]  N. Christakis,et al.  The Spread of Obesity in a Large Social Network Over 32 Years , 2007, The New England journal of medicine.

[12]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  P. Bonacich Factoring and weighting approaches to status scores and clique identification , 1972 .

[14]  Renana Peres,et al.  The impact of network characteristics on the diffusion of innovations , 2014 .

[15]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  R. Jarvis,et al.  ClusteringUsing a Similarity Measure Based on SharedNear Neighbors , 1973 .

[17]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[18]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[20]  Yifei Yuan,et al.  Influence Maximization in Social Networks When Negative Opinions May Emerge and Propagate , 2011, SDM.

[21]  Yusoon Kim,et al.  Supply network disruption and resilience: A network structural perspective , 2015 .

[22]  Chung-Kuan Cheng,et al.  Towards efficient hierarchical designs by ratio cut partitioning , 1989, 1989 IEEE International Conference on Computer-Aided Design. Digest of Technical Papers.

[23]  Mourad Ykhlef,et al.  Toward Information Diffusion Model for Viral Marketing in Business , 2016 .

[24]  Yizhou Sun,et al.  SHRINK: a structural clustering algorithm for detecting hierarchical communities in networks , 2010, CIKM.

[25]  R. Burt Social Contagion and Innovation: Cohesion versus Structural Equivalence , 1987, American Journal of Sociology.

[26]  Wei Chen,et al.  Influence diffusion dynamics and influence maximization in social networks with friend and foe relationships , 2011, WSDM.

[27]  L. Freeman,et al.  Centrality in valued graphs: A measure of betweenness based on network flow , 1991 .

[28]  Wei Chen,et al.  Scalable influence maximization for independent cascade model in large-scale social networks , 2012, Data Mining and Knowledge Discovery.

[29]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[30]  Inderjit S. Dhillon,et al.  Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.

[31]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[32]  Esteban Moro,et al.  Impact of human activity patterns on the dynamics of information diffusion. , 2009, Physical review letters.

[33]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[34]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[35]  J. Laurie Snell,et al.  Markov Random Fields and Their Applications , 1980 .

[36]  Matteo Pellegrini,et al.  Detecting Communities Based on Network Topology , 2014, Scientific Reports.

[37]  John Yen,et al.  Analyzing the Resilience of Complex Supply Network Topologies Against Random and Targeted Disruptions , 2011, IEEE Systems Journal.

[38]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[39]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[40]  Cameron Marlow,et al.  A 61-million-person experiment in social influence and political mobilization , 2012, Nature.

[41]  Laks V. S. Lakshmanan,et al.  Information and Influence Propagation in Social Networks , 2013, Synthesis Lectures on Data Management.

[42]  Michael R. Lyu,et al.  Mining social networks using heat diffusion processes for marketing candidates selection , 2008, CIKM '08.

[43]  Huijun Sun,et al.  SCALE-FREE CHARACTERISTICS OF SUPPLY CHAIN DISTRIBUTION NETWORKS , 2005 .

[44]  Pietro Liò,et al.  Towards real-time community detection in large networks. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[45]  William Nick Street,et al.  A novel algorithm for community detection and influence ranking in social networks , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[46]  Soundar R. T. Kumara,et al.  Survivability of multiagent-based supply networks: a topological perspect , 2004, IEEE Intelligent Systems.

[47]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[48]  P. Howard,et al.  Opening Closed Regimes: What Was the Role of Social Media During the Arab Spring? , 2011 .

[49]  Jiawei Han,et al.  Learning influence from heterogeneous social networks , 2012, Data Mining and Knowledge Discovery.

[50]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[52]  Massimo Marchiori,et al.  Error and attacktolerance of complex network s , 2004 .

[53]  Yong Zhou,et al.  A Node Influence Based Label Propagation Algorithm for Community Detection in Networks , 2014, TheScientificWorldJournal.

[54]  William Nick Street,et al.  Finding Hierarchical Communities in Complex Networks Using Influence-Guided Label Propagation , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[55]  Allan Borodin,et al.  Threshold Models for Competitive Influence in Social Networks , 2010, WINE.

[56]  Thomas W. Valente,et al.  Opinion Leadership and Social Contagion in New Product Diffusion , 2011, Mark. Sci..

[57]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[58]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[59]  Germaine H. Saad,et al.  Managing Disruption Risks in Supply Chains , 2005 .

[60]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[61]  M. A. Muñoz,et al.  Journal of Statistical Mechanics: An IOP and SISSA journal Theory and Experiment Detecting network communities: a new systematic and efficient algorithm , 2004 .

[62]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[64]  E. Muller,et al.  Decomposing the Value of Word-of-Mouth Seeding Programs: Acceleration vs. Expansion , 2012 .

[65]  William Rand,et al.  Agent-Based Modeling in Marketing: Guidelines for Rigor , 2011 .

[66]  Laks V. S. Lakshmanan,et al.  SIMPATH: An Efficient Algorithm for Influence Maximization under the Linear Threshold Model , 2011, 2011 IEEE 11th International Conference on Data Mining.

[67]  Gao Cong,et al.  Simulated Annealing Based Influence Maximization in Social Networks , 2011, AAAI.

[68]  Paul Jen-Hwa Hu,et al.  Predicting Adoption Probabilities in Social Networks , 2012, Inf. Syst. Res..

[69]  Elchanan Mossel,et al.  Submodularity of Influence in Social Networks: From Local to Global , 2010, SIAM J. Comput..

[70]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[71]  Boleslaw K. Szymanski,et al.  LabelRank: A stabilized label propagation algorithm for community detection in networks , 2013, 2013 IEEE 2nd Network Science Workshop (NSW).

[72]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[73]  Brian Tomlin,et al.  On the Value of Mitigation and Contingency Strategies for Managing Supply Chain Disruption Risks , 2006, Manag. Sci..

[74]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[75]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[76]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[77]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[78]  François Fouss,et al.  Graph nodes clustering with the sigmoid commute-time kernel: A comparative study , 2009, Data Knowl. Eng..

[79]  Heiko Rieger,et al.  Random walks on complex networks. , 2004, Physical review letters.

[80]  Jiawei Han,et al.  gSkeletonClu: Density-Based Network Clustering via Structure-Connected Tree Division or Agglomeration , 2010, 2010 IEEE International Conference on Data Mining.

[81]  Chris Arney Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives - How Your Friends' Friends' Friends Affect Everything You Feel, Think, and Do , 2014 .

[82]  Santosh S. Vempala,et al.  On clusterings-good, bad and spectral , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[83]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[84]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[85]  Yifei Yuan,et al.  Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[86]  Ning Zhang,et al.  Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process , 2012, AAAI.

[87]  Wolfgang Gaissmaier,et al.  The amplification of risk in experimental diffusion chains , 2015, Proceedings of the National Academy of Sciences.

[88]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[89]  José M. Vidal,et al.  Supply network topology and robustness against disruptions – an investigation using multi-agent model , 2011 .

[90]  Chris Volinsky,et al.  Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks , 2006, math/0606278.

[91]  Nitesh V. Chawla,et al.  Is Objective Function the Silver Bullet? A Case Study of Community Detection Algorithms on Social Networks , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[92]  Jon M. Kleinberg,et al.  Overview of the 2003 KDD Cup , 2003, SKDD.

[93]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[94]  Gert Sabidussi,et al.  The centrality index of a graph , 1966 .

[95]  S. Jurvetson What exactly is viral marketing , 2000 .

[96]  Damon Centola,et al.  The Spread of Behavior in an Online Social Network Experiment , 2010, Science.

[97]  Yanjun Li,et al.  A Framework to Model the Topological Structure of Supply Networks , 2011, IEEE Transactions on Automation Science and Engineering.

[98]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[99]  Hui-Huang Chen,et al.  Complex Network Characteristics and Invulnerability Simulating Analysis of Supply Chain , 2012, J. Networks.

[100]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[101]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[102]  Leysia Palen,et al.  Twitter adoption and use in mass convergence and emergency events , 2009 .

[103]  Nicholas A. Christakis,et al.  Social contagion theory: examining dynamic social networks and human behavior , 2011, Statistics in medicine.

[104]  Ernesto Estrada,et al.  Communicability in complex networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[105]  Evaggelia Pitoura,et al.  Diffusion Maximization in Evolving Social Networks , 2015, COSN.

[106]  William Nick Street,et al.  Topological resilience analysis of supply networks under random disruptions and targeted attacks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[107]  Laks V. S. Lakshmanan,et al.  CELF++: optimizing the greedy algorithm for influence maximization in social networks , 2011, WWW.

[108]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[109]  P. Clifford,et al.  A model for spatial conflict , 1973 .

[110]  Juan-Zi Li,et al.  Social Influence Locality for Modeling Retweeting Behaviors , 2013, IJCAI.

[111]  Judd Harrison Michael,et al.  Modeling the communication network in a sawmill , 1997 .

[112]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[113]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[114]  Laks V. S. Lakshmanan,et al.  Maximizing product adoption in social networks , 2012, WSDM '12.

[115]  Jeffrey T. Hancock,et al.  Experimental evidence of massive-scale emotional contagion through social networks , 2014, Proceedings of the National Academy of Sciences.

[116]  William Nick Street,et al.  Modeling influence diffusion to uncover influence centrality and community structure in social networks , 2015, Social Network Analysis and Mining.

[117]  S. Brenner,et al.  The structure of the nervous system of the nematode Caenorhabditis elegans. , 1986, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[118]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[119]  Jennifer Neville,et al.  Randomization tests for distinguishing social influence and homophily effects , 2010, WWW '10.

[120]  C. Bulte,et al.  Referral Programs and Customer Value. , 2011 .

[121]  Ernesto Estrada,et al.  Communicability graph and community structures in complex networks , 2009, Appl. Math. Comput..

[122]  Sarah J. S. Wilner,et al.  Networked Narratives: Understanding Word-of-Mouth Marketing in Online Communities , 2009 .

[123]  Ulrik Brandes,et al.  Centrality Measures Based on Current Flow , 2005, STACS.

[124]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.

[125]  Jan U. Becker,et al.  Seeding Strategies for Viral Marketing: An Empirical Comparison , 2011 .

[126]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[127]  Greg Thomas Building the buzz in the hive mind , 2004 .

[128]  Reinhard Lipowsky,et al.  Network Brownian Motion: A New Method to Measure Vertex-Vertex Proximity and to Identify Communities and Subcommunities , 2004, International Conference on Computational Science.

[129]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[130]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[131]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[132]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[133]  John Yen,et al.  Achieving High Robustness in Supply Distribution Networks by Rewiring , 2011, IEEE Transactions on Engineering Management.

[134]  S. vanDongen Graph Clustering by Flow Simulation , 2000 .

[135]  Yizhou Sun,et al.  RankClus: integrating clustering with ranking for heterogeneous information network analysis , 2009, EDBT '09.

[136]  Martin Rosvall,et al.  An information-theoretic framework for resolving community structure in complex networks , 2007, Proceedings of the National Academy of Sciences.

[137]  Andrea Lancichinetti,et al.  Detecting the overlapping and hierarchical community structure in complex networks , 2008, 0802.1218.

[138]  Christos Faloutsos,et al.  Cascading Behavior in Large Blog Graphs , 2007 .

[139]  A. Arenas,et al.  Models of social networks based on social distance attachment. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[140]  Peter H. Reingen,et al.  Social Ties and Word-of-Mouth Referral Behavior , 1987 .

[141]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.