Influence maximization across heterogeneous interconnected networks based on deep learning

Abstract With the fast development of online social networks, a large number of their members are involved in more than one social network. Finding most influential users is one of the interesting social network analysis tasks. The influence maximization (IM) problem aims to select a minimum set of users who maximize the influence spread on the underlying network. Most of the previous researches only focus on a single social networks, whereas in real world, users join to multiple social networks. Thus, influence can spread through common users on multiple networks. Besides, the existing works including simulation based, proxy based and sketch based approaches suffer from different issues including scalability, efficiency and feasibility due to the nature of these approaches for exploring networks and computation of their influence diffusion. Moreover, in the previous algorithms, several heuristics are employed to capture network topology for IM. But, these methods have information loss during network exploration because of their pruning strategies. In this paper, a new research direction is presented for studying IM problem on interconnected networks. The proposed approach employs deep learning techniques to learn the feature vectors of network nodes while preserving both local and global structural information. To the best of our knowledge, network embedding has not yet been used to solve IM problem. Indeed, our algorithm leverages deep learning techniques for feature engineering to extract all the appropriate information related to IM problem for single and interconnected networks. Moreover, we prove that the proposed algorithm is monotone and submodular, thus, an optimal solution is guaranteed by the proposed approach. The experimental results on two interconnected networks including DBLP and Twitter-Foursquare illustrate the efficiency of the proposed algorithm in comparison to state of the art IM algorithms. We also conduct some experiments on NetHept dataset to evaluate the performance of the proposed approach on single networks.

[1]  Philip S. Yu,et al.  Discover Tipping Users for Cross Network Influencing (Invited Paper) , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[2]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[3]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[4]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[5]  Philip S. Yu,et al.  Influence Maximization Across Partially Aligned Heterogenous Social Networks , 2015, PAKDD.

[6]  Rossano Gaeta,et al.  A Model of Information Diffusion in Interconnected Online Social Networks , 2018, ACM Trans. Web.

[7]  Dong Yu,et al.  Investigation of full-sequence training of deep belief networks for speech recognition , 2010, INTERSPEECH.

[8]  Geoffrey E. Hinton,et al.  Discovering Binary Codes for Documents by Learning Deep Generative Models , 2011, Top. Cogn. Sci..

[9]  My T. Thai,et al.  Interest-matching information propagation in multiple online social networks , 2012, CIKM.

[10]  Masoud Asadpour,et al.  Community Aware Random Walk for Network Embedding , 2017, Knowl. Based Syst..

[11]  Huiling Zhang,et al.  Least Cost Influence Maximization Across Multiple Social Networks , 2016, IEEE/ACM Transactions on Networking.

[12]  Yuanyuan Tian,et al.  Event-based social networks: linking the online and offline social worlds , 2012, KDD.

[13]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[14]  Divyakant Agrawal,et al.  Limiting the spread of misinformation in social networks , 2011, WWW.

[15]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[16]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[17]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[18]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[19]  Dino Pedreschi,et al.  NDlib: a python library to model and analyze diffusion processes over complex networks , 2017, International Journal of Data Science and Analytics.

[20]  Laks V. S. Lakshmanan,et al.  SIMPATH: An Efficient Algorithm for Influence Maximization under the Linear Threshold Model , 2011, 2011 IEEE 11th International Conference on Data Mining.

[21]  Dino Pedreschi,et al.  NDlib: Studying Network Diffusion Dynamics , 2017, 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[22]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[23]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[24]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[25]  Mao Ye,et al.  Exploring social influence for recommendation: a generative model approach , 2012, SIGIR '12.

[26]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[27]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.

[28]  Yifei Yuan,et al.  Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[29]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[30]  Hui Xiong,et al.  Influence Maximization over Large-Scale Social Networks: A Bounded Linear Approach , 2014, CIKM.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Yuchen Li,et al.  Influence Maximization on Social Graphs: A Survey , 2018, IEEE Transactions on Knowledge and Data Engineering.

[33]  Masahiro Kimura,et al.  Tractable Models for Information Diffusion in Social Networks , 2006, PKDD.

[34]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[35]  Laks V. S. Lakshmanan,et al.  CELF++: optimizing the greedy algorithm for influence maximization in social networks , 2011, WWW.

[36]  My T. Thai,et al.  Least Cost Influence in Multiplex Social Networks: Model Representation and Analysis , 2013, 2013 IEEE 13th International Conference on Data Mining.

[37]  Wei Wang,et al.  Deep Embedding Network for Clustering , 2014, 2014 22nd International Conference on Pattern Recognition.

[38]  Junshan Zhang,et al.  Information diffusion in overlaying social-physical networks , 2012, 2012 46th Annual Conference on Information Sciences and Systems (CISS).

[39]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.