Mining Initial Nodes with BSIS Model and BS-G Algorithm on Social Networks for Influence Maximization

Influence maximization is the problem of identifying a set of the most influential nodes, such that their aggregated influence in the network is maximized. This research has great application value for advertising, viral marketing and public opinion monitoring. However, research on influence maximization has often ignored the tendencies of nodes’ behaviors and sentiment. In general, users’ sentiment determines users’ behaviors, and users’ behaviors reflect the influence between users in a social network. In this paper, we design a training model of sentimental words that expands the existing sentimental dictionary using a marked-comment data set, and we propose an influence spread model named BSIS (Behavior and Sentiment Influence Spread), which considers the tendencies of both users’ behaviors and sentiment, to depict and compute the influence between nodes. We also propose an influence maximization algorithm named BS-G (BSIS with Greedy Algorithm) to select the initial nodes. We run experiments on two real social network data sets on a Hadoop and Spark distributed cluster platform. The results show that, on a big data platform, the BSIS model and the BS-G algorithm achieve a better influence spread and a higher quality of seed node selection than approaches based on the traditional IC, LT and CDNF models.
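
As a rough illustration of the greedy seed selection that BS-G builds on, the sketch below picks seeds one at a time by largest estimated marginal gain in spread. It is not the BSIS model: the abstract does not give the BSIS influence formula, so the sketch substitutes a plain Independent Cascade (IC) simulation with a single uniform activation probability. The function names (`simulate_ic`, `expected_spread`, `greedy_seeds`) and the parameters `prob`, `runs` and `seed` are hypothetical and chosen only for this example.

```python
import random


def simulate_ic(graph, seeds, prob, rng):
    """Run one Independent Cascade simulation; return the number of active nodes.

    Assumption: every edge activates with the same probability `prob`.
    A BSIS-style model would instead derive per-edge influence from
    behavior and sentiment, which is not specified here.
    """
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        next_frontier = []
        for u in frontier:
            for v in graph.get(u, ()):
                # Each newly active node gets one chance to activate each neighbor.
                if v not in active and rng.random() < prob:
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return len(active)


def expected_spread(graph, seeds, prob, runs, rng):
    """Monte Carlo estimate of the expected spread of a seed set."""
    total = sum(simulate_ic(graph, seeds, prob, rng) for _ in range(runs))
    return total / runs


def greedy_seeds(graph, k, prob=0.1, runs=200, seed=0):
    """Greedily choose up to k seeds by largest estimated marginal gain."""
    rng = random.Random(seed)
    nodes = set(graph) | {v for nbrs in graph.values() for v in nbrs}
    chosen = []
    for _ in range(min(k, len(nodes))):
        base = expected_spread(graph, chosen, prob, runs, rng) if chosen else 0.0
        best, best_gain = None, -1.0
        # Sorted iteration keeps tie-breaking deterministic.
        for candidate in sorted(nodes - set(chosen)):
            spread = expected_spread(graph, chosen + [candidate], prob, runs, rng)
            gain = spread - base
            if gain > best_gain:
                best, best_gain = candidate, gain
        chosen.append(best)
    return chosen


if __name__ == "__main__":
    # Toy directed graph as an adjacency dict (illustrative data only).
    toy = {"a": ["b", "c"], "b": ["d"], "x": ["y"]}
    print(greedy_seeds(toy, k=2, prob=1.0, runs=5))
```

With `prob=1.0` the cascade reduces to plain reachability, which makes the toy example easy to check by hand. Swapping in a different influence model would only require replacing `simulate_ic`.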
