Discerning Influence Patterns with Beta-Poisson Factorization in Microblogging Environments

Social influence analysis in microblogging services has attracted much attention in recent years. However, most previous studies were focused on measuring users’ (topical) influence. Little effort has been made to discern and quantify how a user is influenced. Specifically, the fact that user <inline-formula><tex-math notation="LaTeX">$i$</tex-math><alternatives><mml:math><mml:mi>i</mml:mi></mml:math><inline-graphic xlink:href="guan-ieq1-2897932.gif"/></alternatives></inline-formula> retweets a tweet from author <inline-formula><tex-math notation="LaTeX">$j$</tex-math><alternatives><mml:math><mml:mi>j</mml:mi></mml:math><inline-graphic xlink:href="guan-ieq2-2897932.gif"/></alternatives></inline-formula> could be either because <inline-formula><tex-math notation="LaTeX">$i$</tex-math><alternatives><mml:math><mml:mi>i</mml:mi></mml:math><inline-graphic xlink:href="guan-ieq3-2897932.gif"/></alternatives></inline-formula> is influenced by <inline-formula><tex-math notation="LaTeX">$j$</tex-math><alternatives><mml:math><mml:mi>j</mml:mi></mml:math><inline-graphic xlink:href="guan-ieq4-2897932.gif"/></alternatives></inline-formula> (i.e., <inline-formula><tex-math notation="LaTeX">$j$</tex-math><alternatives><mml:math><mml:mi>j</mml:mi></mml:math><inline-graphic xlink:href="guan-ieq5-2897932.gif"/></alternatives></inline-formula> is a topical authority), or simply because he is “influenced” by the content (interested in the content). To mine such influence patterns, we propose a novel Bayesian factorization model, dubbed Influence Beta-Poisson Factorization (IBPF). IBPF jointly factorizes the retweet data and tweet content to quantify latent topical factors of user preference, author influence and content influence. It generates every retweet record according to the sum of two causing terms: one representing author influence, and the other one derived from content influence. To control the impact of the two terms, for each user IBPF generates a probability for each latent topic by Beta distribution, indicating how strongly the user cares about the topical authority of the author. We develop an efficient variational inference algorithm for IBPF. We demonstrate the efficacy of IBPF on two public microblogging datasets.

[1]  Scott Counts,et al.  Identifying topical authorities in microblogs , 2011, WSDM '11.

[2]  Chong Wang,et al.  Variational inference in nonconjugate models , 2012, J. Mach. Learn. Res..

[3]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[4]  Jianxin Li,et al.  Most Influential Community Search over Large Social Networks , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[5]  David M. Blei,et al.  Content-based recommendations with Poisson factorization , 2014, NIPS.

[6]  Yang Yang,et al.  Collaborative Filtering with Social Exposure: A Modular Approach to Social Recommendation , 2017, AAAI.

[7]  Juan Hu,et al.  Topical authority propagation on microblogs , 2013, CIKM.

[8]  Yang Liu,et al.  Who Influenced You? Predicting Retweet via Social Influence Locality , 2015, ACM Trans. Knowl. Discov. Data.

[9]  David M. Blei,et al.  Scalable Recommendation with Poisson Factorization , 2013, ArXiv.

[10]  Yong Yu,et al.  Diffusion-aware personalized social update recommendation , 2013, RecSys.

[11]  Xuanjing Huang,et al.  Retweet Behavior Prediction Using Hierarchical Dirichlet Process , 2015, AAAI.

[12]  Rizal Setya Perdana What is Twitter , 2013 .

[13]  Wei Zhang,et al.  A Collective Bayesian Poisson Factorization Model for Cold-start Local Event Recommendation , 2015, KDD.

[14]  Xiaohui Yan,et al.  A biterm topic model for short texts , 2013, WWW.

[15]  Mohammed J. Zaki,et al.  ProfileRank: finding relevant content and influential users based on information diffusion , 2013, SNAKDD '13.

[16]  Xiaoyu Du,et al.  Outer Product-based Neural Collaborative Filtering , 2018, IJCAI.

[17]  Fabián Riquelme,et al.  Measuring user influence on Twitter: A survey , 2015, Inf. Process. Manag..

[18]  Juan-Zi Li,et al.  Understanding retweeting behaviors in social networks , 2010, CIKM.

[19]  Brian D. Davison,et al.  Co-factorization machines: modeling user interests and predicting individual decisions in Twitter , 2013, WSDM.

[20]  Juan-Zi Li,et al.  Social Influence Locality for Modeling Retweeting Behaviors , 2013, IJCAI.

[21]  Aaron Schein,et al.  Inferring Polyadic Events With Poisson Tensor Factorization , 2014 .

[22]  Yong Yu,et al.  Collaborative personalized tweet recommendation , 2012, SIGIR '12.

[23]  Matthew Michelson,et al.  Tweet Disambiguate Entities Retrieve Folksonomy SubTree Step 1 : Discover Categories Generate Topic Profile from SubTrees Step 2 : Discover Profile Topic Profile : “ English Football ” “ World Cup ” , 2011 .

[24]  Fenglong Ma,et al.  Topical Influential User Analysis with Relationship Strength Estimation in Twitter , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[25]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[26]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[27]  Xiaowen Dai,et al.  User‐level microblogging recommendation incorporating social influence , 2017, J. Assoc. Inf. Sci. Technol..

[28]  Xiaoyu Du,et al.  Fast Matrix Factorization With Nonuniform Weights on Missing Data , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[30]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[31]  Rediet Abebe Can Cascades be Predicted? , 2014 .

[32]  Indrajit Bhattacharya,et al.  Online Topic-based Social Influence Analysis for the Wimbledon Championships , 2015, KDD.

[33]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[34]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[35]  Xuanjing Huang,et al.  Retweet Prediction with Attention-based Deep Neural Network , 2016, CIKM.

[36]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[37]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[38]  Qi Tian,et al.  Adversarial Training Towards Robust Multimedia Recommender System , 2018, IEEE Transactions on Knowledge and Data Engineering.

[39]  Ee-Peng Lim,et al.  Retweeting: An Act of Viral Users, Susceptible Users, or Viral Topics? , 2013, SDM.

[40]  Fei Wang,et al.  Social contextual recommendation , 2012, CIKM.

[41]  Wei Chen,et al.  Diffusion of “Following” Links in Microblogging Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[42]  Younghoon Kim,et al.  TWITOBI: A Recommendation System for Twitter Using Probabilistic Modeling , 2011, 2011 IEEE 11th International Conference on Data Mining.

[43]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[44]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[45]  Jianyong Wang,et al.  Retweet or not?: personalized tweet re-ranking , 2013, WSDM.

[46]  Jon M. Kleinberg,et al.  Feedback effects between similarity and social influence in online communities , 2008, KDD.

[47]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[48]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[49]  Meng Wang,et al.  Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Chong Wang,et al.  Stochastic variational inference , 2012, J. Mach. Learn. Res..

[51]  Yannis Sismanis,et al.  Scalable topic-specific influence analysis on microblogs , 2014, WSDM.

[52]  Fabio Ciravegna,et al.  Social influence analysis in microblogging platforms - A topic-sensitive based approach , 2014, Semantic Web.

[53]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[54]  Juston Moore,et al.  Poisson factorization for peer-based anomaly detection , 2016, 2016 IEEE Conference on Intelligence and Security Informatics (ISI).

[55]  Jiawei Han,et al.  Learning influence from heterogeneous social networks , 2012, Data Mining and Knowledge Discovery.

[56]  W. Bruce Croft,et al.  User oriented tweet ranking: a filtering approach to microblogs , 2011, CIKM '11.

[57]  Ali Taylan Cemgil,et al.  Bayesian Inference for Nonnegative Matrix Factorisation Models , 2009, Comput. Intell. Neurosci..

[58]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[59]  Jianxin Li,et al.  Personalized Influential Topic Search via Social Network Summarization , 2016, IEEE Transactions on Knowledge and Data Engineering.

[60]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[61]  Tony White,et al.  Modelling Influence in a Social Network: Metrics and Evaluation , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.