Identifying topical influencers on twitter based on user behavior and network topology

Social media web sites have become major media platforms to share personal information, news, photos, videos and more. Users can even share live streams whenever they want to reach out to many other. This prevalent usage of social media attracted companies, data scientists, and researchers who are trying to infer meaningful information from this vast amount of data. Information diffusion and maximizing the spread of words is one of the most important focus for researchers working on social media. This information can serve many purposes such as; user or content recommendation, viral marketing, and user modeling. In this research, finding topical influential/authority users on Twitter is addressed. Since Twitter is a good platform to spread knowledge as a word of mouth approach and it has many more public profiles than protected ones, it is a target media for marketers. In this paper, we introduce a novel methodology, called Personalized PageRank, that integrates both the information obtained from network topology and the information obtained from user actions and activities in Twitter. The proposed approach aims to determine the topical influencers who are experts on a specific topic. Experimental results on a large dataset consisting of Turkish tweets show that using user specific features like topical focus rate, activeness, authenticity and speed of getting reaction on specific topics positively affects identifying influencers and lead to higher information diffusion. Algorithms are implemented on a distributed computing environment which makes high-cost graph processing more efficient.

[1]  Weiyi Liu,et al.  Containment of competitive influence spread in social networks , 2016, Knowl. Based Syst..

[2]  Dylan Walker,et al.  Tie Strength, Embeddedness, and Social Influence: A Large-Scale Networked Experiment , 2014, Manag. Sci..

[3]  Laks V. S. Lakshmanan,et al.  A Data-Based Approach to Social Influence Maximization , 2011, Proc. VLDB Endow..

[4]  Paolo Rosso,et al.  Figurative messages and affect in Twitter: Differences between #irony, #sarcasm and #not , 2016, Knowl. Based Syst..

[5]  Sule Gündüz Ögüdücü,et al.  Extracting Topical Information of Tweets Using Hashtags , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[6]  Jie Tang,et al.  Mining structural hole spanners through information diffusion in social networks , 2013, WWW.

[7]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[8]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[9]  Ronald Lippitt,et al.  The Dynamics of Power , 1952 .

[10]  Erik Cambria,et al.  Aspect extraction for opinion mining with a deep convolutional neural network , 2016, Knowl. Based Syst..

[11]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[12]  Felipe Bravo-Marquez,et al.  Building a Twitter opinion lexicon from automatically-annotated tweets , 2016, Knowl. Based Syst..

[13]  Nirmalie Wiratunga,et al.  Contextual sentiment analysis for social media genres , 2016, Knowl. Based Syst..

[14]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[15]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[16]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[17]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[18]  Caglar Tirkaz,et al.  A Morphology-Aware Network for Morphological Disambiguation , 2016, AAAI.

[19]  Murat Can Ganiz,et al.  A new hybrid semi-supervised algorithm for text classification with class-based semantics , 2016, Knowl. Based Syst..

[20]  Brian D. Davison,et al.  Empirical study of topic modeling in Twitter , 2010, SOMA '10.

[21]  M. Deutsch,et al.  A study of normative and informational social influences upon individual judgement. , 1955, Journal of abnormal psychology.

[22]  Isabel Anger,et al.  Measuring influence on Twitter , 2011, i-KNOW '11.

[23]  Scott Counts,et al.  Identifying topical authorities in microblogs , 2011, WSDM '11.

[24]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[25]  Masahiro Kimura,et al.  Learning Diffusion Probability Based on Node Attributes in Social Networks , 2011, ISMIS.

[26]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[27]  Chuan Zhou,et al.  Big social network influence maximization via recursively estimating influence spread , 2016, Knowl. Based Syst..

[28]  Scott Sanner,et al.  Improving LDA topic models for microblogs via tweet pooling and automatic labeling , 2013, SIGIR.

[29]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[30]  Hakim Hacid,et al.  A predictive model for the temporal dynamics of information diffusion in online social networks , 2012, WWW.

[31]  C Prem Sankar,et al.  Learning from Bees: An Approach for Influence Maximization on Viral Campaigns , 2016, PloS one.

[32]  Michal Karpowicz,et al.  Opinion Mining on the Web 2.0 - Characteristics of User Generated Content and Their Impacts , 2013, CHI-KDD.

[33]  Hongfei Yan,et al.  Comparing Twitter and Traditional Media Using Topic Models , 2011, ECIR.

[34]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[35]  Sule Gündüz Ögüdücü,et al.  Influential user detection on Twitter: Analyzing effect of focus rate , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[36]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[37]  Jiawei Han,et al.  Learning influence from heterogeneous social networks , 2012, Data Mining and Knowledge Discovery.

[38]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[39]  Yizhou Sun,et al.  RAIN: Social Role-Aware Information Diffusion , 2015, AAAI.

[40]  Fabián Riquelme,et al.  Measuring user influence on Twitter: A survey , 2015, Inf. Process. Manag..

[41]  Ludovic Lietard,et al.  Two evidential data based models for influence maximization in Twitter , 2017, Knowl. Based Syst..

[42]  Roberto V. Zicari,et al.  PoliTwi: Early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis , 2014, Knowl. Based Syst..

[43]  Cheng Long,et al.  Viral marketing for dedicated customers , 2014, Inf. Syst..