Discovering Information Propagation Patterns in Microblogging Services

During the last decade, microblog has become an important social networking service with billions of users all over the world, acting as a novel and efficient platform for the creation and dissemination of real-time information. Modeling and revealing the information propagation patterns in microblogging services cannot only lead to more accurate understanding of user behaviors and provide insights into the underlying sociology, but also enable useful applications such as trending prediction, recommendation and filtering, spam detection and viral marketing. In this article, we aim to reveal the information propagation patterns in Sina Weibo, the biggest microblogging service in China. First, the cascade of each message is represented as a tree based on its retweeting process. Afterwards, we divide the information propagation pattern into two levels, that is, the macro level and the micro level. On one hand, the macro propagation patterns refer to general propagation modes that are extracted by grouping propagation trees based on hierarchical clustering. On the other hand, the micro propagation patterns are frequent information flow patterns that are discovered using tree-based mining techniques. Experimental results show that several interesting patterns are extracted, such as popular message propagation, artificial propagation, and typical information flows between different types of users.

[1]  Fuji Ren,et al.  Learning Sentimental Influence in Twitter , 2011, 2011 International Conference on Future Computer Sciences and Application.

[2]  Marc Cheong,et al.  A Study on Detecting Patterns in Twitter Intra-topic User and Message Clustering , 2010, 2010 20th International Conference on Pattern Recognition.

[3]  Daqing Zhang,et al.  The Emergence of Social and Community Intelligence , 2011, Computer.

[4]  Aoying Zhou,et al.  Towards modeling popularity of microblogs , 2013, Frontiers of Computer Science.

[5]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[6]  J. Leskovec,et al.  Cascading Behavior in Large Blog Graphs Patterns and a model , 2006 .

[7]  Nick Koudas,et al.  Information cascade at group scale , 2013, KDD.

[8]  Jiawei Han,et al.  ACM Transactions on Knowledge Discovery from Data: Introduction , 2007 .

[9]  Jure Leskovec,et al.  Modeling Information Diffusion in Implicit Networks , 2010, 2010 IEEE International Conference on Data Mining.

[10]  Dan Zhang,et al.  ALPOS: A Machine Learning Approach for Analyzing Microblogging Data , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[11]  Zhoujun Li,et al.  Diabetes-Associated Factors as Predictors of Nursing Home Admission and Costs in the Elderly Across Europe. , 2017, Journal of the American Medical Directors Association.

[12]  Edward J. Coyle,et al.  An energy efficient hierarchical clustering algorithm for wireless sensor networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[13]  Philip S. Yu,et al.  Extracting social events for learning better information diffusion models , 2013, KDD.

[14]  Les Carr,et al.  Identifying communicator roles in twitter , 2012, WWW.

[15]  Wei Li,et al.  Measurement and analysis of topology and information propagation on Sina-Microblog , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[16]  Xingshe Zhou,et al.  Predicting the content dissemination trends by repost behavior modeling in mobile social networks , 2014, J. Netw. Comput. Appl..

[17]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[18]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[19]  Ya Zhang,et al.  Discovering patterns of advertisement propagation in Sina-Microblog , 2012, ADKDD '12.

[20]  Michael S. Bernstein,et al.  Short and tweet: experiments on recommending content from information streams , 2010, CHI.

[21]  Wolfgang Kellerer,et al.  Outtweeting the Twitterers - Predicting Information Cascades in Microblogs , 2010, WOSN.

[22]  Bernardo A. Huberman,et al.  What Trends in Chinese Social Media , 2011, ArXiv.

[23]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[24]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[25]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[26]  Mary Beth Rosson,et al.  How and why people Twitter: the role that micro-blogging plays in informal communication at work , 2009, GROUP.

[27]  Chen Huang,et al.  Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake , 2011, CSCW.

[28]  Masahiro Kimura,et al.  Extracting Influential Nodes for Information Diffusion on a Social Network , 2007, AAAI.

[29]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[30]  Bin Wu,et al.  Comparing Twitter and Chinese native microblog , 2011, 2011 Second Worldwide Cybersecurity Summit (WCS).

[31]  Hiroki Arimura,et al.  Efficient Substructure Discovery from Large Semi-Structured Data , 2001, IEICE Trans. Inf. Syst..

[32]  Xingshe Zhou,et al.  Tree-Based Mining for Discovering Patterns of Human Interaction in Meetings , 2012, IEEE Transactions on Knowledge and Data Engineering.

[33]  Bernardo A. Huberman,et al.  Artificial Inflation: The Real Story of Trends and Trend-Setters in Sina Weibo , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[34]  Rediet Abebe Can Cascades be Predicted? , 2014 .

[35]  Christos Faloutsos,et al.  Patterns of Cascading Behavior in Large Blog Graphs , 2007, SDM.

[36]  Zhu Wang,et al.  Discovering and Profiling Overlapping Communities in Location-Based Social Networks , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[37]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[38]  Scott Counts,et al.  Predicting the Speed, Scale, and Range of Information Diffusion in Twitter , 2010, ICWSM.

[39]  Juan-Zi Li,et al.  Understanding retweeting behaviors in social networks , 2010, CIKM.

[40]  Xueqi Cheng,et al.  Popularity prediction in microblogging network: a case study on sina weibo , 2013, WWW.

[41]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[42]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[43]  Shaoyong Chen,et al.  Comparision of microblogging service between Sina Weibo and Twitter , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[44]  Ke Xiao,et al.  Predicting Retweet Behavior in Weibo Social Network , 2012, WISE.

[45]  Christos Faloutsos,et al.  Graph mining: Laws, generators, and algorithms , 2006, CSUR.

[46]  Sivaraman Balakrishnan,et al.  Efficient Active Algorithms for Hierarchical Clustering , 2012, ICML.

[47]  Vwani P. Roychowdhury,et al.  Information resonance on Twitter: watching Iran , 2010, SOMA '10.

[48]  Tao Qin,et al.  Who are active? An in-depth measurement on user activity characteristics in sina microblogging , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[49]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.