Comparing Community-based Information Adoption and Diffusion Across Different Microblogging Sites

The proliferation of social media is bringing about significant changes in how people make sense of their world and adopt new information. However, social, cultural and political divisions continue to separate users and information into different social media systems. Twitter and Facebook, for example, are strictly forbidden in mainland China. As a result, 21.97% of all world-wide Internet users are thus excluded from participating in these platforms. In this study, we investigate whether the dynamics of information diffusion, modeled as the adoption patterns of topical hashtags, differ between the communities of the mentioned social media sites as a result of this separation. Specifically, we compare Weibo and Twitter, the two largest micro-blogging sites serving respectively the Chinese population and the rest of the world, by exploring the similarities and differences of how their respective users adopt new information. By leveraging sophisticated community detection algorithms and heterogeneous graph mining methods, we investigate and compare how the different characteristics of these communities influence information diffusion and adoption. Experimental results show that while community-specific information influences topic diffusion and adoption in both environments, novel features, extracted from heterogeneous graph based communities, have a greater effect on Weibo information adoption than Twitter. We also find that users sharing hashtags is an important factor in information diffusion on both Twitter and Weibo, whereas user mentions are important for Weibo, but less so for Twitter. Overall, we conclude that Weibo and Twitter differ sharply in how their users adopt information in response to similar factors.

[1]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[2]  Kathleen M. Carley,et al.  Understanding online firestorms: Negative word-of-mouth dynamics in social media networks , 2014 .

[3]  Rizal Setya Perdana What is Twitter , 2013 .

[4]  Yizhou Sun,et al.  Full-text based context-rich heterogeneous network mining approach for citation recommendation , 2014, IEEE/ACM Joint Conference on Digital Libraries.

[5]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[6]  Evgeny V. Morozov,et al.  Response to Philip N. Howard's review of The Net Delusion: The Dark Side of Internet Freedom , 2011, Perspectives on Politics.

[7]  Renaud Lambiotte,et al.  Communities, knowledge creation, and information diffusion , 2009, J. Informetrics.

[8]  Eli Pariser,et al.  The Filter Bubble: What the Internet Is Hiding from You , 2011 .

[9]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[10]  Víctor Urrea,et al.  Letter to the Editor: Stability of Random Forest importance measures , 2011, Briefings Bioinform..

[11]  Xiaozhong Liu,et al.  Mirroring the real world in social media: twitter, geolocation, and sentiment analysis , 2013, UnstructureNLP@CIKM.

[12]  Reza Zafarani,et al.  Finding Friends on a New Site Using Minimum Information , 2014, SDM.

[13]  Shou-De Lin,et al.  Matching users and items across domains to improve the recommendation quality , 2014, KDD.

[14]  Reza Zafarani,et al.  Connecting users across social media sites: a behavioral-modeling approach , 2013, KDD.

[15]  M. Osborne,et al.  Using Prediction Markets and Twitter to Predict a Swine Flu Pandemic , 2009 .

[16]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[17]  Huan Liu,et al.  Community Detection and Mining in Social Media , 2010, Community Detection and Mining in Social Media.

[18]  Martin H. Levinson Alone Together: Why We Expect More from Technology and Less from Each Other , 2011 .

[19]  Johan Bollen,et al.  Improving news ranking by community tweets , 2012, WWW.

[20]  Fareed Zakaria,et al.  The Post-American World: Release 2.0 , 2011 .

[21]  Zhigang Cao,et al.  Analyzing user behavior of the micro-blogging website Sina Weibo during hot social events , 2013, 1304.3898.

[22]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[23]  Bin Zhao,et al.  Discovering Collective Viewpoints on Micro-blogging Events Based on Community and Temporal Aspects , 2011, ADMA.

[24]  Filippo Menczer,et al.  Virality Prediction and Community Structure in Social Networks , 2013, Scientific Reports.

[25]  Eli Pariser FILTER BUBBLE: Wie wir im Internet entmündigt werden , 2012 .

[26]  Dennis M. Murphy The Net Delusion: The Dark Side of Internet Freedom , 2012 .

[27]  Filippo Menczer,et al.  Predicting Successful Memes Using Network and Community Structure , 2014, ICWSM.

[28]  Xin Shuai,et al.  Comparing the pulses of categorical hot events in Twitter and Weibo , 2014, HT.

[29]  Jure Leskovec,et al.  Information diffusion and external influence in networks , 2012, KDD.

[30]  Gerhard Weikum,et al.  Integrating DB and IR Technologies: What is the Sound of One Hand Clapping? , 2005, CIDR.

[31]  Yong Yu,et al.  A comparative study of users' microblogging behavior on sina weibo and twitter , 2012, UMAP.

[32]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[33]  Yu Wang,et al.  Community-based greedy algorithm for mining top-K influential nodes in mobile social networks , 2010, KDD.

[34]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[35]  Zeynep Tufekci,et al.  Engineering the public: Big data, surveillance and computational politics , 2014, First Monday.

[36]  Xiaozhong Liu,et al.  Computational community interest for ranking , 2009, CIKM.

[37]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[38]  Marco Sandri,et al.  A Bias Correction Algorithm for the Gini Variable Importance Measure in Classification Trees , 2008 .

[39]  Keith A. Markus,et al.  Principles and Practice of Structural Equation Modeling by Rex B. Kline , 2012 .

[40]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[41]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[42]  Jie Tang,et al.  What is the Nature of Chinese MicroBlogging: Unveiling the Unique Features of Tencent Weibo , 2012, 1211.2197.

[43]  Lei Yang,et al.  We know what @you #tag: does the dual role affect hashtag adoption? , 2012, WWW.

[44]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[45]  J. Cheung Alone Together: Why We Expect More from Technology and Less from Each Other , 2013 .