Discovering Social Events through Online Attention

Twitter is a major social media platform in which users send and read messages (“tweets”) of up to 140 characters. In recent years this communication medium has been used by those affected by crises to organize demonstrations or find relief. Because traffic on this media platform is extremely heavy, with hundreds of millions of tweets sent every day, it is difficult to differentiate between times of turmoil and times of typical discussion. In this work we present a new approach to addressing this problem. We first assess several possible “thermostats” of activity on social media for their effectiveness in finding important time periods. We compare methods commonly found in the literature with a method from economics. By combining methods from computational social science with methods from economics, we introduce an approach that can effectively locate crisis events in the mountains of data generated on Twitter. We demonstrate the strength of this method by using it to locate the social events relating to the Occupy Wall Street movement protests at the end of 2011.

[1]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[2]  J. Nadal,et al.  Manifesto of computational social science , 2012 .

[3]  Nick Chater,et al.  Using big data to predict collective behavior in the real world 1 , 2014, Behavioral and Brain Sciences.

[4]  C. McClelland The Acute International Crisis , 1961 .

[5]  A. Pentland,et al.  Life in the network: The coming age of computational social science: Science , 2009 .

[6]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[7]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[8]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[9]  Mohammad Ali Abbasi,et al.  TweetTracker: An Analysis Tool for Humanitarian and Disaster Relief , 2011, ICWSM.

[10]  Qi He,et al.  What Do People Want in Microblogs? Measuring Interestingness of Hashtags in Twitter , 2010, 2010 IEEE International Conference on Data Mining.

[11]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[12]  John A. Swets,et al.  Signal Detection Theory and ROC Analysis in Psychology and Diagnostics: Collected Papers , 1996 .

[13]  Alessandro Vespignani,et al.  Beating the news using social media: the case study of American Idol , 2012, EPJ Data Science.

[14]  E. Ben-Jacob,et al.  Challenges in network science: Applications to infrastructures, climate, social systems and economics , 2012 .

[15]  H. Stanley,et al.  Quantifying Trading Behavior in Financial Markets Using Google Trends , 2013, Scientific Reports.

[16]  Juval Portugali,et al.  Population movement under extreme events , 2012, Proceedings of the National Academy of Sciences.

[17]  Alessandro Vespignani,et al.  Epidemic spreading in scale-free networks. , 2000, Physical review letters.

[18]  Lei Yang,et al.  We know what @you #tag: does the dual role affect hashtag adoption? , 2012, WWW.

[19]  Rui Wang,et al.  Towards social user profiling: unified and discriminative influence model for inferring home locations , 2012, KDD.

[20]  H. Eugene Stanley,et al.  Quantifying Wikipedia Usage Patterns Before Stock Market Moves , 2013, Scientific Reports.

[21]  Jing Hu,et al.  Culturomics meets random fractal theory: insights into long-range correlations of social and natural phenomena over the past two centuries , 2012, Journal of The Royal Society Interface.

[22]  Shlomo Havlin,et al.  How people interact in evolving online affiliation networks , 2011, ArXiv.

[23]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[24]  Gail A. Herndon The chronicle of higher education , 1977 .

[25]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[26]  Alexei Pozdnoukhov,et al.  Space-time dynamics of topics in streaming text , 2011, LBSN '11.

[27]  Amit P. Sheth,et al.  A Qualitative Examination of Topical Tweet and Retweet Practices , 2010, ICWSM.

[28]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[29]  Fred Morstatter,et al.  Finding Eyewitness Tweets During Crises , 2014, LTCSS@ACL.

[30]  Herschel F. Thomas,et al.  The Importance of Attention Diversity and How to Measure It , 2014 .

[31]  Miles Efron,et al.  Hashtag retrieval in a microblogging environment , 2010, SIGIR.

[32]  Huan Liu,et al.  Twitter Data Analytics , 2013, SpringerBriefs in Computer Science.

[33]  Bertrand De Longueville,et al.  "OMG, from here, I can see the flames!": a use case of mining location based social networks to acquire spatio-temporal data on forest fires , 2009, LBSN '09.

[34]  Jiawei Han,et al.  Geographical topic discovery and comparison , 2011, WWW.

[35]  Benjamin Gleason,et al.  #Occupy Wall Street , 2013 .

[36]  S. Rhoades The Herfindahl-Hirschman index , 1993 .

[37]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[38]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.