Geolocated Twitter Panels to Study the Impact of Events

Data from Twitter have been employed in prior research to study the impact of events. Historically, researchers have relied on keyword-based samples of tweets to create a panel of Twitter users that mention event-related keywords during and/or after an event. There are limitations to the keyword-based panel approach. First, the technique suffers from selection bias since users who discuss an event are already more likely to discuss event-related topics beforehand; it is unclear whether observed impacts are merely driven by a set of users who are intrinsically more interested in events. Second, there are no viable groups for comparison to a keyword-based sample of tweeters. We propose an alternative sampling approach to studying response to events on Twitter that addresses the aforementioned two issues. We construct panels of users defined by their geolocation. These panels are exogenous to the keywords in users’ tweets, resulting in less selection bias than the keyword-based panel method. Geolocated Twitter panels allow us to follow within-person changes over time and also enable the creation of comparison groups. We evaluate our panel selection approach in two real-world settings: response to mass shootings and TV advertising. We illustrate how our approach limits selection bias introduced by the keyword-based approach, how discussion among the panel of users shifts before and after an event, and how geography can provide meaningful comparison groups regarding the impact of these events. We believe that we are the first to provide a clear empirical example of how a better panel selection design, based on an exogenous variable like geography, both reduces selection bias compared to the current state-of-the-art and increases the value of Twitter research for studying events.

[1]  Daniel Gayo-Avello,et al.  Don't turn social media into another 'Literary Digest' poll , 2011, Commun. ACM.

[2]  Jisun An,et al.  Whom should we sense in “social sensing” - analyzing which users work best for social media now-casting , 2015, EPJ Data Science.

[3]  David M. Rothschild,et al.  Forecasting elections with non-representative polls , 2015 .

[4]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[5]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[6]  Ron Kohavi,et al.  Responsible editor: R. Bayardo. , 2022 .

[7]  Fernando Diaz,et al.  CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises , 2014, ICWSM.

[8]  Michael D. Barnes,et al.  "Right Time, Right Place" Health Communication on Twitter: Value and Accuracy of Location Information , 2012, Journal of medical Internet research.

[9]  Johan Bos,et al.  Predicting the 2011 Dutch Senate Election Results with Twitter , 2012 .

[10]  Murphy Choy,et al.  A sentiment analysis of Singapore Presidential Election 2011 using Twitter data with census correction , 2011, ArXiv.

[11]  Sarah Vieweg,et al.  Processing Social Media Messages in Mass Emergency , 2014, ACM Comput. Surv..

[12]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[13]  Rich Ling,et al.  The Activation of Core Social Networks in the Wake of the 22 July Oslo Bombing , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[14]  K. Ochsner,et al.  Sadness Shifts to Anxiety Over Time and Distance From the National Tragedy in Newtown, Connecticut , 2015, Psychological science.

[15]  Themis Palpanas,et al.  Dynamics of news events and social media reaction , 2014, KDD.

[16]  Robert O. Keohane,et al.  Designing Social Inquiry: Scientific Inference in Qualitative Research. , 1995 .

[17]  Claudia Hauff,et al.  Twitter-based Election Prediction in the Developing World , 2015, HT.

[18]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[19]  Ingmar Weber,et al.  U.S. Religious Landscape on Twitter , 2014, SocInfo.

[20]  Han Zhang Witnessing Political Protest on Civic Engagement ∗ , 2016 .

[21]  Mike Thelwall,et al.  Sentiment in Twitter events , 2011, J. Assoc. Inf. Sci. Technol..

[22]  Derek Ruths,et al.  Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice , 2015, ICWSM.

[23]  J. Bollen,et al.  More Tweets, More Votes: Social Media as a Quantitative Indicator of Political Behavior , 2013, PloS one.

[24]  Joemon M. Jose,et al.  Building a large-scale corpus for evaluating event detection on twitter , 2013, CIKM.

[25]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[26]  Michael Gamon,et al.  Online And Social Media Data As A Flawed Continuous Panel Survey , 2014 .

[27]  Fahad Bin Muhaya,et al.  Estimating Twitter User Location Using Social Interactions--A Content Based Approach , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[28]  Zeynep Tufekci,et al.  Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls , 2014, ICWSM.

[29]  Alice H. Oh,et al.  Sociolinguistic analysis of Twitter in multilingual societies , 2014, HT.

[30]  David Lazer,et al.  Voices of victory: a computational focus group framework for tracking opinion shift in real time , 2013, WWW '13.

[31]  Bo Thiesson,et al.  Discussion Graphs: Putting Social Media Analysis in Context , 2014, ICWSM.

[32]  E. Suchman AN ANALYSIS OF “BIAS” IN SURVEY RESEARCH , 1962 .

[33]  Brian J. Taylor,et al.  Causal discovery in social media using quasi-experimental designs , 2010, SOMA '10.

[34]  Yiqun Liu,et al.  Discover breaking events with popular hashtags in twitter , 2012, CIKM.

[35]  Matthias Schonlau,et al.  Selection Bias in Web Surveys and the Use of Propensity Scores , 2006 .

[36]  Ee-Peng Lim,et al.  Tweets and Votes: A Study of the 2011 Singapore General Election , 2012, 2012 45th Hawaii International Conference on System Sciences.

[37]  Mor Naaman,et al.  Understanding Musical Diversity via Online Social Media , 2015, ICWSM.

[38]  Wolfgang Nejdl,et al.  Understanding the diversity of tweets in the time of outbreaks , 2013, WWW.

[39]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[40]  Dyng Au,et al.  Can Television Advertising Impact Be Measured on the Web? Web Spike Response as a Possible Conversion Tracking System for Television , 2014, ADKDD'14.

[41]  Mor Naaman,et al.  Unfolding the event landscape on twitter: classification and exploration of user categories , 2012, CSCW '12.

[42]  Ed H. Chi,et al.  Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles , 2011, CHI.

[43]  D. Watts,et al.  Dissecting the Spirit of Gezi: Influence vs. Selection in the Occupy Gezi Movement. , 2015 .

[44]  Trevor Cohn,et al.  A user-centric model of voting intention from Social Media , 2013, ACL.