Social media analytics and research testbed (SMART): Exploring spatiotemporal patterns of human dynamics with geo-targeted social media messages

The multilevel model of meme diffusion conceptualizes how mediated messages diffuse over time and space. As a pilot application of implementing the meme diffusion, we developed the social media analytics and research testbed to monitor Twitter messages and track the diffusion of information in and across different cities and geographic regions. Social media analytics and research testbed is an online geo-targeted search and analytics tool, including an automatic data processing procedure at the backend and an interactive frontend user interface. Social media analytics and research testbed is initially designed to facilitate (1) searching and geo-locating tweet topics and terms in different cities and geographic regions; (2) filtering noise from raw data (such as removing redundant retweets and using machine learning methods to improve precision); (3) analyzing social media data from a spatiotemporal perspective; and (4) visualizing social media data in diagnostic ways (such as weekly and monthly trends, trend maps, top media, top retweets, top mentions, or top hashtags). Social media analytics and research testbed provides researchers and domain experts with a tool that can efficiently facilitate the refinement, formalization, and testing of research hypotheses or questions. Three case studies (flu outbreaks, Ebola epidemic, and marijuana legalization) are introduced to illustrate how the predictions of meme diffusion can be examined and to demonstrate the potentials and key functions of social media analytics and research testbed.

[1]  David S. Ebert,et al.  Spatiotemporal social media analytics for abnormal event detection and examination using seasonal-trend decomposition , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[2]  R. Blendon,et al.  Ebola in the United States--Public Reactions and Implications. , 2015, The New England journal of medicine.

[3]  Jennifer M. Olsen,et al.  Infodemiology to improve public health situational awareness: An investigation of 2010 pertussis outbreaks in California, Michigan and Ohio , 2013 .

[4]  John S Brownstein,et al.  Searching the Web for Influenza Vaccines: HealthMap Vaccine Finder. , 2015, American journal of public health.

[5]  Benjamin M. Good,et al.  Tag clouds for summarizing web search results , 2007, WWW '07.

[6]  Julie A. Pavlin,et al.  A Review of Evaluations of Electronic Event-Based Biosurveillance Systems , 2014, PloS one.

[7]  M. Tsou,et al.  Research challenges and opportunities in mapping social media and Big Data , 2015 .

[8]  Sean D. Young,et al.  Stimulant Use Among African American and Latino MSM Social Networking Users , 2013, Journal of addictive diseases.

[9]  Jeffrey Nichols,et al.  Home Location Identification of Twitter Users , 2014, TIST.

[10]  I. Helsloot,et al.  Twitter: An Underutilized Potential During Sudden Crises? , 2013 .

[11]  Carlos Castillo-Chavez,et al.  Mass Media and the Contagion of Fear: The Case of Ebola in America , 2015, PloS one.

[12]  Richard Brennan Reflections on the Ebola outbreak: Perspectives from the frontline , 2014 .

[13]  Brian H. Spitzberg,et al.  Mapping social activities and concepts with social media (Twitter) and web search engines (Yahoo and Bing): a case study in 2012 US Presidential Election , 2013 .

[14]  Liuliu Zhou,et al.  The construction and application research of crisis early warning mechanism of reputation of colleges and universitie s in the new media environment , 2014 .

[15]  Krzysztof Janowicz,et al.  POI Pulse: A Multi-granular, Semantic Signature–Based Information Observatory for the Interactive Visualization of Big Geosocial Data , 2015, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[16]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[17]  Renato Miranda,et al.  Uncovering the Location of Twitter Users , 2013, 2013 Brazilian Conference on Intelligent Systems.

[18]  Scott A. Hale,et al.  Where in the World Are You? Geolocation and Language Identification in Twitter* , 2013, ArXiv.

[19]  Karolien Poels,et al.  #universitycrisis: The Impact of Social Media Type, Source, and Information on Student Responses Toward a University Crisis , 2014 .

[20]  Ming-Hsiang Tsou,et al.  Mapping ideas from cyberspace to realspace: visualizing the spatial context of keywords from web page search results , 2014, Int. J. Digit. Earth.

[21]  Robert J. Meijer,et al.  Sensor Data Storage Performance: SQL or NoSQL, Physical or Virtual , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[22]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[23]  Ping Zhang,et al.  The Evolution of Social Commerce: The People, Management, Technology, and Information Dimensions , 2012, Commun. Assoc. Inf. Syst..

[24]  Elad Segev,et al.  Families and Networks of Internet Memes: The Relationship Between Cohesiveness, Uniqueness, and Quiddity Concreteness , 2015, J. Comput. Mediat. Commun..

[25]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[26]  D A Asch,et al.  The content of social media's shared images about Ebola: a retrospective study. , 2015, Public health.

[27]  Steven Verstockt,et al.  Review of wildfire detection using social media , 2014 .

[28]  Dimple R. Thadani,et al.  The impact of electronic word-of-mouth communication: A literature analysis and integrative model , 2012, Decis. Support Syst..

[29]  James A. Thom,et al.  Geotagging Twitter Messages in Crisis Management , 2015, Comput. J..

[30]  Filippo Menczer,et al.  Design and prototyping of a social media observatory , 2013, WWW.

[31]  Barry Wellman,et al.  Geography of Twitter networks , 2012, Soc. Networks.

[32]  Mor Naaman,et al.  Diamonds in the rough: Social media visual analytics for journalistic inquiry , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[33]  Carter T. Butts,et al.  Warning tweets: serial transmission of messages during the warning phase of a disaster event , 2014 .

[34]  Xiao Zhang,et al.  SensePlace2: GeoTwitter analytics support for situational awareness , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[35]  John S. Brownstein,et al.  Evaluation of Local Media Surveillance for Improved Disease Recognition and Monitoring in Global Hotspot Regions , 2014, PloS one.

[36]  Thomas Ertl,et al.  Thematic Patterns in Georeferenced Tweets through Space-Time Visual Analytics , 2013, Computing in Science & Engineering.

[37]  Brian H. Spitzberg,et al.  The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets , 2013, Journal of medical Internet research.

[38]  Ming-Hsiang Tsou,et al.  Visualization of social media: seeing a mirage or a message? , 2013 .

[39]  Ben Shneiderman,et al.  Designing the User Interface: Strategies for Effective Human-Computer Interaction , 1998 .

[40]  Ed H. Chi,et al.  Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles , 2011, CHI.

[41]  Michael D. Barnes,et al.  Tweaking and Tweeting: Exploring Twitter for Nonmedical Use of a Psychostimulant Drug (Adderall) Among College Students , 2013, Journal of medical Internet research.

[42]  Carlo Ratti,et al.  Geo-located Twitter as proxy for global mobility patterns , 2013, Cartography and geographic information science.

[43]  John S Brownstein,et al.  Comparing timeliness, content, and disease severity of formal and informal source outbreak reporting , 2015, BMC Infectious Diseases.

[44]  Russ Burtner,et al.  INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS REVIEW Open Access , 2022 .

[45]  Thomas Ertl,et al.  Spatiotemporal anomaly detection through visual analysis of geolocated Twitter messages , 2012, 2012 IEEE Pacific Visualization Symposium.

[46]  Brian H. Spitzberg Toward a Model of Meme Diffusion (M3D) , 2014 .

[47]  Arkaitz Zubiaga,et al.  Real‐time classification of Twitter trends , 2014, J. Assoc. Inf. Sci. Technol..

[48]  Nigel Collier,et al.  Factors Influencing Performance of Internet-Based Biosurveillance Systems Used in Epidemic Intelligence for Early Detection of Infectious Diseases Outbreaks , 2014, PloS one.

[49]  Kazuyuki Aihara,et al.  Quantifying Collective Attention from Tweet Stream , 2013, PloS one.

[50]  Courtney Corley,et al.  Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance , 2010, Comput. Methods Programs Biomed..

[51]  P. Suñé,et al.  Positive Outcomes Influence the Rate and Time to Publication, but Not the Impact Factor of Publications of Clinical Trial Results , 2013, PloS one.

[52]  J. Brownstein,et al.  A Case Study of the New York City 2012-2013 Influenza Season With Daily Geocoded Twitter Data From Temporal and Spatiotemporal Perspectives , 2014, Journal of medical Internet research.

[53]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[54]  Aron Culotta,et al.  Lightweight methods to estimate influenza rates and alcohol sales volume from Twitter messages , 2012, Language Resources and Evaluation.

[55]  Joel Lexchin,et al.  The Cost of Pushing Pills: A New Estimate of Pharmaceutical Promotion Expenditures in the United States , 2008, PLoS medicine.

[56]  Martha Larson,et al.  The where in the tweet , 2011, CIKM '11.

[57]  Alex Pentland,et al.  Beyond Contagion: Reality Mining Reveals Complex Patterns of Social Influence , 2015, PloS one.

[58]  Ben Y. Reis,et al.  Surveillance Sans Frontières: Internet-Based Emerging Infectious Disease Intelligence and the HealthMap Project , 2008, PLoS medicine.

[59]  Brian H. Spitzberg,et al.  The Reliability of Tweets as a Supplementary Method of Seasonal Influenza Surveillance , 2014, Journal of medical Internet research.