Assessing relevance of tweets for risk communication

ABSTRACT Although Twitter is used for emergency management activities, the relevance of tweets during a hazard event is still open to debate. In this study, six different computational (i.e. Natural Language Processing) and spatiotemporal analytical approaches were implemented to assess the relevance of risk information extracted from tweets obtained during the 2013 Colorado flood event. Primarily, tweets containing information about the flooding events and its impacts were analysed. Examination of the relationships between tweet volume and its content with precipitation amount, damage extent, and official reports revealed that relevant tweets provided information about the event and its impacts rather than any other risk information that public expects to receive via alert messages. However, only 14% of the geo-tagged tweets and only 0.06% of the total fire hose tweets were found to be relevant to the event. By providing insight into the quality of social media data and its usefulness to emergency management activities, this study contributes to the literature on quality of big data. Future research in this area would focus on assessing the reliability of relevant tweets for disaster related situational awareness.

[1]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[2]  K. Ng,et al.  The Fukushima Nuclear Crisis Reemphasizes the Need for Improved Risk Communication and Better Use of Social Media , 2012, Health physics.

[3]  Leysia Palen,et al.  The Evolving Role of the Public Information Officer: An Examination of Social Media in Emergency Management , 2012 .

[4]  Vincent T. Covello,et al.  Effective risk communication : the role and responsibility of government and nongovernment organizations , 1989 .

[5]  F Cheong,et al.  Social Media Data Mining: A Social Network Analysis of Tweets During the Australian 2010-2011 Floods , 2011 .

[6]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[7]  John Yen,et al.  Classifying text messages for the haiti earthquake , 2011, ISCRAM.

[8]  Leysia Palen,et al.  Online public communications by police & fire services during the 2012 Hurricane Sandy , 2014, CHI.

[9]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[10]  Fernando González-Ladrón-de-Guevara,et al.  Towards an integrated crowdsourcing definition , 2012, J. Inf. Sci..

[11]  Arjen P. de Vries,et al.  Obtaining High-Quality Relevance Judgments Using Crowdsourcing , 2012, IEEE Internet Computing.

[12]  Pierre Tirilly,et al.  Language modeling for bag-of-visual words image categorization , 2008, CIVR '08.

[13]  Anna Trakoli,et al.  Risk Communication: A Handbook for Communicating Environmental, Safety, and Health Risks , 2015 .

[14]  Steffen Fritz,et al.  The Rise of Collaborative Mapping: Trends and Future Directions , 2013, ISPRS Int. J. Geo Inf..

[15]  Leysia Palen,et al.  Mastering social media: An analysis of Jefferson County's communications during the 2013 Colorado floods , 2014, ISCRAM.

[16]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[17]  Vincent T. Covello,et al.  Risk Communication: An Emerging Area of Health Communication Research , 1992 .

[18]  David S. Ebert,et al.  Spatiotemporal social media analytics for abnormal event detection and examination using seasonal-trend decomposition , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[19]  Ashton M. Verdery,et al.  Enhancing big data in the social sciences with crowdsourcing: Data augmentation practices, techniques, and opportunities , 2016, PloS one.

[20]  Bernd Ludwig,et al.  Context relevance assessment and exploitation in mobile recommender systems , 2012, Personal and Ubiquitous Computing.

[21]  Edson C. Tandoc,et al.  Communicating on Twitter during a disaster: An analysis of tweets during Typhoon Haiyan in the Philippines , 2015, Comput. Hum. Behav..

[22]  Kazutoshi Sumiya,et al.  Discovery of unusual regional social activities using geo-tagged microblogs , 2011, World Wide Web.

[23]  Shelley Boulianne,et al.  Does compassion go viral? Social media, caring, and the Fort McMurray wildfire , 2018 .

[24]  Bandana Kar,et al.  Citizen science in risk communication in the era of ICT , 2016, Concurr. Comput. Pract. Exp..

[25]  Raymond H. Johnson Risk Communication, A Handbook for Communicating Environmental, Safety, and Health Risks, Third Edition , 2005 .

[26]  Vincent T. Covello,et al.  Effective Risk Communication , 1989 .

[27]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[28]  Omar Alonso,et al.  Using crowdsourcing for TREC relevance assessment , 2012, Inf. Process. Manag..

[29]  Alexander Zipf,et al.  The use of Volunteered Geographic Information (VGI) and Crowdsourcing in Disaster Management: a Systematic Literature Review , 2013, AMCIS.

[30]  David Gough,et al.  Weight of Evidence: a framework for the appraisal of the quality and relevance of evidence , 2007 .

[31]  Yangyong Zhu,et al.  The Challenges of Data Quality and Data Quality Assessment in the Big Data Era , 2015, Data Sci. J..

[32]  Adam Crowe The Elephant in the JIC: The Fundamental Flaw of Emergency Public Information within the NIMS Framework , 2010 .

[33]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[34]  Sam Meek,et al.  A flexible framework for assessing the quality of crowdsourced data , 2014 .

[35]  B H Morrow,et al.  Identifying and mapping community vulnerability. , 1999, Disasters.

[36]  Matthew Lease,et al.  Crowdsourcing Document Relevance Assessment with Mechanical Turk , 2010, Mturk@HLT-NAACL.

[37]  M. Simpson Global Climate Change Impacts in the United States , 2011 .

[38]  J Brian Houston,et al.  Social media and disasters: a functional framework for social media use in disaster planning, response, and research. , 2015, Disasters.

[39]  Walter Gillis Peacock,et al.  Social Science Research Needs for the Hurricane Forecast and Warning System , 2007 .

[40]  Philip Treleaven,et al.  Quantifying the Digital Traces of Hurricane Sandy on Flickr , 2013, Scientific Reports.

[41]  J. Fowler,et al.  Rapid assessment of disaster damage using social media activity , 2016, Science Advances.

[42]  Christoph Perger,et al.  Using control data to determine the reliability of volunteered geographic information about land cover , 2013, Int. J. Appl. Earth Obs. Geoinformation.

[43]  Dave Yates,et al.  Emergency knowledge management and social media technologies: A case study of the 2010 Haitian earthquake , 2011, Int. J. Inf. Manag..

[44]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[45]  David Filliat,et al.  A visual bag of words method for interactive qualitative localization and mapping , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[46]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[47]  Katrin Erk,et al.  A Structured Vector Space Model for Word Meaning in Context , 2008, EMNLP.

[48]  Harry Shum,et al.  An Empirical Study on Learning to Rank of Tweets , 2010, COLING.