Strategy for processing and analyzing social media data streams in emergencies

People are using social media to a greater extent, particularly in emergency situations. However, approaches for processing and analyzing the vast quantities of data produced currently lag far behind. In this paper we discuss important steps, and the associated challenges, for processing and analyzing social media in emergencies. In our research project EmerGent, a huge volume of low-quality messages will be continuously gathered from a variety of social media services such as Facebook or Twitter. Our aim is to design a software system that will process and analyze social media data, transforming the high volume of noisy data into a low volume of rich content that is useful to emergency personnel. Therefore, suitable techniques are needed to extract and condense key information from raw social media data, allowing detection of relevant events and generation of alerts pertinent to emergency personnel.

[1]  Claudia Diamantini,et al.  An integrated system for social information discovery , 2014, 2014 International Conference on Collaboration Technologies and Systems (CTS).

[2]  Fernando Diaz,et al.  Extracting information nuggets from disaster- Related messages in social media , 2013, ISCRAM.

[3]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[4]  Thomas Ludwig,et al.  Social-QAS: Tailorable Quality Assessment Service for Social Media Content , 2015, IS-EUD.

[5]  Jie Yin,et al.  Using Social Media to Enhance Emergency Situation Awareness , 2012, IEEE Intelligent Systems.

[6]  Hila Becker,et al.  Event Identification in Social Media , 2009, WebDB.

[7]  Mehmed Kantardzic,et al.  Data-Mining Concepts , 2011 .

[8]  Surajit Chaudhuri,et al.  What next?: a half-dozen data management research goals for big data and the cloud , 2012, PODS.

[9]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[10]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[11]  Brooke Fisher Liu,et al.  Social media use during disasters: a review of the knowledge base and gaps. , 2012 .

[12]  Robert Tolksdorf,et al.  Case Studies on Ontology Reuse , 2005 .

[13]  Guy G. Gable,et al.  Information Quality in Social Media: A Conceptual Model , 2013, PACIS.

[14]  Mohammed J. Zaki Data Mining and Analysis: Fundamental Concepts and Algorithms , 2014 .

[15]  Stephan Prödel,et al.  Analysis of information quality criteria in a crisis situation as a characteristic of complex situations , 2010, ICIQ.

[16]  G. Jensen Key Criteria for Information Quality in the Use of Online Social Media for Emergency Management in New Zealand , 2012 .

[17]  Ming Zhou,et al.  Joint Inference of Named Entity Recognition and Normalization for Tweets , 2012, ACL.

[18]  Donna B. Stoddard,et al.  Quality of Social Media Data and Implications of Social Media for Data Quality , 2012, MIT International Conference on Information Quality.

[19]  Kalina Bontcheva,et al.  Making sense of social media streams through semantics: A survey , 2014, Semantic Web.

[20]  Leysia Palen,et al.  Mastering social media: An analysis of Jefferson County's communications during the 2013 Colorado floods , 2014, ISCRAM.

[21]  Ângela Guimarães Pereira,et al.  Do-it-yourself Justice - Considerations of Social Media use in a Crisis Situation: The Case of the 2011 Vancouver Riots , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[22]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[23]  Thomas Ludwig,et al.  Social Haystack , 2015, ACM Trans. Comput. Hum. Interact..

[24]  Christian Reuter,et al.  Technical Limitations for Designing Applications for Social Media , 2014, MuC Workshopband.

[25]  Thomas Ludwig,et al.  CrowdMonitor: Mobile Crowd Sensing for Assessing Physical and Digital Activities of Citizens during Emergencies , 2015, CHI.

[26]  Thomas Ludwig,et al.  XHELP: Design of a Cross-Platform Social-Media Application to Support Volunteer Moderators in Disasters , 2015, CHI.

[27]  M. de Rijke,et al.  Credibility Improves Topical Blog Post Retrieval , 2008, ACL.

[28]  Antony Galton,et al.  An ontology of information for emergency management , 2011, ISCRAM.

[29]  Ann Blandford,et al.  Situation awareness in emergency medical dispatch , 2004, Int. J. Hum. Comput. Stud..

[30]  Qi Gao,et al.  Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web , 2011, ESWC.

[31]  Jacob Eisenstein,et al.  What to do about bad language on the internet , 2013, NAACL.

[32]  Thomas Ludwig,et al.  Entwicklung eines SOA-basierten und anpassbaren Bewertungsdienstes für Inhalte aus sozialen Medien , 2014, GI-Jahrestagung.

[33]  Tharam S. Dillon,et al.  Content Quality Assessment Related Frameworks for Social Media , 2009, ICCSA.

[34]  Miriam A. M. Capretz,et al.  From Glossaries to Ontologies: Disaster Management Domain , 2011, ICSE 2011.

[35]  Leysia Palen,et al.  Online public communications by police & fire services during the 2012 Hurricane Sandy , 2014, CHI.

[36]  Volkmar Pipek,et al.  Crisis Management 2.0: Towards a Systematization of Social Software Use in Crisis Situations , 2012, Int. J. Inf. Syst. Crisis Response Manag..

[37]  Jason R. C. Nurse,et al.  Information Quality and Trustworthiness: A Topical State−of−the−Art Review , 2011 .