Humanitarian Technology : Science , Systems and Global Impact 2015 , HumTech 2015 Tackling the challenges of situational awareness extraction in Twitter with an adaptive approach

Abstract Twitter is widely perceived as a potential source of valuable information for responders to mass emergencies. Despite interest in the development of extraction systems for such information, little effort has been put towards systemic methods for obtaining all posts pertaining to a disaster from the live Twitter stream. Researchers rely on keyword-based filters to extract information in spite of evidence that such markers are absent in many informational tweets, and also neglect the topic and traffic dynamics of the relevant tweets as crises progress. Previous work has shown that such practices can often lead to the loss of critical information in the context of a disaster. We introduce an adaptive filter, tailored to the idiosyncrasies of the real-time Twitter feed, intended to extract disaster-related content. Furthermore, we introduce a novel data model based on a three-label classification scheme to describe the composition of the data-stream. We use this model to simulate Twitter streams, modelling various post-disaster scenarios, for the purpose of filter performance evaluation. The filter is able to remove over 85% of the non-crisis content, and achieves a three-fold reduction in the loss of relevant contents compared to the existing approaches. In combination, the method and the model are useful tools for extracting situational awareness and highlight important directions for future work in this area.

[1]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[2]  Tony Jaques,et al.  Learning from past crises - Do iconic cases help or hinder? , 2009 .

[3]  Aaron Smith,et al.  Social Media & Mobile Internet Use among Teens and Young Adults. Millennials. , 2010 .

[4]  David Lazer,et al.  #Bigbirds Never Die: Understanding Social Dynamics of Emergent Hashtags , 2013, ICWSM.

[5]  Iadh Ounis,et al.  Overview of the TREC-2012 Microblog Track | NIST , 2013 .

[6]  Derek Ruths,et al.  Novel Situational Information in Mass Emergencies: What does Twitter Provide? , 2014 .

[7]  Sriram Subramanian,et al.  Talking about tactile experiences , 2013, CHI.

[8]  Gordon V. Cormack,et al.  University of Waterloo: Logistic Regression and Reciprocal Rank Fusion at the Microblog Track , 2012, TREC.

[9]  Bertrand De Longueville,et al.  "OMG, from here, I can see the flames!": a use case of mining location based social networks to acquire spatio-temporal data on forest fires , 2009, LBSN '09.

[10]  Haji Mohammad Saleem,et al.  Effects of Disaster Characteristics on Twitter Event Signature , 2014 .

[11]  John Bambenek,et al.  Do Emergency Text Messaging Systems Put Students in More Danger , 2008 .

[12]  Stephen Robertson,et al.  The TREC-2001 Filtering Track Report | NIST , 2002 .

[13]  Mica R. Endsley,et al.  Toward a Theory of Situation Awareness in Dynamic Systems , 1995, Hum. Factors.

[14]  L. Palen Online Social Media in Crisis Events. , 2008 .

[15]  Lisl Zach,et al.  Microblogging for crisis communication: Examination of twitter use in response to a 2009 violent crisis in the Seattle-Tacoma, Washington area , 2010, ISCRAM.

[16]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[17]  Leysia Palen,et al.  NLP to the Rescue?: Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, International Conference on Web and Social Media.

[18]  Giacomo Berardi,et al.  ISTI@TREC Microblog Track 2012: Real-Time Filtering Through Supervised Learning , 2012, TREC.

[19]  Bruno S. Silvestre,et al.  Social Media? Get Serious! Understanding the Functional Building Blocks of Social Media , 2011 .

[20]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[21]  Michelle Drifka Hinson,et al.  Examining How Public Relations Practitioners Actually Are Using Social Media , 2009 .

[22]  Sungzoon Cho,et al.  EUS SVMs: Ensemble of Under-Sampled SVMs for Data Imbalance Problems , 2006, ICONIP.

[23]  Huiji Gao,et al.  Harnessing the Crowdsourcing Power of Social Media for Disaster Relief , 2011, IEEE Intelligent Systems.

[24]  Shady Elbassuoni,et al.  Practical extraction of disaster-relevant information from social media , 2013, WWW.

[25]  Maarit Makinen,et al.  Social Media and Postelection Crisis in Kenya , 2008 .

[26]  Leysia Palen,et al.  Twitter adoption and use in mass convergence and emergency events , 2009 .

[27]  Tiejun Zhao,et al.  HIT at TREC 2012 Microblog Track , 2012, TREC.

[28]  Stephen E. Robertson,et al.  The TREC 2002 Filtering Track Report , 2002, TREC.