Twitter Mining for Disaster Response: A Domain Adaptation Approach

Microblogging data such as Twitter data contains valuable information that has the potential to help improve the speed, quality, and efficiency of disaster response. Machine learning can help with this by prioritizing the tweets with respect to various classification criteria. However, supervised learning algorithms require labeled data to learn accurate classifiers. Unfortunately, for a new disaster, labeled tweets are not easily available, while they are usually available for previous disasters. Furthermore, unlabeled tweets from the current disaster are accumulating fast. We study the usefulness of labeled data from a prior source disaster, together with unlabeled data from the current target disaster to learn domain adaptation classifiers for the target. Experimental results suggest that, for some tasks, source data itself can be useful for classifying target data. However, for tasks specific to a particular disaster, domain adaptation approaches that use target unlabeled data in addition to source labeled data are superior.

[1]  Cornelia Caragea,et al.  Mapping moods: Geo-mapped sentiment analysis during hurricane sandy , 2014, ISCRAM.

[2]  R.J.P. Stronkman,et al.  Towards a realtime Twitter analysis during crises for operational crisis management , 2012, ISCRAM.

[3]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[4]  Leysia Palen,et al.  Chatter on the red: what hazards threat reveals about the social life of microblogged information , 2010, CSCW '10.

[5]  Shady Elbassuoni,et al.  Practical extraction of disaster-relevant information from social media , 2013, WWW.

[6]  Leysia Palen,et al.  Online public communications by police & fire services during the 2012 Hurricane Sandy , 2014, CHI.

[7]  Qiang Yang,et al.  Transferring Naive Bayes Classifiers for Text Classification , 2007, AAAI.

[8]  Fernando Diaz,et al.  Emergency-relief coordination on social media: Automatically matching resource requests and offers , 2013, First Monday.

[9]  B. Weitz Hosted By , 2003 .

[10]  Hongbo Xu,et al.  Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis , 2009, ECIR.

[11]  Aron Culotta,et al.  Tweedr: Mining twitter to inform disaster response , 2014, ISCRAM.

[12]  Amanda Lee Hughes,et al.  Crisis in a Networked World , 2009 .

[13]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[14]  Nic Herndon,et al.  Empirical Study of Domain Adaptation with Naïve Bayes on the Task of Splice Site Prediction , 2014, BIOINFORMATICS.

[15]  Fernando Diaz,et al.  Extracting information nuggets from disaster- Related messages in social media , 2013, ISCRAM.

[16]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[17]  Viswa Mani Kiran Peddinti,et al.  Domain Adaptation in Sentiment Analysis of Twitter , 2011, Analyzing Microtext.