Classifying Text Messages for the Haiti Earthquake

In emergencies (e.g., earthquakes, flooding), rapid responses are needed to address victims’ requests for help. Social media use during crises involves self-organizing behavior that can produce accurate results, often in advance of official communications. It allows the affected population to send tweets or text messages and thereby make themselves heard. The ability to classify tweets and text messages automatically, together with the ability to deliver the relevant information to the appropriate personnel, is essential for enabling those personnel to work promptly and efficiently on the most urgent needs and to understand the emergency situation better. In this study, we developed a reusable information technology infrastructure, called Enhanced Messaging for the Emergency Response Sector (EMERSE), which classifies and aggregates tweets and text messages about the Haiti disaster relief so that non-governmental organizations, relief workers, people in Haiti, and their friends and families can easily access them.
