Sixth International Joint Conference on Natural Language Processing Proceedings of the Workshop on Language Processing and Crisis Information

This paper presents a notification system to identify earthquakes from firsthand reports published on Twitter. Tweets from target regions in Australia and New Zealand are checked for earthquake keyword frequency bursts and then processed to identify evidence of an earthquake. The benefit of our earthquake detector is that it relies on evidence of firsthand ‘felt’ reports from Twitter, provides an indication of the earthquake intensity and will be the trigger for further classification of Tweets for impact analysis. We describe how the detector has been incrementally improved, most notably by the introduction of a text classifier. During its initial five months of operation the system has generated 49 notifications of which 29 related to real earthquake events.

[1]  Taku Kudo,et al.  MeCab : Yet Another Part-of-Speech and Morphological Analyzer , 2005 .

[2]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[3]  David J. C. MacKay,et al.  A hierarchical Dirichlet language model , 1995, Natural Language Engineering.

[4]  Hidetsugu Nanba,et al.  Extracting Transportation Information and Traffic Problems from Tweets during a Disaster , 2012 .

[5]  Andrew McCallum,et al.  Reducing Labeling Effort for Structured Prediction Tasks , 2005, AAAI.

[6]  Philip Fei Wu,et al.  Online Community Response to Major Disaster: A Study of Tianya Forum in the 2008 Sichuan Earthquake , 2009 .

[7]  Eiji Aramaki,et al.  Use trend analysis of twitter after the great east japan earthquake , 2012, CSCW.

[8]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[9]  Burr Settles,et al.  Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances , 2011, EMNLP.

[10]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[11]  Leysia Palen,et al.  Learning from the crowd: Collaborative filtering techniques for identifying on-the-ground Twitterers during mass disruptions , 2012, ISCRAM.

[12]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[13]  Robert Munro Crowdsourced translation for emergency response in Haiti: the global collaboration of local knowledge , 2010, AMTA.

[14]  Jeannie A. Stamberger,et al.  Tweak the tweet: Leveraging microblogging proliferation with a prescriptive syntax to support citizen reporting , 2010, ISCRAM.

[15]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[16]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[17]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[18]  P. Hodor High Precision Information Extraction , 2000 .

[19]  Martha Palmer,et al.  Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events , 2012, LREC.

[20]  Sabine Geldof,et al.  CORAL: using natural language generation for navigational assistance , 2003 .

[21]  Graham Neubig,et al.  Safety Information Mining — What can NLP do in a disaster— , 2011, IJCNLP.

[22]  Petter Holme,et al.  Predictability of population displacement after the 2010 Haiti earthquake , 2012, Proceedings of the National Academy of Sciences.

[23]  Paul A. Viola,et al.  Interactive Information Extraction with Constrained Conditional Random Fields , 2004, AAAI.

[24]  Prem Melville,et al.  Sentiment analysis of blogs by combining lexical knowledge with text classification , 2009, KDD.

[25]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[26]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[27]  Mark D. Uncles,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1987 .

[28]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[29]  Shi Bing,et al.  Inductive learning algorithms and representations for text categorization , 2006 .

[30]  Son Doan,et al.  An analysis of Twitter messages in the 2011 Tohoku Earthquake , 2011, eHealth.

[31]  L. Bengtsson,et al.  Improved Response to Disasters and Outbreaks by Tracking Population Movements with Mobile Phone Network Data: A Post-Earthquake Geospatial Study in Haiti , 2011, PLoS medicine.

[32]  Daniel R. Montello,et al.  Elements of Good Route Directions in Familiar and Unfamiliar Environments , 1999, COSIT.

[33]  Carlos M. Oppus,et al.  A mobile disaster management system using the android technology , 2010 .