Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events

In times of mass emergency, vast amounts of data are generated via computer-mediated communication (CMC) that are difficult to manually collect and organize into a coherent picture. Yet valuable information is broadcast, and can provide useful insight into time- and safety-critical situations if captured and analyzed efficiently and effectively. We describe a natural language processing component of the EPIC (Empowering the Public with Information in Crisis) Project infrastructure, designed to extract linguistic and behavioral information from tweet text to aid in the task of information integration. The system incorporates linguistic annotation, in the form of Named Entity Tagging, as well as behavioral annotations to capture tweets contributing to situational awareness and analyze the information type of the tweet content. We show classification results and describe future integration of these classifiers in the larger EPIC infrastructure.

[1]  Martha Palmer,et al.  Getting the Most out of Transition-based Dependency Parsing , 2011, ACL.

[2]  John R. Harrald,et al.  Shared Situational Awareness in Emergency Management Mitigation and Response , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[3]  Martha Palmer,et al.  Twitter in mass emergency: what NLP techniques can contribute , 2010, HLT-NAACL 2010.

[4]  Kenneth Mark Anderson,et al.  Design and implementation of a data analytics infrastructure in support of crisis informatics research: NIER track , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[5]  James H. Martin,et al.  A vision for technology-mediated support for public participation & assistance in mass emergencies & disasters , 2010 .

[6]  Amanda Lee Hughes,et al.  Crisis in a Networked World , 2009 .

[7]  B. Weitz Hosted By , 2003 .

[8]  Philip V. Ogren,et al.  Knowtator: A Protégé plug-in for annotated corpus construction , 2006, NAACL.

[9]  Yan Qu,et al.  Online Community Response to Major Disaster: A Study of Tianya Forum in the 2008 Sichuan Earthquake , 2009, 2009 42nd Hawaii International Conference on System Sciences.

[10]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[11]  Mica R. Endsley,et al.  Toward a Theory of Situation Awareness in Dynamic Systems , 1995, Hum. Factors.

[12]  Leysia Palen,et al.  Natural Language Processing to the Rescue? Extracting "Situational Awareness" Tweets During Mass Emergency , 2011, ICWSM.

[13]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[14]  Leysia Palen,et al.  Chatter on the red: what hazards threat reveals about the social life of microblogged information , 2010, CSCW '10.

[15]  Mica R. Endsley,et al.  Theoretical Underpinnings of Situation Awareness, A Critical Review , 2000 .

[16]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[17]  Martha Palmer,et al.  Transition-based Semantic Role Labeling Using Predicate Argument Clustering , 2011, RELMS@ACL.

[18]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .