Real-time streaming intelligence: Integrating graph and NLP analytics

With the growth of social media, embedded sensors, and “smart” devices, those responsible for managing resources during emergencies, such as weather-related disasters, are transitioning from an era of data scarcity to data deluge. During a crisis situation, emergency managers must aggregate various data to assess the situation on the ground, evaluate response plans, give advice to state and local agencies, and inform the public. We make the case that social graph analysis and natural language modeling in real time are paramount to distilling useful intelligence from the large volumes of data available to crisis response personnel. Using ground truth information from social media data surrounding the 2012 Hurricane Sandy in New York City, we test and evaluate our real-time analytics platform to identify immediate and critical information that increases situational awareness during disastrous events.

[1]  David A. Bader,et al.  STINGER : Spatio-Temporal Interaction Networks and Graphs ( STING ) Extensible Representation , 2009 .

[2]  David A. Bader,et al.  STINGER: High performance data structure for streaming graphs , 2012, 2012 IEEE Conference on High Performance Extreme Computing.

[3]  R. Guimerà,et al.  The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  David A. Bader,et al.  Approximating Betweenness Centrality , 2007, WAW.

[5]  David A. Bader,et al.  Massive Social Network Analysis: Mining Twitter for Social Good , 2010, 2010 39th International Conference on Parallel Processing.

[6]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[7]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[8]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[9]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[10]  Melinda T. Gervasio,et al.  Crisis Response Planning: A Task Analysis , 1997 .

[11]  Martin L. Kersten,et al.  MonetDB/DataCell: Online Analytics in a Streaming Column-Store , 2012, Proc. VLDB Endow..

[12]  Chen Huang,et al.  Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake , 2011, CSCW.

[13]  David Ediger,et al.  Scalable Infrastructures for Data in Motion , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[14]  Eric Gilbert,et al.  A statistical framework for streaming graph analysis , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[15]  Thomas Seidl,et al.  MOA: A Real-Time Analytics Open Source Framework , 2011, ECML/PKDD.

[16]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.