Real-time local topic extraction using density-based adaptive spatiotemporal clustering for enhancing local situation awareness

In the era of big data, we are witnessing the rapid growth of a new type of information source. In particular, tweets are one of the most widely used microblogging services for situation awareness during emergencies. In our previous work, we focused on geotagged tweets posted on Twitter that included location information as well as a time and text message. We previously developed a real-time analysis system using the (ε,τ)-density-based adaptive spatiotemporal clustering algorithm to analyze local topics and events. The proposed spatiotemporal analysis system successfully detects emerging bursty areas in which geotagged tweets related to observed topics are posted actively; however the system is tailor-made and specialized for a particular observed topic, therefore, it cannot identify other topics. To address this issue, we propose a new real-time spatiotemporal analysis system for enhancing local situation awareness using a density-based adaptive spatiotemporal clustering algorithm. In the proposed system, local bursty keywords are extracted and their bursty areas are identified. We evaluated the proposed system using actual real world topics related to weather in Japan. Experimental results show that the proposed system can extract local topics and events.

[1]  Malik Magdon-Ismail,et al.  Information Cascades in Social Media in Response to a Crisis : a Preliminary Model and a Case Study , 2012 .

[2]  Shaowen Wang,et al.  Spatiotemporal transformation of social media geostreams: a case study of Twitter for flu risk analysis , 2013, IWGS '13.

[3]  Thomas Ertl,et al.  Spatiotemporal anomaly detection through visual analysis of geolocated Twitter messages , 2012, 2012 IEEE Pacific Visualization Symposium.

[4]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[5]  Koji Zettsu,et al.  mTrend: discovery of topic movements on geo-microblogging messages , 2011, GIS.

[6]  Mor Naaman,et al.  Geographic information from georeferenced social media data , 2011, SIGSPACIAL.

[7]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[8]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[9]  Andrea Marchetti,et al.  EARS (earthquake alert and report system): a real time decision support system for earthquake crisis management , 2014, KDD.

[10]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[11]  Rob J Hyndman,et al.  Sample Quantiles in Statistical Packages , 1996 .

[12]  Jie Yin,et al.  Using Social Media to Enhance Emergency Situation Awareness , 2012, IEEE Intelligent Systems.

[13]  Hanna Suominen,et al.  Crisis management knowledge from social media , 2013, ADCS.

[14]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[15]  Yi Fang,et al.  Where not to go?: detecting road hazards using twitter , 2014, SIGIR.