Identifying relevant event content for real-time event detection

A variety of event detection algorithms for microblog services have been proposed, but their accuracy relies on the microblog feeds they analyse. Existing research explores datasets that are collected using either a set of manually predefined terms or information from external sources. These methods fail to provide comprehensive and quality feeds for real-time event detection. In this paper, we present a novel adaptive keyword identification approach to retrieve a greater amount of event relevant content. This approach continuously monitors emerging hashtags and rates them by their similarity to specific pre-defined event hashtags using TF-IDF vectors. Top rated emerging hashtags are added as filter criteria in real time. By comparing our proposed approach, called CETRe (Content-based Event Tweet Retrieval) with an existing baseline approach applied to real-world events, we show that CETRe not only identifies event topics and contents, but also enables better event detection.

[1]  Paolo Rosso,et al.  On the difficulty of clustering company tweets , 2010, SMUC '10.

[2]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[3]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[4]  Stefan Poslad,et al.  Exploiting hashtags for adaptive microblog crawling , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[5]  Feng Liang,et al.  Exploiting real-time information retrieval in the microblogosphere , 2012, JCDL '12.

[6]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[7]  Stefan Poslad,et al.  Adaptive Identification of Hashtags for Real-Time Event Data Collection , 2015, Recommendation and Search in Social Networks.

[8]  Regina Barzilay,et al.  Event Discovery in Social Media Feeds , 2011, ACL.

[9]  Alan F. Smeaton,et al.  Using Twitter to Detect and Tag Important Events in Sports Media , 2011, ICWSM.

[10]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[11]  Ilknur Celik,et al.  Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter , 2011, SEMWEB.

[12]  Ari Rappoport,et al.  What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities , 2012, WSDM '12.

[13]  Hila Becker,et al.  Automatic Identification and Presentation of Twitter Content for Planned Events , 2011, ICWSM.

[14]  Christopher C. Yang,et al.  Discovering event evolution graphs from newswires , 2006, WWW '06.

[15]  Hila Becker,et al.  Identification and Characterization of Events in Social Media , 2011 .

[16]  Michelle R. Guy,et al.  Twitter earthquake detection: earthquake monitoring in a social world , 2012 .