A Simple Bayesian Modelling Approach to Event Extraction from Twitter

With the proliferation of social media sites, social streams have proven to contain the most up-to-date information on current events. Therefore, it is crucial to extract events from the social streams such as tweets. However, it is not straightforward to adapt the existing event extraction systems since texts in social media are fragmented and noisy. In this paper we propose a simple and yet effective Bayesian model, called Latent Event Model (LEM), to extract structured representation of events from social media. LEM is fully unsupervised and does not require annotated data for training. We evaluate LEM on a Twitter corpus. Experimental results show that the proposed model achieves 83% in F-measure, and outperforms the state-of-the-art baseline by over 7%.

[1]  Craig MacDonald,et al.  Can Twitter Replace Newswire for Breaking News? , 2013, ICWSM.

[2]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[3]  Jakub Piskorski,et al.  Real-Time News Event Extraction for Global Crisis Monitoring , 2008, NLDB.

[4]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[5]  Jakub Piskorski,et al.  Extracting Violent Events From On-Line News for Ontology Population , 2007, BIS.

[6]  Ralph Weischedel,et al.  PERFORMANCE MEASURES FOR INFORMATION EXTRACTION , 2007 .

[7]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Ralph Grishman,et al.  NYU's English ACE 2005 System Description , 2005 .

[9]  Uzay Kaymak,et al.  An Overview of Event Extraction from Text , 2011, DeRiVE@ISWC.

[10]  Angel X. Chang,et al.  SUTime: A library for recognizing and normalizing time expressions , 2012, LREC.

[11]  Jakub Piskorski,et al.  Cluster-Centric Approach to News Event Extraction , 2008, New Trends in Multimedia and Network Information Systems.

[12]  Ming Zhou,et al.  Exacting Social Events for Tweets Using a Factor Graph , 2012, AAAI.

[13]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[14]  Regina Barzilay,et al.  Event Discovery in Social Media Feeds , 2011, ACL.

[15]  Inderjeet Mani,et al.  Robust Temporal Processing of News , 2000, ACL.