Event extraction from Twitter using Non-Parametric Bayesian Mixture Model with Word Embeddings

To extract structured representations of newsworthy events from Twitter, unsupervised models typically assume that tweets involving the same named entities and expressed using similar words are likely to belong to the same event. Hence, they group tweets into clusters based on the co-occurrence patterns of named entities and topical keywords. However, there are two main limitations. First, they require the number of events to be known beforehand, which is not realistic in practical applications. Second, they don’t recognise that the same named entity might be referred to by multiple mentions and tweets using different mentions would be wrongly assigned to different events. To overcome these limitations, we propose a non-parametric Bayesian mixture model with word embeddings for event extraction, in which the number of events can be inferred automatically and the issue of lexical variations for the same named entity can be dealt with properly. Our model has been evaluated on three datasets with sizes ranging between 2,499 and over 60 million tweets. Experimental results show that our model outperforms the baseline approach on all datasets by 5-8% in F-measure.

[1]  Vasudeva Varma,et al.  Structured Information Extraction from Natural Disaster Events on Twitter , 2014, Web-KR '14.

[2]  Krishnaprasad Thirunarayan,et al.  Extracting City Traffic Events from Social Streams , 2015, ACM Trans. Intell. Syst. Technol..

[3]  Eugene Agichtein,et al.  SEEFT: Planned Social Event Discovery and Attribute Extraction by Fusing Twitter and Web Content , 2015, ICWSM.

[4]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[5]  Jakub Piskorski,et al.  Cluster-Centric Approach to News Event Extraction , 2008, New Trends in Multimedia and Network Information Systems.

[6]  Liangyu Chen,et al.  A Simple Bayesian Modelling Approach to Event Extraction from Twitter , 2014, ACL.

[7]  Ming Zhou,et al.  Exacting Social Events for Tweets Using a Factor Graph , 2012, AAAI.

[8]  Regina Barzilay,et al.  Event Discovery in Social Media Feeds , 2011, ACL.

[9]  Michael Gertz,et al.  EvenTweet: Online Localized Event Detection from Twitter , 2013, Proc. VLDB Endow..

[10]  Jakub Piskorski,et al.  Real-Time News Event Extraction for Global Crisis Monitoring , 2008, NLDB.

[11]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[12]  P. Green,et al.  Modelling Heterogeneity With and Without the Dirichlet Process , 2001 .

[13]  M. Escobar,et al.  Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[14]  Ralph Grishman,et al.  NYU's English ACE 2005 System Description , 2005 .

[15]  Jordi Torres,et al.  Tweet-SCAN: An event discovery technique for geo-located tweets , 2017, Pattern Recognit. Lett..

[16]  Minh-Tien Nguyen,et al.  TSum4act: A Framework for Retrieving and Summarizing Actionable Tweets During a Disaster for Reaction , 2015, PAKDD.

[17]  Liangyu Chen,et al.  An Unsupervised Framework of Exploring Events on Twitter: Filtering, Extraction and Categorization , 2015, AAAI.

[18]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[19]  H. Ishwaran,et al.  Exact and approximate sum representations for the Dirichlet process , 2002 .

[20]  Ana-Maria Popescu,et al.  Extracting events and event descriptions from Twitter , 2011, WWW.

[21]  D. Aldous Exchangeability and related topics , 1985 .

[22]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[23]  Craig MacDonald,et al.  Can Twitter Replace Newswire for Breaking News? , 2013, ICWSM.

[24]  Jim Pitman,et al.  Poisson–Dirichlet and GEM Invariant Distributions for Split-and-Merge Transformations of an Interval Partition , 2002, Combinatorics, Probability and Computing.

[25]  Jun Hu,et al.  What Is New in Our City? A Framework for Event Extraction Using Social Media Posts , 2015, PAKDD.