Jointly Event Extraction and Visualization on Twitter via Probabilistic Modelling

Event extraction from text aims to detect structured information such as what happened, to whom, where and when. Event extraction and visualization are typically treated as two separate tasks. In this paper, we propose a novel approach based on probabilistic modelling that jointly extracts and visualizes events from tweets, so that the two tasks benefit from each other. We model each event as a joint distribution over named entities, a date, a location and event-related keywords. Moreover, both tweets and event instances are associated with coordinates in the visualization space. The manifold assumption, namely that the intrinsic geometry of tweets is a low-rank, non-linear manifold within the high-dimensional space, is incorporated into the learning framework through a regularization term. Experimental results show that the proposed approach handles both event extraction and visualization effectively and performs remarkably better than both the state-of-the-art event extraction method and a pipeline approach to event extraction and visualization.
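The abstract does not give the form of the manifold regularizer, so the following is only a minimal sketch of one common choice: a graph-Laplacian smoothness penalty that is small when tweets that are neighbours in the high-dimensional feature space also receive nearby visualization coordinates. The k-nearest-neighbour graph, the 0/1 edge weights, and the names `knn_adjacency` and `manifold_penalty` are assumptions made for illustration, not the paper's definitions.

```python
import numpy as np


def knn_adjacency(features, k):
    """Symmetric 0/1 k-nearest-neighbour graph over the rows of `features`.

    Assumption: Euclidean distance in feature space defines neighbourhoods.
    """
    n = features.shape[0]
    sq_norms = np.sum(features ** 2, axis=1)
    # Pairwise squared Euclidean distances.
    dist = sq_norms[:, None] + sq_norms[None, :] - 2.0 * features @ features.T
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour
    nearest = np.argsort(dist, axis=1)[:, :k]
    adjacency = np.zeros((n, n))
    rows = np.repeat(np.arange(n), k)
    adjacency[rows, nearest.ravel()] = 1.0
    # Keep an edge if either endpoint lists the other as a neighbour.
    return np.maximum(adjacency, adjacency.T)


def manifold_penalty(coords, adjacency):
    """Laplacian penalty tr(C^T L C), with L = D - W.

    For symmetric W this equals 0.5 * sum_ij W_ij * ||c_i - c_j||^2,
    i.e. the summed squared distance between coordinates of linked tweets.
    """
    laplacian = np.diag(adjacency.sum(axis=1)) - adjacency
    return float(np.trace(coords.T @ laplacian @ coords))
```

In a joint model of this kind, such a penalty would typically be added to the negative log-likelihood with a weighting coefficient, so that the learned coordinates trade off fit to the data against smoothness over the neighbourhood graph.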
