论文信息 - Jointly Event Extraction and Visualization on Twitter via Probabilistic Modelling

Jointly Event Extraction and Visualization on Twitter via Probabilistic Modelling

Event extraction from texts aims to detect structured information such as what has happened, to whom, where and when. Event extraction and visualization are typically considered as two different tasks. In this paper, we propose a novel approach based on probabilistic modelling to jointly extract and visualize events from tweets where both tasks benefit from each other. We model each event as a joint distribution over named entities, a date, a location and event-related keywords. Moreover, both tweets and event instances are associated with coordinates in the visualization space. The manifold assumption that the intrinsic geometry of tweets is a low-rank, non-linear manifold within the high-dimensional space is incorporated into the learning framework using a regularization. Experimental results show that the proposed approach can effectively deal with both event extraction and visualization and performs remarkably better than both the state-of-the-art event extraction method and a pipeline approach for event extraction and visualization.

Deyu Zhou | Yulan He | Tianmeng Gao

[1] Krishnaprasad Thirunarayan,et al. Extracting City Traffic Events from Social Streams , 2015, ACM Trans. Intell. Syst. Technol..

[2] Gilbert L. Peterson,et al. Document Clustering and Visualization with Latent Dirichlet Allocation and Self-Organizing Maps , 2009, FLAIRS.

[3] Liangyu Chen,et al. An Unsupervised Framework of Exploring Events on Twitter: Filtering, Extraction and Categorization , 2015, AAAI.

[4] Keiji Yanai,et al. Visualization of Real-World Events with Geotagged Tweet Photos , 2012, 2012 IEEE International Conference on Multimedia and Expo Workshops.

[5] Hady Wirawan Lauw,et al. Manifold Learning for Jointly Modeling Topic and Visualization , 2014, AAAI.

[6] Thomas L. Griffiths,et al. Parametric Embedding for Class Visualization , 2004, Neural Computation.

[7] Thomas Hofmann,et al. Probabilistic latent semantic indexing , 1999, SIGIR '99.

[8] Craig MacDonald,et al. Can Twitter Replace Newswire for Breaking News? , 2013, ICWSM.

[9] Ezequiel López-Rubio,et al. Self-Organizing Dynamic Graphs , 2004, Neural Processing Letters.

[10] Liangyu Chen,et al. A Simple Bayesian Modelling Approach to Event Extraction from Twitter , 2014, ACL.

[11] Naonori Ueda,et al. Probabilistic latent semantic visualization: topic model for visualizing documents , 2008, KDD.