Social multi-modal event analysis via knowledge-based weighted topic model

Abstract Along with the development of mobile Internet and social network, people’s lifestyles are also changing, and many social websites (e.g. Facebook, YouTube, and WeChat) have sprung up, which leads to the emergence of a large number of multimedia data (e.g. text, image, and video) of various events. The goal of this paper is to mine event topics efficiently from massive and unordered social media data, which is beneficial to search, browse and monitor significant social events for users or governments. In order to achieve this goal, this paper proposes a novel Knowledge-Based Multi-Modal Weighted Topic Model (KBMMWTM) for social event analysis. The proposed KBMMWTM has following advantages: (1) The proposed KBMMWTM can effectively take advantage of the multi-modality of social events jointly. (2) The proposed KBMMWTM exploits word correlation in dataset as prior knowledge to improve the performance of event topic mining. We evaluate our KBMMWTM model on a real dataset and full experiments show that our model outperforms state-of-the-art models.

[1]  Gregor Heinrich,et al.  A Generic Approach to Topic Models , 2009, ECML/PKDD.

[2]  M. Shamim Hossain,et al.  Automatic Visual Concept Learning for Social Event Understanding , 2015, IEEE Transactions on Multimedia.

[3]  Helena Ahonen-Myka,et al.  Simple Semantics in Topic Detection and Tracking , 2004, Information Retrieval.

[4]  Hongfei Yan,et al.  Comparing Twitter and Traditional Media Using Topic Models , 2011, ECIR.

[5]  Meng Wang,et al.  Learning User Attributes via Mobile Social Multimedia Analytics , 2017, ACM Trans. Intell. Syst. Technol..

[6]  Chong-Wah Ngo,et al.  Multimodal News Story Clustering With Pairwise Visual Near-Duplicate Constraint , 2008, IEEE Transactions on Multimedia.

[7]  Igor Brigadir,et al.  Event Detection in Twitter using Aggressive Filtering and Hierarchical Tweet Clustering , 2014, SNOW-DC@WWW.

[8]  Gang Hua,et al.  Semantic Model Vectors for Complex Video Event Recognition , 2012, IEEE Transactions on Multimedia.

[9]  Mor Naaman,et al.  Diamonds in the rough: Social media visual analytics for journalistic inquiry , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[10]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[11]  Changsheng Xu,et al.  Cross-Domain Multi-Event Tracking via CO-PMHT , 2014, TOMM.

[12]  James Allan,et al.  On-Line New Event Detection and Tracking , 1998, SIGIR Forum.

[13]  Jian Yang,et al.  Incorporating Tweet Relationships into Topic Derivation , 2015, PACLING.

[14]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[15]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[16]  Dimitrios Tzovaras,et al.  Multi-Objective Optimization for Multimodal Visualization , 2014, IEEE Transactions on Multimedia.

[17]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[18]  Yi Yang,et al.  Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations , 2015, ACM Multimedia.