Multimedia Social Event Detection in Microblog

Event detection in social media platforms has become an important task. It facilities exploration and browsing of events with early plans for preventive measures. The main challenges in event detection lie in the characteristics of social media data, which are short/conversational, heterogeneous and live. Most of existing methods rely only on the textual information while ignoring the visual content as well as the intrinsic correlation among the heterogeneous social media data. In this paper, we propose an event detection method, which generates an intermediate semantic entity, named microblog clique (MC), to explore the highly correlated information among the noisy and short microblogs. The heterogeneous social media data is formulated as a hypergraph and the highly correlated ones are grouped to generate the MCs. Based on these MCs, a bipartite graph is constructed and partitioned to detect social events. The proposed method has been evaluated on the Brand-Social-Net dataset. Experimental results and comparison with state-of-the-art methods demonstrate the effectiveness of the proposed approach. Further evaluation has shown that the use of the visual content can significantly improve the event detection performance.

[1]  Marti A. Hearst Search User Interfaces , 2009 .

[2]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[3]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[4]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[5]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[6]  Thomas Gottron,et al.  Searching microblogs: coping with sparsity and document quality , 2011, CIKM '11.

[7]  Julio Gonzalo,et al.  A comparison of extrinsic clustering evaluation metrics based on formal constraints , 2008, Information Retrieval.

[8]  Yi Yang,et al.  Discriminative Nonnegative Spectral Clustering with Out-of-Sample Extension , 2013, IEEE Transactions on Knowledge and Data Engineering.

[9]  Hector Garcia-Molina,et al.  Overview of multidatabase transaction management , 2005, The VLDB Journal.

[10]  Yue Gao,et al.  Brand Data Gathering From Live Social Media Streams , 2014, ICMR.

[11]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[12]  Aristides Gionis,et al.  Event detection in activity networks , 2014, KDD.

[13]  Lars Schmidt-Thieme,et al.  Scalable Event-Based Clustering of Social Media Via Record Linkage Techniques , 2011, ICWSM.

[14]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[15]  Hila Becker,et al.  Learning similarity metrics for event identification in social media , 2010, WSDM '10.

[16]  Philipp Cimiano,et al.  Event-based classification of social media streams , 2012, ICMR.

[17]  Julio Gonzalo,et al.  A comparison of extrinsic clustering evaluation metrics based on formal constraints , 2009, Information Retrieval.

[18]  Qingshan Liu,et al.  Image retrieval via probabilistic hypergraph ranking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[20]  Matthew Hurst,et al.  Event Detection and Tracking in Social Streams , 2009, ICWSM.

[21]  Shih-Fu Chang,et al.  Segmentation using superpixels: A bipartite graph partitioning approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Philip S. Yu,et al.  Parameter Free Bursty Events Detection in Text Streams , 2005, VLDB.

[23]  James Allan,et al.  On-Line New Event Detection and Tracking , 1998, SIGIR Forum.