Integrating multiple types of features for event identification in social images

With the rapidly growing popularity of social media sites, a large amount of user-generated data has been injected into the web, and these data cover a wide variety of real-world events. As a consequence, it has become increasingly difficult to browse and organize multimedia collections effectively, especially collections of social multimedia objects. The approach we propose in this study addresses this problem by enabling multimedia collections to be browsed and organized in a natural way, i.e., by events. Several studies have addressed this problem; however, most previous approaches merge the multiple types of features of social media (e.g., textual content, visual content, user information and temporal information) using a relatively simple mechanism. In this study, we merge these feature types in an integrated manner to identify the event associated with user-contributed social multimedia objects. We exploit the correlations between the different feature types to classify new social multimedia objects into their corresponding event categories. We accomplish this through a feature correlation graph (FCG), built for each event and each individual multimedia object, that uses features as nodes and the correlations among these features as edges. We then employ a probabilistic model based on a Markov random field to connect each new multimedia object with the correct event. We evaluate the algorithm on large-scale, real-world datasets of event images downloaded from Flickr, and the experimental results confirm the superiority of our approach over state-of-the-art approaches.
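
To make the FCG idea concrete, the following is a minimal sketch and not the model used in the study. It assumes that each multimedia object has already been reduced to a set of string tokens (hypothetical names such as `tag:music`, `user:alice`, `month:07`; visual features would need to be quantized into similar tokens), that correlations are plain co-occurrence frequencies, and that the Markov random field score is an unnormalized sum of log potentials over nodes and edges. The function names `build_fcg`, `score` and `classify` are illustrative only.

```python
from collections import Counter
from itertools import combinations
import math


def build_fcg(objects):
    """Build a toy feature correlation graph from a list of feature sets.

    Nodes are features weighted by their relative frequency; edges are
    feature pairs weighted by their relative co-occurrence frequency.
    (Assumption: the study may define correlations differently.)
    """
    node_counts = Counter()
    edge_counts = Counter()
    for feats in objects:
        feats = set(feats)
        node_counts.update(feats)
        edge_counts.update(frozenset(p) for p in combinations(sorted(feats), 2))
    n = len(objects)
    nodes = {f: c / n for f, c in node_counts.items()}
    edges = {e: c / n for e, c in edge_counts.items()}
    return nodes, edges


def score(obj_feats, fcg, eps=1e-3):
    """Unnormalized MRF-style log score of an object against one event FCG.

    Sums log node potentials and log pairwise potentials; eps smooths
    features or pairs that never occurred in the event.
    """
    nodes, edges = fcg
    feats = sorted(set(obj_feats))
    total = sum(math.log(nodes.get(f, 0.0) + eps) for f in feats)
    total += sum(
        math.log(edges.get(frozenset(p), 0.0) + eps)
        for p in combinations(feats, 2)
    )
    return total


def classify(obj_feats, event_fcgs):
    """Assign the object to the event whose FCG gives the highest score."""
    return max(event_fcgs, key=lambda ev: score(obj_feats, event_fcgs[ev]))


# Hypothetical toy data: two events, two objects each.
events = {
    "concert": [
        {"tag:music", "user:alice", "month:07"},
        {"tag:music", "tag:stage", "month:07"},
    ],
    "marathon": [
        {"tag:run", "user:bob", "month:04"},
        {"tag:run", "tag:street", "month:04"},
    ],
}
fcgs = {ev: build_fcg(objs) for ev, objs in events.items()}
predicted = classify({"tag:music", "month:07"}, fcgs)
```

In this toy setup the pairwise terms reward a new object whose features co-occur in the same way as in an event's existing objects, which is the intuition behind using edges and not only nodes. A faithful implementation would need learned potentials and the feature extraction described in the full paper.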
