Mining Tags from Flickr User Comments Using a Hybrid Ranking Model

In the Web2.0 era, user generated content has become the main source of information of many popular websites such as Flickr. In Flickr, each user can share his/her photos and browse others' easily. Tagging system is an important approach to the photo management in Flickr. Users can browse photos by clicking their attached tags. However, many photos have very few or even no tags, because only the up loader can mark tags for the photo. Meanwhile, when a user browses the photo he/she is interested in, he/she may have comments to express his/her independent viewpoint on the photo. Therefore, it is critical to recommend new tags or enrich the existing tag set based on user comments. Relying on Natural Language Processing (NLP) techniques, this paper introduces a word-based method in generating candidate tags extracted from user comments. In the phase of sorting and recommending tags, we propose an algorithm by jointly modeling the location information of candidate tags, statistical information and semantic similarity. Extensive experimental results demonstrate the effectiveness of our method.

[1]  Lars Schmidt-Thieme,et al.  Collaborative Tag Recommendations , 2007, GfKl.

[2]  Luca Cagliero,et al.  A Rule-Based Flickr Tag Recommendation System , 2013, Social Media Retrieval.

[3]  Daniel Gatica-Perez,et al.  Modeling Semantic Aspects for Cross-Media Image Indexing , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Andreas Hotho,et al.  FolkRank : A Ranking Algorithm for Folksonomies , 2006, LWA.

[5]  Geert-Jan Houben,et al.  Identification of useful user comments in social media: a case study on flickr commons , 2013, JCDL '13.

[6]  Jing Liu,et al.  Image annotation via graph learning , 2009, Pattern Recognit..

[7]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[8]  Yong Wang,et al.  Combining global, regional and contextual features for automatic image annotation , 2009, Pattern Recognit..

[9]  Federica Cena,et al.  Tag-based user modeling for social multi-device adaptive guides , 2008, User Modeling and User-Adapted Interaction.

[10]  Jia Wang,et al.  User comments for news recommendation in forum-based social media , 2010, Inf. Sci..

[11]  Ashish Sureka Mining User Comment Activity for Detecting Forum Spammers in YouTube , 2011, ArXiv.

[12]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[14]  Robert Graham,et al.  Plurality: a context-aware personalized tagging system , 2008, WWW.

[15]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[16]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[17]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.