Neighborhood-Based Tag Prediction

We consider the problem of tag prediction in collaborative tagging systems where users share and annotate resources on the Web. We put forward HAMLET, a novel approach to automatically propagate tags along the edges of a graph which relates similar documents. We identify the core principles underlying tag propagation for which we derive suitable scoring models combined in one overall ranking formula. Leveraging these scores, we present an efficient top-k tag selection algorithm that infers additional tags by carefully inspecting neighbors in the document graph. Experiments using real-world data demonstrate the viability of our approach in large-scale environments where tags are scarce.

[1]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[2]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[3]  Joost N. Kok,et al.  Knowledge Discovery in Databases: PKDD 2007, 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw, Poland, September 17-21, 2007, Proceedings , 2007, PKDD.

[4]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[5]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS.

[6]  Gerhard Weikum,et al.  Efficient top-k querying over social-tagging networks , 2008, SIGIR '08.

[7]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[8]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[9]  Yang Song,et al.  Real-time automatic tag recommendation , 2008, SIGIR '08.

[10]  Karl Aberer,et al.  From Web 1.0 to Web 2.0 and back -: how did your grandma use to tag? , 2008, WIDM '08.

[11]  Siegfried Handschuh,et al.  P-TAG: large scale automatic generation of personalized annotation tags for the web , 2007, WWW '07.

[12]  Wisam Dakka,et al.  Augmenting Wikipedia with Named Entity Tags , 2008, IJCNLP.

[13]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[14]  Éva Tardos,et al.  Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[15]  Andreas Hotho,et al.  Tag Recommendations in Folksonomies , 2007, LWA.

[16]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[17]  Gerhard Weikum,et al.  Graph-based text classification: learn from your neighbors , 2006, SIGIR.

[18]  Mor Naaman,et al.  HT06, tagging paper, taxonomy, Flickr, academic article, to read , 2006, HYPERTEXT '06.

[19]  Yong Yu,et al.  Exploring social annotations for the semantic web , 2006, WWW '06.

[20]  Mor Naaman,et al.  Towards automatic extraction of event and place semantics from flickr tags , 2007, SIGIR.

[21]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[22]  Ronald Fagin,et al.  Combining Fuzzy Information from Multiple Systems , 1999, J. Comput. Syst. Sci..

[23]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[24]  Gustavo Rossi,et al.  Web Engineering , 2001, Lecture Notes in Computer Science.

[25]  Mark B. Sandler,et al.  Automatic Interlinking of Music Datasets on the Semantic Web , 2008, LDOW.

[26]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[27]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[28]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[29]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[30]  Giuseppe Attardi,et al.  Ranking very many typed entities on wikipedia , 2007, CIKM '07.

[31]  Dan Brickley,et al.  SKOS Core: Simple knowledge organisation for the Web , 2005, Dublin Core Conference.

[32]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[33]  Brian D. Davison Topical locality in the Web , 2000, SIGIR '00.

[34]  Razvan C. Bunescu,et al.  Using Encyclopedic Knowledge for Named entity Disambiguation , 2006, EACL.

[35]  Min Wang,et al.  A declarative framework for semantic link discovery over relational data , 2009, WWW '09.

[36]  Satoshi Nakamura,et al.  Towards Improving Web Search by Utilizing Social Bookmarks , 2007, ICWE.

[37]  Zibin Zheng,et al.  Distributed QoS Evaluation for Real-World Web Services , 2010, 2010 IEEE International Conference on Web Services.

[38]  Brian Hayes,et al.  What Is Cloud Computing? , 2019, Cloud Technologies.

[39]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[40]  Martin Hepp,et al.  Harvesting Wiki Consensus - Using Wikipedia Entries as Ontology Elements , 2006, SemWiki.

[41]  Valentin Robu,et al.  The complex dynamics of collaborative tagging , 2007, WWW '07.

[42]  Martin Gaedke,et al.  Silk - A Link Discovery Framework for the Web of Data , 2009, LDOW.

[43]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[44]  Hector Garcia-Molina,et al.  Social tag prediction , 2008, SIGIR '08.

[45]  Michael R. Lyu,et al.  Effective missing data prediction for collaborative filtering , 2007, SIGIR.

[46]  Junfeng Zhao,et al.  Personalized QoS Prediction forWeb Services via Collaborative Filtering , 2007, IEEE International Conference on Web Services (ICWS 2007).