From Web 1.0 to Web 2.0 and back: how did your grandma use to tag?

We examine whether terms extracted from anchortext can serve as Web page descriptions in the form of tags. Using a relatively simple, easy-to-apply method, we show that anchortext overlaps significantly with tags obtained from the popular tagging portal del.icio.us. Given the size and diversity of the user community potentially involved in social tagging, this observation is rather surprising. Furthermore, an evaluation based on human-created relevance assessments shows that anchortext-based tag generation is generally suitable, as measured by user-perceived precision. Awareness of this easy-to-obtain source of tags could trigger the rise of new tagging portals bootstrapped by this automatic process, or it could be applied in existing portals to increase the number of tags per page simply by drawing on anchortext that already exists.
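The abstract does not spell out the extraction method, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that candidate tags are the lowercased word tokens of anchortext on links pointing at a target URL, that a small hand-picked stopword list is removed, and that overlap is measured as the fraction of user-assigned tags that also occur among the anchortext terms. The names `AnchorTextCollector`, `anchor_terms`, `tag_overlap`, and `STOPWORDS`, as well as the sample pages and tags, are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser
import re

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = frozenset({"a", "an", "and", "the", "of", "to", "for",
                       "here", "click", "this"})


class AnchorTextCollector(HTMLParser):
    """Collects (href, anchortext) pairs for every <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.anchors = []
        self._href = None    # href of the link currently being read, if any
        self._parts = []     # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._parts = []

    def handle_data(self, data):
        if self._href is not None:
            self._parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.anchors.append((self._href, "".join(self._parts)))
            self._href = None


def anchor_terms(pages, target_url):
    """Count, per term, how many links to target_url use it in their anchortext."""
    counts = Counter()
    for html in pages:
        collector = AnchorTextCollector()
        collector.feed(html)
        collector.close()
        for href, text in collector.anchors:
            if href == target_url:
                terms = set(re.findall(r"[a-z0-9]+", text.lower())) - STOPWORDS
                counts.update(terms)  # each term counted once per link
    return counts


def tag_overlap(candidate_terms, user_tags):
    """Fraction of user tags that also appear among the candidate terms."""
    tags = {t.lower() for t in user_tags}
    if not tags:
        return 0.0
    return len(tags & set(candidate_terms)) / len(tags)


# Made-up example data: three pages, two of which link to the target.
pages = [
    '<p>See the <a href="http://example.org/">Python tutorial</a>.</p>',
    '<a href="http://example.org/">click here for python programming</a>',
    '<a href="http://other.example/">unrelated link</a>',
]
terms = anchor_terms(pages, "http://example.org/")
overlap = tag_overlap(terms, ["python", "programming", "reference"])
```

Counting each term once per link, rather than once per occurrence, is a design choice made here so that a single verbose link cannot dominate the ranking. Exact string matching on the URL is a simplification; in practice URLs would need normalization before comparison.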
