Socialtagger - collaborative tagging for blogs in the long tail

Social bookmarking is the process through which users share tags for online resources like blogs with others. Such collaborative tags provide valuable metadata for retrieval systems. While the successes of collaborative tagging systems have been demonstrated by popular websites like Del.icio.us, these sites cover only a small fraction of the available blogs on the web. The vast majority of the blogs are not available on any collaborative tagging system and are often tagged only by the authors. This lack of coverage of collaborative tags is a considerable roadblock in using the tag metadata in a web scale information retrieval system. To solve this problem we propose and implement a system to automatically recommend collaborative tags for a blog. The automatically generated tags will help to surface the blogs by making them available on social book marking sites and allow them to be easily discovered and potentially further tagged by a wider population.

[1]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[2]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[3]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[4]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[5]  Gilad Mishne,et al.  AutoTag: a collaborative approach to automated tag assignment for weblog posts , 2006, WWW '06.

[6]  Valentin Robu,et al.  The complex dynamics of collaborative tagging , 2007, WWW '07.

[7]  Nello Cristianini,et al.  Inferring a Semantic Representation of Text via Cross-Language Correlation Analysis , 2002, NIPS.

[8]  Hsinchun Chen,et al.  Collaborative systems: solving the vocabulary problem , 1994, Computer.

[9]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[10]  Grigory Begelman,et al.  Automated Tag Clustering: Improving search and exploration in the tag space , 2006 .

[11]  Lawrence Birnbaum,et al.  TagAssist: Automatic Tag Suggestion for Blog Posts , 2007, ICWSM.

[12]  George Macgregor,et al.  Collaborative tagging as a knowledge organisation and resource discovery tool , 2006 .

[13]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .

[14]  Shankara B. Subramanya,et al.  Clustering Blogs with Collective Wisdom , 2008, 2008 Eighth International Conference on Web Engineering.

[15]  Christopher H. Brooks,et al.  Improved annotation of the blogosphere via autotagging and hierarchical clustering , 2006, WWW '06.

[16]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[17]  D. Watts Is Justin Timberlake a product of cumulative advantage , 2007 .

[18]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .