Learning Contextualised Weblog Topics

The blogosphere is the distributed network of user opinions published on the WWW. Whereas centralized review sites such as Amazon.com previously allowed users to post opinions on goods such as books and CDs, blogging software allows users to publish opinions on any topic, free of the constraints of a predefined schema. However, centralized review sites have one significant advantage: reviews pertaining to a single topic are collected in one place, allowing readers to peruse a diverse range of opinions quickly. In this paper we examine how such a topic-centric view of the blogosphere can be created. We characterise the problems in aligning similar concepts created by a set of distributed, autonomous users and describe current initiatives to solve them. Finally, we introduce the Tagsocratic project, a novel initiative to solve the concept alignment problem using techniques derived from research in language acquisition among distributed, autonomous agents.
