Social networking and tagging have taken off at an unexpected scale and speed, opening huge opportunities to enhance the user search experience. We present Gossple, a new, user-centric, approach to improve the exploration of the Internet. Underlying Gossple lies the intuition that while social networks provides news from your old buddies, you can learn a lot more from people you don't know, but with whom you share many (tagging) interests. More specifically, considering a collaborative tagging system with active taggers annotating content, Gossple expands the search query, of any user u, with tags that are considered "close" enough with respect to users that are "close" to u.
Gossple users create their own network of social acquaintances in a gossip-based manner, by dynamically computing the estimation of a distance between taggers, based on cosine similarity between tags and items. These connections are used to feed a TagMap: our central abstraction that captures the personalised relationships between tags. The TagMap is then used by Gossple to meaningfully expand queries leveraging the personalised network. This is achieved through the TagRank algorithm, an adaptation of the celebrated pagerank algorithm, which automatically determines which tags best expand a list of tags in a given query.
Gossple has no central authority: every user stores its own items and its tagging behaviour is stored only by its neighbours. The resulting networks are live, dynamic and do not require any underlying structure. We report on our evaluation of Gossple with CiteUlike traces, involving 33,834 users. In short, we show that, with little information stored at every peer, Gossple enables to retrieve items that cannot be retrieved with state of the art search systems (completeness).
[1]
Fabio Crestani,et al.
Tag data and personalized information retrieval
,
2008,
SSM '08.
[2]
Laks V. S. Lakshmanan,et al.
Efficient network aware search in collaborative tagging sites
,
2008,
Proc. VLDB Endow..
[3]
Márk Jelasity,et al.
T-Man: Gossip-Based Overlay Topology Management
,
2005,
Engineering Self-Organising Systems.
[4]
Sergey Brin,et al.
The Anatomy of a Large-Scale Hypertextual Web Search Engine
,
1998,
Comput. Networks.
[5]
Anne-Marie Kermarrec,et al.
The Peer Sampling Service: Experimental Evaluation of Unstructured Gossip-Based Implementations
,
2004,
Middleware.
[6]
Ciro Cattuto,et al.
Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems
,
2008,
LWA.
[7]
Licia Capra,et al.
Social ranking: uncovering relevant content using tag-based recommender systems
,
2008,
RecSys '08.
[8]
Dániel Fogaras,et al.
Towards Scaling Fully Personalized PageRank: Algorithms, Lower Bounds, and Experiments
,
2005,
Internet Math..
[9]
Gerhard Weikum,et al.
Peer-to-Peer Information Search: Semantic, Social, or Spiritual?
,
2007,
IEEE Data Eng. Bull..
[10]
Alfons Kemper,et al.
Bulletin of the Ieee Computer Society Technical Committee on Data Engineering
,
1999
.