Network-aware search in social tagging applications: instance optimality versus efficiency

This paper considers top-k query answering in social applications, with a focus on social tagging. This problem requires a significant departure from socially agnostic techniques. In a network-aware context, one can (and should) exploit the social links, which can indicate how users relate to the seeker and how much weight their tagging actions should carry when building the result. We propose algorithms that have the potential to scale to current applications. Although the problem has been considered in previous literature, this was done either under strong simplifying assumptions or under choices that cannot scale to even moderate-size real-world applications. We first revisit a key aspect of the problem: accessing the closest or most relevant users for a given seeker. We describe how this can be done on the fly (without any precomputation) for several possible choices, arguably the most natural ones, of proximity computation in a user network. Based on this, our top-k algorithm is sound and complete, and it addresses the applicability issues of the existing algorithms. Moreover, it performs significantly better in general and is instance optimal when the search relies exclusively on the social weight of tagging actions. To further address the efficiency needs of online applications, for which exact search, albeit optimal, may still be expensive, we then consider approximate algorithms. These rely either on concise statistics about the social network or on approximate shortest-path computations. Extensive experiments on real-world data from Twitter show that our techniques can drastically improve response time without sacrificing precision.
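To make the idea of on-the-fly proximity access concrete, the following is a minimal sketch and not the paper's algorithm. It assumes one possible proximity measure (the best product of edge weights in (0, 1] along any path from the seeker), visits users in non-increasing proximity with a Dijkstra-style traversal, and scores items by social weight only. It scans every reachable user, so it omits the early termination that an instance-optimal algorithm would need. The names `users_by_proximity`, `social_top_k`, `graph`, and `tagged` are hypothetical.

```python
import heapq
from collections import defaultdict


def users_by_proximity(graph, seeker):
    """Yield (user, proximity) in non-increasing proximity to the seeker.

    Assumption: a path's proximity is the product of its edge weights,
    each in (0, 1], and a user's proximity is that of the best path.
    """
    best = {seeker: 1.0}
    heap = [(-1.0, seeker)]  # max-heap simulated with negated proximities
    done = set()
    while heap:
        neg_p, user = heapq.heappop(heap)
        if user in done:
            continue  # stale heap entry for an already-settled user
        done.add(user)
        if user != seeker:  # the seeker's own tags are ignored here
            yield user, -neg_p
        for neighbor, weight in graph.get(user, {}).items():
            p = -neg_p * weight
            if neighbor not in done and p > best.get(neighbor, 0.0):
                best[neighbor] = p
                heapq.heappush(heap, (-p, neighbor))


def social_top_k(graph, tagged, seeker, tag, k):
    """Return the k items with the highest summed tagger proximity."""
    scores = defaultdict(float)
    for user, proximity in users_by_proximity(graph, seeker):
        for item in tagged.get((user, tag), ()):
            scores[item] += proximity
    # Sort by descending score, breaking ties by item name.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


# Hypothetical toy network: edge weights are link strengths.
graph = {
    "alice": {"bob": 0.9, "carol": 0.5},
    "bob": {"alice": 0.9, "carol": 0.8},
    "carol": {"alice": 0.5, "bob": 0.8, "dave": 0.5},
    "dave": {"carol": 0.5},
}
# Hypothetical tagging actions: (user, tag) -> items tagged.
tagged = {
    ("bob", "jazz"): ["i1"],
    ("carol", "jazz"): ["i1", "i2"],
    ("dave", "jazz"): ["i3"],
}
top = social_top_k(graph, tagged, "alice", "jazz", 2)
```

In this toy network, carol is reached through bob rather than directly, because the two-hop path has the higher weight product. Because users arrive in non-increasing proximity, a threshold-style algorithm could bound the score still obtainable from unseen users and stop early; this sketch leaves that step out.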
