Exploitation of semantic relationships and hierarchical data structures to support a user in his annotation and browsing activities in folksonomies

In this paper we present a new approach to supporting users to annotate and browse resources referred by a folksonomy. Our approach is characterized by the following novelties: (i) it proposes a probabilistic technique to quickly and accurately determine the similarity and the generalization degrees of two tags; (ii) it proposes two hierarchical structures and two related algorithms to arrange groups of semantically related tags in a hierarchy; this allows users to visualize tags of their interests according to desired semantic granularities and, then, helps them to find those tags best expressing their information needs. In this paper we first illustrate the technical characteristics of our approach; then we describe various experiments allowing its performance to be tested; finally, we compare it with other related approaches already proposed in the literature.

[1]  Christopher H. Brooks,et al.  Improved annotation of the blogosphere via autotagging and hierarchical clustering , 2006, WWW '06.

[2]  Steffen Staab,et al.  Organizing Resources on Tagging Systems using TORG , 2007 .

[3]  David Eppstein,et al.  Fast approximation of centrality , 2000, SODA '01.

[4]  David D. Jensen,et al.  Using structure indices for efficient approximation of network properties , 2006, KDD '06.

[5]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[6]  Nigel Shadbolt,et al.  Tag Meaning Disambiguation through Analysis of Tripartite Structure of Folksonomies , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[7]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[8]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.

[9]  Peter Mika,et al.  Ontologies are us: A unified model of social networks and semantics , 2005, J. Web Semant..

[10]  Rui Li,et al.  Towards effective browsing of large scale social annotations , 2007, WWW '07.

[11]  Vittorio Loreto,et al.  Network properties of folksonomies , 2007, AI Commun..

[12]  Céline Van Damme,et al.  FolksOntology : An Integrated Approach for Turning Folksonomies into Ontologies , 2007 .

[13]  Shlomo Moran,et al.  SALSA: the stochastic approach for link-structure analysis , 2001, TOIS.

[14]  Andreas Hotho,et al.  Mining Association Rules in Folksonomies , 2006, Data Science and Classification.

[15]  Julius T. Tou,et al.  Information Systems , 1973, GI Jahrestagung.

[16]  Edith Cohen,et al.  Size-Estimation Framework with Applications to Transitive Closure and Reachability , 1997, J. Comput. Syst. Sci..

[17]  Marco Colombetti,et al.  Using WordNet to turn a Folksonomy into a Hierarchy of Concepts , 2007, SWAP.

[18]  S. Muthukrishnan,et al.  Generalized substring selectivity estimation , 2003, J. Comput. Syst. Sci..

[19]  Hector Garcia-Molina,et al.  Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems , 2006 .

[20]  Kristina Lerman,et al.  Social Information Processing in News Aggregation , 2007, IEEE Internet Computing.

[21]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[22]  Sriram Raghavan,et al.  Representing Web graphs , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[23]  Yong Yu,et al.  Exploring social annotations for the semantic web , 2006, WWW '06.

[24]  Clifford Stein,et al.  Introduction to Algorithms, 2nd edition. , 2001 .

[25]  Jiawei Han,et al.  Data Mining: Concepts and Techniques, Second Edition , 2006, The Morgan Kaufmann series in data management systems.

[26]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[27]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[28]  Georgia Koutrika,et al.  Can social bookmarking improve web search? , 2008, WSDM '08.

[29]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[30]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[31]  Subhasri Duttagupta,et al.  Developmental informatics at IIT Bombay , 2007, SGMD.

[32]  Enrico Motta,et al.  Integrating Folksonomies with the Semantic Web , 2007, ESWC.

[33]  Community Systems Group Community systems research at Yahoo! , 2007, SGMD.

[34]  Grigory Begelman,et al.  Automated Tag Clustering: Improving search and exploration in the tag space , 2006 .

[35]  Andrei Z. Broder,et al.  On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[36]  David R. Millen,et al.  Dogear: Social bookmarking in the enterprise , 2006, CHI.

[37]  Andrei Z. Broder,et al.  Workshop on Algorithms and Models for the Web Graph , 2007, WAW.

[38]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[39]  Sreenivas Gollapudi,et al.  Using Bloom Filters to Speed Up HITS-Like Ranking Algorithms , 2007, WAW.

[40]  De MeoPasquale,et al.  Exploitation of semantic relationships and hierarchical data structures to support a user in his annotation and browsing activities in folksonomies , 2009 .

[41]  Alan M. Frieze,et al.  Clustering in large graphs and matrices , 1999, SODA '99.

[42]  Dominik Benz,et al.  Supporting collaborative hierarchical classification: Bookmarks as an example , 2007, Comput. Networks.