Towards Mining Semantic Maturity in Social Bookmarking Systems

The existence of emergent semantics within social metadata (such as tags in bookmarking systems) has been proven by a large number of successful approaches making the implicit semantic structures explicit. However, much less attention has been given to the factors which influence the “maturing” process of these structures over time. A natural hypothesis is that tags become semantically more and more mature whenever many users use them in the same contexts. This would allow to describe a tag by a specific and informative “semantic fingerprint” in the context of tagged resoures. However, the question of assessing the quality of such fingerprints has been seldomly addressed. In this paper, we provide a systematic approach of mining semantic maturity profiles within folksonomy-based tag properties. Our ultimate goal is to provide a characterization of “mature tags”. Additionally, we consider semantic information about the tags as a gold-standard source for the characterization of the collected results. Our initial results suggest that a suitable composition of tag properties allows the identification of more mature tag subsets. The presented work has implications for a number of problems related to social tagging systems, including tag ranking, tag recommendation, and the capturing of light-weight ontologies from tagging data.

[1]  Ciro Cattuto,et al.  Semantic Grounding of Tag Relatedness in Social Bookmarking Systems , 2008, SEMWEB.

[2]  Hector Garcia-Molina,et al.  Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems , 2006 .

[3]  Dominik Benz,et al.  Semantics made by you and me: Self-emerging ontologies can capture the diversity of shared knowledge , 2010 .

[4]  Valentin Robu,et al.  The complex dynamics of collaborative tagging , 2007, WWW '07.

[5]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[6]  Ulrik Brandes,et al.  Centrality Estimation in Large Networks , 2007, Int. J. Bifurc. Chaos.

[7]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[8]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[9]  Dominik Benz,et al.  Stop thinking, start tagging: tag semantics emerge from collaborative verbosity , 2010, WWW '10.

[10]  Andreas Hotho,et al.  Tag Recommendations in Folksonomies , 2007, LWA.

[11]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[12]  Frank Puppe,et al.  Exploiting Background Knowledge for Knowledge-Intensive Subgroup Discovery , 2005, IJCAI.

[13]  Peter Mika,et al.  Ontologies are us: A unified model of social networks and semantics , 2005, J. Web Semant..

[14]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[15]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[16]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[17]  Yong Yu,et al.  An Unsupervised Model for Exploring Hierarchical Semantics from Social Annotations , 2007, ISWC/ASWC.

[18]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[19]  Sofia Angeletou Semantic Enrichment of Folksonomy Tagspaces , 2008, International Semantic Web Conference.

[20]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[21]  Stefan Wrobel,et al.  An Algorithm for Multi-relational Discovery of Subgroups , 1997, PKDD.

[22]  C. Cattuto Semiotic dynamics in online social communities , 2006 .