Semantics made by you and me: Self-emerging ontologies can capture the diversity of shared knowledge

The participatory nature of many Web 2.0 platforms makes a large portion of users’ interactions with each other and with information resources digitally observable. The assumption that the evolving structure of these digital records contains implicit evidences for the underlying semantics has been proven by successful approaches of making the emergent semantics explicit, e.g. in the form of lightweight ontologies. In this paper, we provide further evidence for the great potential of self-emerging ontologies from Web 2.0 data, exemplified by collaborative tagging systems. We hereby combine and extend prior research, where we identified crucial aspects for successful methods to infer tag semantics. The additional contribution of this paper is to propose an extended methodology to induce a hierarchical organization scheme from the initially flat tag space which captures the semantics and the diversity of the shared knowledge. It comprises the introduction of a synsetized folksonomy (which tackles the problem of synonymous tags) and a clustering approach for tag sense disambiguation. In order to assess the quality of the learned semantics, we compare the inferred organization scheme with manually built categorization schemes from WordNet and Wikipedia. Our results exhibit clear similarities; so in summary, our work demonstrates a successful example of self-emergent ontologies from Web 2.0 data.

[1]  Simone Paolo Ponzetto,et al.  Deriving a Large-Scale Taxonomy from Wikipedia , 2007, AAAI.

[2]  Andreas Hotho,et al.  Mining Association Rules in Folksonomies , 2006, Data Science and Classification.

[3]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[4]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[5]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[6]  Ciro Cattuto,et al.  Semantic Grounding of Tag Relatedness in Social Bookmarking Systems , 2008, SEMWEB.

[7]  Yong Yu,et al.  Emergent Semantics from Folksonomies: A Quantitative Study , 2006, J. Data Semant..

[8]  Hector Garcia-Molina,et al.  Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems , 2006 .

[9]  Andreas Hotho,et al.  Semantic Network Analysis of Ontologies , 2006, LWA.

[10]  Steffen Staab,et al.  On How to Perform a Gold Standard Based Evaluation of Ontology Learning , 2006, SEMWEB.

[11]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[12]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[13]  Vittorio Loreto,et al.  Network properties of folksonomies , 2007, AI Commun..

[14]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[15]  Dominik Benz,et al.  Automatic Bookmark Classification: A Collaborative Approach , 2006 .

[16]  Sofia Angeletou Semantic Enrichment of Folksonomy Tagspaces , 2008, International Semantic Web Conference.

[17]  Tony Hammond,et al.  Social Bookmarking Tools (II): A Case Study - Connotea , 2005, D Lib Mag..

[18]  Valentin Robu,et al.  The Dynamics and Semantics of Collaborative Tagging , 2006, SAAW@ISWC.

[19]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[20]  Ciro Cattuto,et al.  Evaluating similarity measures for emergent semantics of social tagging , 2009, WWW '09.

[21]  Tony Hammond,et al.  Social Bookmarking Tools (I): A General Overview , 2005, D Lib Mag..

[22]  Lars Schmidt-Thieme,et al.  Folksonomy-Based Collabulary Learning , 2008, International Semantic Web Conference.

[23]  Grigory Begelman,et al.  Automated Tag Clustering: Improving search and exploration in the tag space , 2006 .

[24]  Vittorio Loreto,et al.  Semiotic dynamics and collaborative tagging , 2006, Proceedings of the National Academy of Sciences.

[25]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[26]  Marcel Ausloos,et al.  Contextualising tags in collaborative tagging systems , 2009, HT '09.

[27]  Dominik Benz,et al.  Evaluation Strategies for Learning Algorithms of Hierarchies , 2008, GfKl.

[28]  Nigel Shadbolt,et al.  Contextualising Tags in Collaborative Tagging Systems , 2009 .

[29]  Patrick Pantel,et al.  Document clustering with committees , 2002, SIGIR '02.

[30]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[31]  Dominik Benz,et al.  Position Paper: Ontology Learning from Folksonomies , 2007, LWA.