Augmenting Lightweight Domain Ontologies with Social Evidence Sources

Recent research shows the potential of utilizing data collected through Web 2.0 applications to capture changes in a domain's terminology. This paper presents an approach to augment corpus-based ontology learning by considering terms from collaborative tagging systems, social networking platforms, and micro-blogging services. The proposed framework collects information on the domain's terminology from domain documents and a seed ontology in a triple store. Data from social sources such as Delicious, Flickr, Technorati and Twitter provide an outside view of the domain and help incorporate external knowledge into the ontology learning process. The neural network technique of spreading activation is used to identify relevant new concepts, and to determine their positions in the extended ontology. Evaluating the method with two measures (PMI and expert judgements) demonstrates the significant benefits of social evidence sources for ontology learning.

[1]  Arno Scharl,et al.  Capturing and Classifying Ontology Evolution in News Media Archives , 2008, 2008 19th International Workshop on Database and Expert Systems Applications.

[2]  Wendy Hall,et al.  Building a Pragmatic Semantic Web , 2008, IEEE Intelligent Systems.

[3]  James A. Hendler,et al.  Web 3.0 Emerging , 2009, Computer.

[4]  Claudio Giuliano,et al.  Relation extraction and the influence of automatic named-entity recognition , 2007, TSLP.

[5]  Harith Alani,et al.  Social Support for Ontological Mediation and Data Integration , 2009, Int. J. Virtual Communities Soc. Netw..

[6]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[7]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[8]  Enrico Motta,et al.  Integrating Folksonomies with the Semantic Web , 2007, ESWC.

[9]  A. Weichselbraun,et al.  Web Content Mining for Comparing Corporate and Third-Party Online Reporting : A Case Study on Solid Waste Management , 2009 .

[10]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[11]  Ciro Cattuto,et al.  Semantic Grounding of Tag Relatedness in Social Bookmarking Systems , 2008, SEMWEB.

[12]  P. Schmitz,et al.  Inducing Ontology from Flickr Tags , 2006 .

[13]  David Sánchez,et al.  Learning non-taxonomic relationships from web documents for domain ontology construction , 2008, Data Knowl. Eng..

[14]  Enrico Motta,et al.  Toward a New Generation of Semantic Web Applications , 2008, IEEE Intelligent Systems.

[15]  Elizabeth Chang,et al.  Semi-Automatic Ontology Extension Using Spreading Activation , 2005 .

[16]  Arno Scharl,et al.  Analyzing News Media Coverage to Acquire and Structure Tourism Knowledge , 2008, J. Inf. Technol. Tour..

[17]  Steffen Staab,et al.  Ontology Learning Part One - On Discoverying Taxonomic Relations from the Web , 2002 .

[18]  Hector Garcia-Molina,et al.  Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems , 2006 .

[19]  Enrico Motta,et al.  Bridging the gap between folksonomies and the semantic web: an experience report , 2007 .