Measuring Data-Driven Ontology Changes using Text Mining

Most current ontology management systems concentrate on detecting usage-driven changes and representing changes formally in order to maintain the consistency. In this paper, we present a semi-automatic approach for measuring and visualising data-driven changes through ontology learning. Terms are first generated using text mining techniques using an ontology learning module, and then classified automatically into clusters. The clusters are then manually named, which is the only manual process in this system. Each cluster is considered as a sub-concept of the root concept, and thus one dimension of the feature space describing the root concept. The changes of terms in each cluster contributes to the change of the root concept. Using our system, Web documents are collected at different time periods and fed into the system to generate different versions of the same ontology for each time period. The paper presents several ways of visualising and analysing the changes. Initial experiments on online media data have demonstrated the promising capabilities of our system.

[1]  Mohammed Bennamoun,et al.  Determining Termhood for Learning Domain Ontologies in a Probabilistic Framework , 2007, AusDM.

[2]  Marc Ehrig,et al.  What is Ontology Merging? - A Category-Theoretical Perspective using Pushouts , 2005, AAAI 2005.

[3]  Jos de Bruijn,et al.  D4.2.1 State-of-the-art survey on Ontology Merging and Aligning V1 , 2004 .

[4]  Grigoris Antoniou,et al.  Evolving Ontology Evolution , 2006, SOFSEM.

[5]  Peter Haase,et al.  Eu-ist Integrated Project (ip) Ist-2003-506826 Sekt D3.1.1 Ontology Management and Evolution – Survey, Methods and Prototypes Sekt Consortium , 2004 .

[6]  Sophia Ananiadou,et al.  Identifying Terms by their Family and Friends , 2000, COLING.

[7]  Mark A. Musen,et al.  A Framework for Ontology Evolution in Collaborative Environments , 2006, SEMWEB.

[8]  Yannis Kalfoglou,et al.  Ontology mapping: the state of the art , 2003, The Knowledge Engineering Review.

[9]  Michel C. A. Klein,et al.  Change Management for Distributed Ontologies , 2004 .

[10]  Mohammed Bennamoun,et al.  Determining the Unithood of Word Sequences using Mutual Information and Independence Measure , 2008, ArXiv.

[11]  Paul M. B. Vitányi,et al.  Automatic Extraction of Meaning from the Web , 2006, 2006 IEEE International Symposium on Information Theory.

[12]  Mohammed Bennamoun,et al.  Featureless similarities for terms clustering using tree-traversing ants , 2006, PCAR '06.

[13]  York Sure-Vetter,et al.  Ontology Mapping - An Integrated Approach , 2004, ESWS.

[14]  Ljiljana Stojanovic,et al.  Methods and tools for ontology evolution , 2004 .

[15]  York Sure,et al.  D3.1.2 Incremental Ontology Evolution - Evaluation , 2006 .

[16]  Roberto Basili,et al.  A Contrastive Approach to Term Extraction , 2001 .

[17]  Marc Ehrig,et al.  State of the art on ontology alignment , 2013 .

[18]  Sophia Ananiadou,et al.  Term extraction using a similarity-based approach , 2001 .

[19]  Roberto Basili,et al.  Modelling the syntactic contextual information for term extraction , 2001 .

[20]  Mohammed Bennamoun,et al.  Tree-Traversing Ant Algorithm for term clustering based on featureless similarities , 2007, Data Mining and Knowledge Discovery.

[21]  G. Flouris,et al.  On belief change and ontology evolution , 2006 .

[22]  Olga De Troyer,et al.  Understanding ontology evolution: A change detection approach , 2007, J. Web Semant..

[23]  Elizabeth Chang,et al.  Semi-Automatic Ontology Extension Using Spreading Activation , 2005 .

[24]  Helena Sofia Pinto,et al.  Ontology Integration: How to perform the Process , 2001, OIS@IJCAI.

[25]  Steffen Staab,et al.  04391 Executive Summary -- Semantic Interoperability and Integration , 2005 .