We present a method based on clustering techniques to detect possible/probable novel concepts or concept drift in a Description Logics knowledge base. The method exploits a semi-distance measure defined for individuals, that is based on a finite number of dimensions corresponding to a committee of discriminating features (concept descriptions). A maximally discriminating group of features is obtained with a randomized optimization method. In the algorithm, the possible clusterings are represented as medoids (w.r.t. the given metric) of variable length. The number of clusters is not required as a parameter, the method is able to find an optimal choice by means of evolutionary operators and a proper fitness function. An experimentation proves the feasibility of our method and its effectiveness in terms of clustering validity indices. With a supervised learning phase, each cluster can be assigned with a refined or newly constructed intensional definition expressed in the adopted language.
[1]
Martin Hepp,et al.
Products and Services Ontologies: A Methodology for Deriving OWL Ontologies from Industrial Categorization Standards
,
2006,
Int. J. Semantic Web Inf. Syst..
[2]
Ambjörn Naeve,et al.
The Human Semantic Web Shifting from Knowledge Push to Knowledge Pull
,
2005,
Int. J. Semantic Web Inf. Syst..
[3]
A. Sheth.
International Journal on Semantic Web & Information Systems
,
.
[4]
Dimitrios Skoutas,et al.
Ontology-Based Conceptual Design of ETL Processes for Both Structured and Semi-Structured Data
,
2007,
Int. J. Semantic Web Inf. Syst..