Conceptual Clustering Applied to Ontologies

A clustering method is presented which can be applied to semantically annotated resources in the context of ontological knowledge bases. This method can be used to discover emerging groupings of resources expressed in the standard ontology languages. The method exploits a language-independent semi-distance measure over the space of resources, that is based on their semantics w.r.t. a number of dimensions corresponding to a committee of discriminating features represented by concept descriptions. A maximally discriminating group of features can be constructed through a feature construction method based on genetic programming. The evolutionary clustering algorithm proposed is based on the notion of medoids applied to relational representations. It is able to induce a set of clusters by means of a fitness function based on a discernibility criterion. An experimentation with some ontologies proves the feasibility of our method.

[1]  Diego Calvanese,et al.  The Description Logic Handbook , 2007 .

[2]  C.-Y. Lee,et al.  Variable Length Genomes for Evolutionary Algorithms , 2000, GECCO.

[3]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[4]  Olfa Nasraoui,et al.  One Step Evolutionary Mining of Context Sensitive Associations and Web Navigation Patterns , 2002, SDM.

[5]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[6]  Shusaku Tsumoto,et al.  An indiscernibility-based clustering method , 2005, 2005 IEEE International Conference on Granular Computing.

[7]  Katharina Morik,et al.  A Polynomial Approach to the Constructive Induction of Structural Knowledge , 2004, Machine Learning.

[8]  Roberto Basili,et al.  AI*IA 2007: Artificial Intelligence and Human-Oriented Computing, 10th Congress of the Italian Association for Artificial Intelligence, Rome, Italy, September 10-13, 2007, Proceedings , 2007, AI*IA.

[9]  D. Fogel,et al.  Discovering patterns in spatial data using evolutionary programming , 1996 .

[10]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[11]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[12]  Nicola Fanizzi,et al.  A Hierarchical Clustering Procedure for Semantically Annotated Resources , 2007, AI*IA.

[13]  Ryszard S. Michalski,et al.  Conceptual Clustering of Structured Objects: A Goal-Oriented Approach , 1986, Artif. Intell..

[14]  Shan-Hwei Nienhuys-Cheng Distances and Limits on Herbrand Interpretations , 1998, ILP.

[15]  Dino Pedreschi,et al.  Machine Learning: ECML 2004 , 2004, Lecture Notes in Computer Science.

[16]  Pavel Zezula,et al.  Similarity Search - The Metric Space Approach , 2005, Advances in Database Systems.

[17]  Pavel Zezula,et al.  Similarity Search: The Metric Space Approach (Advances in Database Systems) , 2005 .

[18]  James C. Bezdek,et al.  Clustering with a genetically optimized approach , 1999, IEEE Trans. Evol. Comput..

[19]  Luigi Iannone,et al.  Concept Formation in Expressive Description Logics , 2004, ECML.

[20]  K. N. King 2006 IEEE International Conference on Granular Computing , 2006, IEEE Comput. Intell. Mag..

[21]  Alexander Borgida,et al.  Towards Measuring Similarity in Description Logics , 2005, Description Logics.

[22]  Michèle Sebag,et al.  Distance Induction in First Order Logic , 1997, ILP.

[23]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[24]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[25]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[26]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[27]  Nicola Fanizzi,et al.  Reasoning by Analogy in Description Logics Through Instance-based Learning , 2006, SWAP.

[28]  Boi Faltings,et al.  OSS: A Semantic Similarity Function based on Hierarchical Ontologies , 2007, IJCAI.

[29]  Jens Lehmann,et al.  A Refinement Operator Based Learning Algorithm for the ALC Description Logic , 2007, ILP.

[30]  Luigi Iannone,et al.  An algorithm based on counterfactuals for concept learning in the Semantic Web , 2005, Applied Intelligence.

[31]  Nicola Fanizzi,et al.  Induction of Optimal Semi-distances for Individuals based on Feature Sets , 2007, Description Logics.