Predicting Category Additions in a Topic Hierarchy

This paper discusses the problem of predicting the structural changes in an ontology. It addresses ontologies that contain instances in addition to concepts. The focus is on an ontology where the instances are textual documents, but the approach presented in this document is general enough to also work with other kinds of instances, as long as a similarity measure can be defined over them. We examine the changes in the Open Directory Project ontology of Web pages over a period of several years and analyze the most common types of structural changes that took place during that time. We then present an approach for predicting one of the more common types of structural changes, namely the addition of a new concept that becomes the subconcept of an existing parent concept and adopts a few instances of this existing parent concept. We describe how this task can be formulated as a machine-learning problem and present an experimental evaluation of this approach that shows promising results of the proposed approach.

[1]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[2]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[3]  David D. Lewis,et al.  Representation and Learning in Information Retrieval , 1991 .

[4]  Johanna Völker,et al.  A Framework for Ontology Learning and Data-driven Change Discovery , 2005 .

[5]  Frank van Harmelen,et al.  A Framework for Handling Inconsistency in Changing Ontologies , 2005, SEMWEB.

[6]  Dunja Mladenic,et al.  Simple classification into large topic ontology of web documents , 2005, 27th International Conference on Information Technology Interfaces, 2005..

[7]  Ljiljana Stojanovic,et al.  Methods and tools for ontology evolution , 2004 .

[8]  Boris Motik,et al.  Ontologies for Enterprise Knowledge Management , 2003, IEEE Intell. Syst..

[9]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[10]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[11]  Ting Su,et al.  In search of deterministic methods for initializing K-means and Gaussian mixture clustering , 2007, Intell. Data Anal..

[12]  Peter Haase,et al.  Management of dynamic knowledge , 2005, J. Knowl. Manag..

[13]  Grigoris Antoniou,et al.  A Classification of Ontology Change , 2006, SWAP.

[14]  Ljiljana Stojanovic,et al.  Consistent Evolution of OWL Ontologies , 2005, ESWC.

[15]  David D. Lewis,et al.  Learning in Intelligent Information Retrieval , 1991, ML.