Metric-based ontology learning

Ontology learning is an important task in Artificial Intelligence, the Semantic Web, and Text Mining. This paper presents a novel framework for, and solutions to, three practical problems in ontology learning. An incremental clustering approach addresses the problem of unknown group names. Models learned separately at each level of an ontology address the lack of control over concept abstractness. A metric learning module moves beyond the traditional use of features by incorporating heterogeneous semantic evidence into the learning process. The metric-based learning framework integrates these components into a single, unified solution. An extensive evaluation on WordNet and Open Directory Project data shows that the method is more effective than a state-of-the-art baseline algorithm.
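As a rough illustration of how incremental clustering and a feature-combining metric might fit together, the sketch below inserts terms one at a time under the closest existing node, where "closest" is a weighted sum of several evidence functions. It is not the paper's algorithm: the two toy features (`head_mismatch`, `char_distance`), the fixed weights, and the function names are all assumptions made for illustration, and in a real system the weights would be learned from training ontologies rather than set by hand.

```python
from typing import Callable, Dict, Optional, Sequence

# A feature maps a pair of terms to a non-negative dissimilarity score.
Feature = Callable[[str, str], float]


def head_mismatch(a: str, b: str) -> float:
    """Toy lexical evidence: 0.0 if both terms share the same last word."""
    return 0.0 if a.split()[-1] == b.split()[-1] else 1.0


def char_distance(a: str, b: str) -> float:
    """Toy surface evidence: Jaccard distance between character sets."""
    sa, sb = set(a), set(b)
    return 1.0 - len(sa & sb) / len(sa | sb)


def weighted_distance(a: str, b: str,
                      features: Sequence[Feature],
                      weights: Sequence[float]) -> float:
    """Combine heterogeneous evidence into one score (weights are assumed)."""
    return sum(w * f(a, b) for f, w in zip(features, weights))


def insert_incrementally(terms: Sequence[str], root: str,
                         features: Sequence[Feature],
                         weights: Sequence[float]) -> Dict[str, Optional[str]]:
    """Attach each term, in order, under the nearest node already in the tree."""
    parent: Dict[str, Optional[str]] = {root: None}
    for term in terms:
        nearest = min(
            parent,
            key=lambda node: weighted_distance(term, node, features, weights),
        )
        parent[term] = nearest
    return parent


taxonomy = insert_incrementally(
    terms=["car", "sports car", "race car"],
    root="vehicle",
    features=[head_mismatch, char_distance],
    weights=[1.0, 1.0],  # hypothetical; a metric learning step would fit these
)
for child, par in taxonomy.items():
    print(f"{child} -> {par}")
```

Because terms are attached to existing terms rather than to abstract cluster nodes, no group has to be named after the fact, which is one way to read the abstract's "unknown group names" problem. Fitting a separate weight vector per tree depth would be one way to approximate the per-level models the abstract mentions.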
