OntoPlus: Text-driven ontology extension using ontology content, structure and co-occurrence information

This paper addresses the process of semi-automatic text-driven ontology extension using ontology content, structure and co-occurrence information. A novel OntoPlus methodology is proposed for semi-automatic ontology extension based on text mining methods. It allows for the effective extension of the large ontologies, providing a ranked list of potentially relevant concepts and relationships given a new concept (e.g., glossary term) to be inserted in the ontology. A number of experiments are conducted, evaluating measures for ranking correspondence between existing ontology concepts and new domain concepts suggested for the ontology extension. Measures for ranking are based on incorporating ontology content, structure and co-occurrence information. The experiments are performed using a well known Cyc ontology and textual material from two domains – finances and, fisheries & aquaculture. Our experiments show that the best results are achieved by combining content, structure and co-occurrence information. Furthermore, ontology content and structure seem to be more important than co-occurrence for our data in the financial domain. At the same time, ontology content and co-occurrence seem to have higher importance for our fisheries & aquaculture domain.

[1]  David Sánchez,et al.  Ontology-driven web-based semantic similarity , 2010, Journal of Intelligent Information Systems.

[2]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[3]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[4]  David Baxter,et al.  On the Application of the Cyc Ontology to Word Sense Disambiguation , 2006, FLAIRS.

[5]  James A. Hendler,et al.  Dynamic Ontologies on the Web , 2000, AAAI/IAAI.

[6]  T. Enyakong EXTENDING ONTOLOGY TREE USING NLP TECHNIQUE , 2001 .

[7]  Peter Spyns,et al.  Unsupervised Text Mining for the Learning of DOGMA-inspired Ontologies , 2005 .

[8]  R. Studer,et al.  Semantic Web Technologies: Trends and Research in Ontology-based Systems , 2006 .

[9]  Dunja Mladenic,et al.  Knowledge Discovery for Ontology Construction , 2006 .

[10]  Steffen Staab,et al.  Learning Taxonomic Relations from Heterogeneous Evidence , 2004 .

[11]  Olena Medelyan,et al.  Integrating Cyc and Wikipedia: Folksonomy meets rigorously defined common-sense , 2008, AAAI 2008.

[12]  Dunja Mladenic,et al.  OntoGen: Semi-automatic Ontology Editor , 2007, HCI.

[13]  Johanna Völker,et al.  A Framework for Ontology Learning and Data-driven Change Discovery , 2005 .

[14]  Steffen Staab,et al.  Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis , 2005, J. Artif. Intell. Res..

[15]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[16]  Carole D. Hafner,et al.  The State of the Art in Ontology Design: A Survey and Comparative Review , 1997, AI Mag..

[17]  Jin Liu,et al.  Semi-Automatic Ontology Engineering in Business Applications , 2008, GI Jahrestagung.

[18]  Michael J. Witbrock,et al.  Automated Population of Cyc: Extracting Information about Named-entities from the Web , 2006, FLAIRS.

[19]  Steffen Staab,et al.  Discovering Conceptual Relations from Text , 2000, ECAI.

[20]  Udo Hahn,et al.  Towards Text Knowledge Engineering , 1998, AAAI/IAAI.

[21]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[22]  David Baxter,et al.  On the Effective Use of Cyc in a Question Answering System , 2005 .

[23]  Olatz Ansa,et al.  Enriching very large ontologies using the WWW , 2000, ECAI Workshop on Ontology Learning.

[24]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[25]  Peter Wagner,et al.  An Interactive Dialogue System for Knowledge Acquisition in Cyc , 2003, IJCAI 2003.

[26]  Christian Wolff,et al.  Learning Relations Using Collocations , 2001, Workshop on Ontology Learning.

[27]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[28]  Dunja Mladenic,et al.  Entity Resolution in Texts Using Statistical Learning and Ontologies , 2009, ASWC.

[29]  John Kinsella,et al.  Ontology-Driven Hypothesis Generation to Explain Anomalous Patient Responses to Treatment , 2009, SGAI Conf..

[30]  Thomas R. Gruber,et al.  A Translation Approach to Portable Ontologies , 1993 .

[31]  Balakrishnan Chandrasekaran,et al.  What are ontologies, and why do we need them? , 1999, IEEE Intell. Syst..

[32]  Steffen Staab,et al.  Strategies for the Evaluation of Ontology Learning , 2008, Ontology Learning and Population.

[33]  Olena Medelyan,et al.  "All You Can Eat" Ontology-Building: Feeding Wikipedia to Cyc , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[34]  Rubén Prieto Díaz A Faceted Approach to Building Ontologies. , 2003 .

[35]  Wim Peters,et al.  SPRAT : a tool for automatic semantic pattern-based ontology population , 2009 .

[36]  Philipp Cimiano,et al.  Ontology Learning from Text: Methods, Evaluation and Applications , 2005 .