Ontology Learning from Text: A Survey of Methods

After the vision of the Semantic Web was broadcasted at the turn of the millennium, ontology became a synonym for the solution to many problems concerning the fact that computers do not understand human language: if there were an ontology and every document were marked up with it and we had agents that would understand the markup, then computers would finally be able to process our queries in a really sophisticated way. Some years later, the success of Google shows us that the vision has not come true, being hampered by the incredible amount of extra work required for the intellectual encoding of semantic mark-up – as compared to simply uploading an HTML page. To alleviate this acquisition bottleneck, the field of ontology learning has since emerged as an important sub-field of ontology engineering. It is widely accepted that ontologies can facilitate text understanding and automatic processing of textual resources. Moving from words to concepts not only mitigates data sparseness issues, but also promises appealing solutions to polysemy and homonymy by finding non-ambiguous concepts that may map to various realizations in – possibly ambiguous – words. Numerous applications using lexical-semantic databases like WordNet (Miller, 1990) and its non-English counterparts, e.g. EuroWordNet (Vossen, 1997) or CoreNet (Choi and Bae, 2004) demonstrate the utility of semantic resources for natural language processing. Learning semantic resources from text instead of manually creating them might be dangerous in terms of correctness, but has undeniable advantages: Creating resources for text processing from the texts to be processed will fit the semantic component neatly and directly to them, which will never be possible with general-purpose resources. Further, the cost per entry is greatly reduced, giving rise to much larger resources than an advocate of a manual approach could ever afford. On the other hand, none of the methods used today are good enough for creating semantic resources of any kind in a completely unsupervised fashion, albeit automatic methods can facilitate manual construction to a large extent. The term ontology is understood in a variety of ways and has been used in philosophy for many centuries. In contrast, the notion of ontology in the field of computer science is younger – but almost used as inconsistently, when it comes to the details of the definition. The intention of this essay is to give an overview of different methods that learn ontologies or ontology-like structures from unstructured text. Ontology learning from other sources, issues in description languages, ontology editors, ontology merging and ontology evolving transcend the scope of this article. Surveys on ontology learning from text and other sources can be found in Ding and Foo (2002) and Gomez-Perez

[1]  Gerhard Paass,et al.  Learning Prototype Ontologies by Hierachical Latent Semantic Analysis , 2004, LWA.

[2]  F. Dornseiff,et al.  Der deutsche Wortschatz nach Sachgruppen , 2020 .

[3]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[4]  Carlo Strapparava,et al.  Domain Kernels for Word Sense Disambiguation , 2005, ACL.

[5]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[6]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[7]  Adam Kilgarriff,et al.  SENSEVAL: an exercise in evaluating world sense disambiguation programs , 1998, LREC.

[8]  David Sánchez,et al.  Web-scale taxonomy learning , 2005 .

[9]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[10]  George W. Davidson,et al.  Roget's Thesaurus of English Words and Phrases , 1982 .

[11]  Christian Biemann,et al.  Semiautomatic Extension of CoreNet using a Bootstrapping Mechanism on Corpus-based Co-occurrences , 2004, COLING.

[12]  Adam Pease,et al.  IEEE standard upper ontology: a progress report , 2002, The Knowledge Engineering Review.

[13]  Marius Pasca,et al.  Finding Instance Names and Alternative Glosses on the Web: WordNet Reloaded , 2005, CICLing.

[14]  Hans Friedrich Witschel,et al.  Using Decision Trees and Text Mining Techniques for Extending Taxonomies , 2005 .

[15]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[16]  Suresh Manandhar,et al.  Extending a Lexical Ontology by a Combination of Distributional Semantics Signatures , 2002, EKAW.

[17]  Burghard B. Rieger Feasible Fuzzy Semantics On Some Problems of How to Handle Word Meaning Empirically , 1981 .

[18]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[19]  Gilles Bisson,et al.  Designing Clustering Methods for Ontology Building - The Mo'K Workbench , 2000, ECAI Workshop on Ontology Learning.

[20]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[21]  Andreas Wagner,et al.  Enriching a lexical semantic net with selectional preferences by means of statistical corpus analysis , 2000, ECAI Workshop on Ontology Learning.

[22]  Piek Vossen,et al.  EuroWordNet: a multilingual database for information retrieval , 1997 .

[23]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[24]  Steffen Staab,et al.  Learning by googling , 2004, SKDD.

[25]  W. Quine Ontological Relativity and Other Essays , 1969 .

[26]  Walter Pagel,et al.  Historisches Wörterbuch der Philosophie , 1981, Medical History.

[27]  Hermann Helbig Die semantische Struktur natürlicher Sprache , 2001 .

[28]  B. Hammond Ontology , 2004, Lawrence Booth’s Book of Visions.

[29]  Nancy Chinchor,et al.  Overview of MUC-7/MET-2 , 1998 .

[30]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[31]  Sofia Stamou,et al.  Retrieval Efficiency of Normalized Query Expansion , 2005, CICLing.

[32]  Yorick Wilks,et al.  The ontology: Chimaera or Pegasus , 2005 .

[33]  Steffen Staab,et al.  Comparing Conceptual, Divise and Agglomerative Clustering for Learning Taxonomies from Text , 2004, ECAI.

[34]  Dominic Widdows,et al.  Unsupervised methods for developing taxonomies by combining syntactic and statistical information , 2003, NAACL.

[35]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993 .

[36]  L. Floridi Blackwell Guide to the Philosophy of Computing and Information , 2003 .

[37]  Zellig S. Harris,et al.  Mathematical structures of language , 1968, Interscience tracts in pure and applied mathematics.

[38]  Sven Hartrumpf,et al.  University of Hagen at CLEF 2004: Indexing and Translating Concepts for the GIRT Task , 2004, CLEF.

[39]  Steffen Staab,et al.  Ontology Learning , 2004, Encyclopedia of Machine Learning and Data Mining.

[40]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[41]  Christian Biemann,et al.  Automatic Acquisition of Paradigmatic Relations Using Iterated Co-occurrences , 2004, LREC.

[42]  Frank Keller,et al.  Using the Web to Overcome Data Sparseness , 2002, EMNLP.

[43]  David Yarowsky,et al.  A method for disambiguating word senses in a large corpus , 1992, Comput. Humanit..

[44]  Mark A. Bedau,et al.  Blackwell Guide to the Philosophy of Computing and Information , 2003 .

[45]  Yorick Wilks,et al.  Data Driven Ontology Evaluation , 2004, LREC.

[46]  Eduard H. Hovy,et al.  Fine Grained Classification of Named Entities , 2002, COLING.

[47]  Brian Roark,et al.  Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction , 2000, COLING.

[48]  Steffen Staab,et al.  Learning Concept Hierarchies from Text with a Guided Agglomerative Clustering Algorithm , 2005, ICML 2005.

[49]  Eduard Hovy,et al.  Towards terascale knowledge acquisition , 2004, COLING 2004.

[50]  Veronique Hoste,et al.  Optimization issues in machine learning of coreference resolution , 2005 .

[51]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[52]  Band , 1943 .

[53]  李幼升,et al.  Ph , 1989 .

[54]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[55]  Kalina Bontcheva,et al.  Evolving GATE to meet new challenges in language engineering , 2004, Natural Language Engineering.

[56]  Ellen Riloff,et al.  A Corpus-Based Approach for Building Semantic Lexicons , 1997, EMNLP.

[57]  Umberto Eco,et al.  A theory of semiotics , 1976, Advances in semiotics.

[58]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[59]  Ellen Riloff,et al.  A Bootstrapping Method for Learning Semantic Lexicons using Extraction Pattern Contexts , 2002, EMNLP.

[60]  Olatz Ansa,et al.  Enriching very large ontologies using the WWW , 2000, ECAI Workshop on Ontology Learning.

[61]  Reinhard Rapp,et al.  The Computation of Word Associations: Comparing Syntagmatic and Paradigmatic Approaches , 2002, COLING.

[62]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[63]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[64]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[65]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[66]  Schubert Foo,et al.  Ontology research and development. Part 1 - a review of ontology generation , 2002, J. Inf. Sci..

[67]  Steffen Staab,et al.  GETESS - Searching the Web Exploiting German Texts , 1999, CIA.

[68]  Borys Omelayenko,et al.  Learning of Ontologies from the Web: the Analysis of Existent Approaches , 2001, WebDyn@ICDT.

[69]  Rainer Osswald,et al.  Automatische Erweiterung eines seman-tikbasierten Lexikons durch Bootstrapping auf gro?en Korpora , 2005 .

[70]  Ido Dagan,et al.  Similarity-Based Estimation of Word Cooccurrence Probabilities , 1994, ACL.

[71]  Hee-Sook Bae,et al.  Procedures and Problems in Korean-Chinese-Japanese Wordnet with Shared Semantic Hierarchy , 2004 .

[72]  F. D. Saussure Cours de linguistique générale , 1924 .

[73]  David Faure,et al.  ASIUM: Learning subcategorization frames and restrictions of se-18 lection , 1998 .