Text Mining Techniques to Automatically Enrich a Domain Ontology

Though the utility of domain ontologies is now widely acknowledged in the IT (Information Technology) community, several barriers must be overcome before ontologies become practical and useful tools. A critical issue is the ontology construction, i.e., the task of identifying, defining, and entering the concept definitions. In case of large and complex application domains this task can be lengthy, costly, and controversial (since different persons may have different points of view about the same concept). To reduce time, cost (and, sometimes, harsh discussions) it is highly advisable to refer, in constructing or updating an ontology, to the documents available in the field. Text mining tools may be of great help in this task. The work presented in this paper illustrates the guidelines of SymOntos, ontology management system, and the text mining approach adopted herein to support ontology building. The latter operates by extracting, from the related literature, the prominent domain concepts and the semantic relations among them.

[1]  Paola Velardi,et al.  Semantic tagging of unknown proper nouns , 1999, Nat. Lang. Eng..

[2]  Yorick Wilks,et al.  Book Reviews: Electric Words: Dictionaries, Computers, and Meanings , 1996, CL.

[3]  Eric Brill,et al.  A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[4]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[5]  Stuart C. Shapiro Review of Knowledge representation: logical, philosophical, and computational foundations by John F. Sowa. Brooks/Cole 2000. , 2001 .

[6]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[8]  Roberto Basili,et al.  An Empirical Symbolic Approach to Natural Language Processing , 1996, Artif. Intell..

[9]  Robert H. Gregory Document processing , 1955, AIEE-IRE '55 (Eastern).

[10]  Olatz Ansa,et al.  Enriching very large ontologies using the WWW , 2000, ECAI Workshop on Ontology Learning.

[11]  Setrag Khoshafian Object orientation , 1990 .

[12]  Andreas Wagner,et al.  Enriching a lexical semantic net with selectional preferences by means of statistical corpus analysis , 2000, ECAI Workshop on Ontology Learning.

[13]  강승식,et al.  [서평]「Electric Words : Dictionaries, Computers and Meanings」 , 1997 .

[14]  Steffen Staab,et al.  Learning Ontologies for the Semantic Web , 2001 .

[15]  Ronald J. Brachman,et al.  ON THE EPISTEMOLOGICAL STATUS OF SEMANTIC NETWORKS , 1979 .

[16]  Roberto Basili,et al.  Identification of Relevant Terms to Support the Construction of Domain Ontologies , 2001, HTLKM@ACL.

[17]  Béatrice Daille,et al.  Study and Implementation of Combined Techniques for Automatic Extraction of Terminology , 1994 .

[18]  Roberto Basili,et al.  Customizable Modular Lexicalized Parsing , 2000, IWPT.

[19]  Setrag Khoshafian,et al.  Object orientation: concepts, languages, databases, user interfaces , 1990 .

[20]  Piek Vossen,et al.  Extending, trimming and fusing WordNet for technical documents , 2001 .

[21]  Emmanuel Morin Projecting Corpus-Based Semantic Links on a Thesaurus , 1999, ACL.

[22]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[23]  Paola Velardi,et al.  Automatic adaptation of proper noun dictionaries through cooperation of machine learning and probabilistic methods , 2000, SIGIR '00.

[24]  Roberto Basili,et al.  Inducing Terminology for Lexical Acquisition , 1997, EMNLP.