Ontology Learning Using Word Net Lexical Expansion and Text Mining

In knowledge management systems, ontologies play an important role as a backbone for providing and accessing knowledge sources. They are largely used in the next generation of the Semantic Web that focuses on supporting a better cooperation between humans and ma‐ chines [2]. Since manual ontology construction is costly, time-consuming, error-prone, and inflexible to change, it is hoped that an automated ontology learning process will result in more effective and more efficient ontology construction and also be able to create ontologies that better match a specific application [20]. Ontology learning has recently become a major focus for research whose goal is to facilitate the construction of ontologies by decreasing the amount of effort required to produce an ontology for a new domain. However, most current approaches deal with narrowly-defined specific tasks or a single part of the ontology learn‐ ing process rather than providing complete support to users. There are few studies that at‐ tempt to automate the entire ontology learning process from the collection of domainspecific literature and filtering out documents irrelevant to the domain, to text mining to build new ontologies or enrich existing ones.

[1]  Qiang Wang,et al.  Ontology-Based Focused Crawling , 2009, 2009 International Conference on Information, Process, and Knowledge Management.

[2]  Jennifer L. Leopold,et al.  An Anatomical Ontology for Amphibians , 2006, Pacific Symposium on Biocomputing.

[3]  Yiming Yang,et al.  Expert network: effective and efficient learning from human decisions in text categorization and retrieval , 1994, SIGIR '94.

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[6]  Paola Velardi,et al.  Using text processing techniques to automatically enrich a domain ontology , 2001, FOIS.

[7]  Anand Kumar,et al.  Text mining and ontologies in biomedicine: Making sense of raw text , 2005, Briefings Bioinform..

[8]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[9]  Mark Stevenson,et al.  Combining Disambiguation Techniques to Enrich an Ontology , 2002 .

[10]  Philipp Cimiano,et al.  Ontology Learning from Text: Methods, Evaluation and Applications , 2005 .

[11]  Ah-Hwee Tan,et al.  Text Mining: The state of the art and the challenges , 2000 .

[12]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[13]  Qiang Wang,et al.  An Ontology Learning Framework Using Focused Crawler and Text Mining , 2009 .

[14]  Hans-Peter Kriegel,et al.  Focused Web Crawling: A Generic Framework for Specifying the User Interest and for Adaptive Crawling Strategies , 2001 .

[15]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[16]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[17]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[18]  Yiming Yang,et al.  A scalability analysis of classifiers in text categorization , 2003, SIGIR.

[19]  Hiep Phuc Luong,et al.  Enriching concept descriptions in an amphibian ontology with vocabulary extracted from wordnet , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[20]  Euripides G. M. Petrakis,et al.  Semantic similarity methods in wordNet and their application to information retrieval on the web , 2005, WIDM '05.

[21]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[22]  Michael W. Berry,et al.  Survey of Text Mining , 2003, Springer New York.

[23]  Susan Gauch,et al.  Using Text Mining to Enrich the Vocabulary of Domain Ontologies , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[24]  Susan Gauch,et al.  KeyConcept : Un motor de búsqueda conceptual , 2006 .

[25]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[26]  Nuno Seco,et al.  Design, Implementation and Evaluation of a New Semantic Similarity Metric Combining Features and Intrinsic Information Content , 2008, OTM Conferences.

[27]  Mehrnoush Shamsfard,et al.  The state of the art in ontology learning: a framework for comparison , 2003, The Knowledge Engineering Review.

[28]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[29]  Martin Volk,et al.  Enriching an ontology with WordNet based on similarity measures , 2005 .

[30]  Aldo Gangemi,et al.  The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet , 2003, OTM.

[31]  Olatz Ansa,et al.  Enriching very large ontologies using the WWW , 2000, ECAI Workshop on Ontology Learning.

[32]  Susan T. Dumais,et al.  Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[33]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[34]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[35]  Borys Omelayenko,et al.  Learning of Ontologies from the Web: the Analysis of Existent Approaches , 2001, WebDyn@ICDT.

[36]  David M. W. Powers,et al.  Measuring Semantic Similarity in the Taxonomy of WordNet , 2005, ACSC.

[37]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[38]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[39]  Steffen Staab,et al.  Bootstrapping an Ontology-Based Information Extraction System , 2003, Intelligent Exploration of the Web.

[40]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[41]  Andreas Hotho,et al.  A Brief Survey of Text Mining , 2005, LDV Forum.

[42]  Paul Buitelaar,et al.  Lexical Enrichment of a Human Anatomy Ontology using WordNet , 2007 .