Taxonomy Learning using Term Specificity and Similarity

Learning taxonomy for technical terms is difficult and tedious task, especially when new terms should be included. The goal of this paper is to assign taxonomic relations among technical terms. We propose new approach to the problem that relies on term specificity and similarity measures. Term specificity and similarity are necessary conditions for taxonomy learning, because highly specific terms tend to locate in deep levels and semantically similar terms are close to each other in taxonomy. We analyzed various features used in previous researches in view of term specificity and similarity, and applied optimal features for term specificity and similarity to our method.

[1]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[2]  Hitoshi Isahara,et al.  Extraction of Hierarchies Based on Inclusion of Co-occurring Words with Frequency Information , 2005, IJCAI.

[3]  Kellyn Kruger,et al.  Fully Automatic Acquisition of Taxonomic Knowledge from Large Corpora of Texts: Limited Syntax Knowledge Representation System Based on Natural Language , 2000, ISMIS.

[4]  Steffen Staab,et al.  Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis , 2005, J. Artif. Intell. Res..

[5]  Farid Cerbah,et al.  Exogeneous and Endogeneous Approaches to Semantic Categorization of Unknown Technical Terms , 2000, COLING.

[6]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[7]  Akiko Aizawa,et al.  An information-theoretic perspective of tf-idf measures , 2003, Inf. Process. Manag..

[8]  Yiyu Yao,et al.  An Information-Theoretic Measure of Term Specificity , 1992, J. Am. Soc. Inf. Sci..

[9]  Martin Chodorow,et al.  Combining local context and wordnet similarity for word sense identification , 1998 .

[10]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[11]  Eugene Charniak,et al.  Determining the specificity of nouns from text , 1999, EMNLP.

[12]  Key-Sun Choi,et al.  An Information-Theoretic Approach to Taxonomy Extraction for Ontology Learning , 2005 .

[13]  Olivier Bodenreider,et al.  Aspects of the taxonomic relation in the biomedical domain , 2001, FOIS.

[14]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.