论文信息 - Feature selection for automatic taxonomy induction

Feature selection for automatic taxonomy induction

Most existing automatic taxonomy induction systems exploit one or more features to induce a taxonomy; nevertheless there is no systematic study examining which are the best features for the task under various conditions. This paper studies the impact of using different features on taxonomy induction for different types of relations and for terms at different abstraction levels. The evaluation shows that different conditions need different technologies or different combination of the technologies. In particular, co-occurrence and lexico-syntactic patterns are good features for is-a, sibling and part-of relations; contextual, co-occurrence, patterns, and syntactic features work well for concrete terms; co-occurrence works well for abstract terms.

Grace Hui Yang | James P. Callan

[1] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2] Philipp Cimiano,et al. Automatic Acquisition of Ranked Qualia Structures from the Web , 2007, ACL.

[3] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[4] Dan I. Moldovan,et al. Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations , 2003, NAACL.

[5] J. Katz,et al. The philosophy of linguistics , 1989 .

[6] Zellig S. Harris,et al. Distributional Structure , 1954 .

[7] Patrick Pantel,et al. Discovering word senses from text , 2002, KDD.

[8] Grace Hui Yang,et al. A Metric-based Framework for Automatic Taxonomy Induction , 2009, ACL.