论文信息 - MOTIF-RE: Motif-Based Hypernym/Hyponym Relation Extraction from Wikipedia Links

MOTIF-RE: Motif-Based Hypernym/Hyponym Relation Extraction from Wikipedia Links

Hypernym/hyponym relation extraction plays an essential role in taxonomy learning. The conventional methods based on lexico-syntactic patterns or machine learning usually make use of content-related features. In this paper, we find that the proportions of hyperlinks with different semantic type vary markedly in different network motifs. Based on this observation, we propose MOTIF-RE, an algorithm of extracting hypernym/hyponym relation from Wikipedia hyperlinks. The extraction process consists of three steps: 1) Build a directed graph from a set of domain-specific Wikipedia articles. 2) Count the occurrences of hyperlinks in every three-node network motif and create a feature vector for every hyperlink. 3) Train a classifier to identify semantic relation of hyperlinks. We created three domain-specific Wikipedia article sets to test MOTIF-RE. Experiments on individual dataset show that MOTIF-RE outperforms the baseline algorithm by about 30% in terms of F1-measure. Cross-domain experimental results show similar, which proves that MOTIF-RE has fairly good domain adaptation ability.

[1] S. Shen-Orr,et al. Network motifs: simple building blocks of complex networks. , 2002, Science.

[2] Paola Velardi,et al. Learning Word-Class Lattices for Definition and Hypernym Extraction , 2010, ACL.

[3] Simone Paolo Ponzetto,et al. Deriving a Large-Scale Taxonomy from Wikipedia , 2007, AAAI.

[4] W. Bruce Croft,et al. Deriving concept hierarchies from text , 1999, SIGIR '99.

[5] Qiang Yang,et al. Boosting for transfer learning , 2007, ICML '07.

[6] Haixun Wang,et al. Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[7] Andrew McCallum,et al. Piecewise pseudolikelihood for efficient training of conditional random fields , 2007, ICML '07.

[8] Daniel Jurafsky,et al. Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[9] Ralph Grishman,et al. Discovering Relations among Named Entities from Large Corpora , 2004, ACL.

[10] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.