Experiments for the Extraction of Qualia from Web in Italian: a Pattern-Based Approach
暂无分享,去创建一个
This paper address the problem of automatically extracting qualia structures from the web via patterns in Italian. Qualia are treated as semantic relations, expressed by word patterns, between two entities. 400 examples of italian patterns expressing qualia are sampled and manually annotated in order to study the behaviour of the patterns and their suitability for qualia extraction. Features in the annotation schema include the position of the related entites with respect to the patterns, their informativeness and the minimum context window necessary to catch them. The results of this study are useful for designing experiments with wordspace models.
[1] James Pustejovsky,et al. The Generative Lexicon , 1995, CL.
[2] Philipp Cimiano,et al. Automatically Learning Qualia Structures from the Web , 2005, ACL 2005.
[3] Marco Baroni,et al. Building general- and special-purpose corpora by Web crawling , 2006 .
[4] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.