Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes

Hierarchical or nested annotations of linguistic data often co-exist with simpler non-hierarchical or flat counterparts, a classic example being the annotations used for parsing and chunking. In this work, we propose a general strategy for comparing across these two annotation schemes using the concept of entailment, which formalizes a correspondence between them. We use crowdsourcing to obtain chunking annotations for queries and sentences and show that entailment not only serves as an effective evaluation metric for assessing annotation quality, but can also be employed to filter out noisy annotations.
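To make the correspondence concrete, the sketch below illustrates one plausible reading of entailment between the two schemes; the abstract does not spell out the paper's formal definition, so the criterion used here (every flat chunk span coincides with some constituent span of the hierarchical annotation) and all names in the code are assumptions for illustration only.

```python
# Illustrative sketch only: assumes a flat chunking is entailed by a
# hierarchical bracketing if every flat chunk span coincides with some
# constituent span in the hierarchy. All identifiers are hypothetical.
from typing import List, Tuple

Span = Tuple[int, int]  # (start, end) token offsets, end exclusive


def hierarchy_spans(tree: object, start: int = 0) -> Tuple[List[Span], int]:
    """Collect constituent spans from a nested annotation.

    A tree is either a string token (a leaf) or a list of subtrees.
    Returns the spans of all constituents plus the index after the tree.
    """
    if isinstance(tree, str):            # leaf: a single token
        return [(start, start + 1)], start + 1
    spans: List[Span] = []
    pos = start
    for child in tree:
        child_spans, pos = hierarchy_spans(child, pos)
        spans.extend(child_spans)
    spans.append((start, pos))           # span covered by this constituent
    return spans, pos


def is_entailed(flat_chunks: List[Span], tree: object) -> bool:
    """Check whether every flat chunk span matches a constituent span."""
    spans, _ = hierarchy_spans(tree)
    return all(chunk in spans for chunk in flat_chunks)


if __name__ == "__main__":
    # Hypothetical query "cheap flights to new york" with nested annotation
    # [[cheap flights] [to [new york]]] and flat chunking
    # (cheap flights) (to new york).
    tree = [["cheap", "flights"], ["to", ["new", "york"]]]
    flat = [(0, 2), (2, 5)]
    print(is_entailed(flat, tree))       # True under the assumed definition
```

Under this reading, a flat annotation that crosses constituent boundaries of the hierarchical one would fail the check, which is the intuition behind using entailment both as an evaluation metric and as a filter for noisy crowdsourced annotations.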