论文信息 - Towards Framework-Independent Evaluation of Deep Linguistic Parsers

Towards Framework-Independent Evaluation of Deep Linguistic Parsers

This paper describes practical issues in the framework-independent evaluation of deep and shallow parsers. We focus on the use of two dependencybased syntactic representation formats in parser evaluation, namely, Carroll et al. (1998)’s Grammatical Relations and de Marneffe et al. (2006)’s Stanford Dependency scheme. Our approach is to convert the output of parsers into these two formats, and measure the accuracy of the resulting converted output. Through the evaluation of an HPSG parser and Penn Treebank phrase structure parsers, we found that mapping between different representation schemes is a non-trivial task that results in lossy conversions that may obscure important differences between different parsing approaches. We discuss sources of disagreements in the representation of syntactic structures in the two dependency-based formats, indicating possible directions for improved framework-independent parser evaluation.

Yusuke Miyao | Kenji Sagae

[1] Jari Björne,et al. BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[2] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3] Ted Briscoe,et al. An introduction to tag sequence grammars and the RASP system parser , 2006 .

[4] Jun'ichi Tsujii,et al. HPSG Parsing with Shallow Dependency Constraints , 2007, ACL.

[5] James R. Curran,et al. Formalism-Independent Parser Evaluation with CCG and DepBank , 2007, ACL.

[6] James R. Curran,et al. Parsing the WSJ Using CCG and Log-Linear Models , 2004, ACL.

[7] Ted Briscoe,et al. Evaluating the Accuracy of an Unlexicalized Statistical Parser on the PARC DepBank , 2006, ACL.

[8] Tapio Salakoski,et al. On the unification of syntactic annotations under the Stanford dependency scheme: A case study on BioInfer and GENIA , 2007, BioNLP@ACL.

[9] Ralph Grishman,et al. A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars , 1991, HLT.

[10] Jun'ichi Tsujii,et al. Probabilistic Disambiguation Models for Wide-Coverage HPSG Parsing , 2005, ACL.

[11] Mark Steedman,et al. Acquiring Compact Lexicalized Grammars from a Cleaner Treebank , 2002, LREC.