论文信息 - ParaEval: Using Paraphrases to Evaluate Summaries Automatically

ParaEval: Using Paraphrases to Evaluate Summaries Automatically

ParaEval is an automated evaluation method for comparing reference and peer summaries. It facilitates a tiered-comparison strategy where recall-oriented global optimal and local greedy searches for paraphrase matching are enabled in the top tiers. We utilize a domain-independent paraphrase table extracted from a large bilingual parallel corpus using methods from Machine Translation (MT). We show that the quality of ParaEval's evaluations, measured by correlating with human judgments, closely resembles that of ROUGE's.

[1] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .

[2] George A. Miller,et al. Introduction to WordNet: An On-line Lexical Database , 1990 .

[3] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[4] Regina Barzilay,et al. Extracting Paraphrases from a Parallel Corpus , 2001, ACL.

[5] Philip Resnik,et al. An Unsupervised Method for Word Sense Tagging using Parallel Corpora , 2002, ACL.

[6] Daniel Marcu,et al. Natural Language Based Reformulation Resource and Wide Exploitation for Question Answering , 2002, TREC.

[7] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8] Simone Teufel,et al. Examining the consensus between human summaries: initial experiments with factoid analysis , 2003, HLT-NAACL 2003.

[9] Daniel Marcu,et al. Syntax-based Alignment of Multiple Translations: Extracting Paraphrases and Generating New Sentences , 2003, NAACL.

[10] Eduard H. Hovy,et al. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[11] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[12] EstimationPeter,et al. The Mathematics of Machine Translation : Parameter , 2004 .

[13] Ani Nenkova,et al. Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[14] Hermann Ney,et al. The Alignment Template Approach to Statistical Machine Translation , 2004, CL.

[15] Eduard Hovy,et al. Evaluating DUC 2005 using Basic Elements , 2005 .

[16] Chris Callison-Burch,et al. Paraphrasing with Bilingual Parallel Corpora , 2005, ACL.