The Effects of Human Variation in DUC Summarization Evaluation
[1] Daniel Marcu, et al. Sentence Level Discourse Parsing using Syntactic and Lexical Information, 2003, NAACL.
[2] Ellen M. Voorhees, et al. Variations in relevance judgments and the measurement of retrieval effectiveness, 1998, SIGIR '98.
[3] Simone Teufel, et al. Examining the consensus between human summaries: initial experiments with factoid analysis, 2003, HLT-NAACL 2003.
[4] Ani Nenkova, et al. Evaluating Content Selection in Summarization: The Pyramid Method, 2004, NAACL.
[5] Kathleen R. McKeown, et al. Summarization Evaluation Methods: Experiments and Analysis, 1998.
[6] Eduard Hovy, et al. Manual and automatic evaluation of summaries, 2002, ACL 2002.