论文信息 - Evaluation Metrics for Generation

Evaluation Metrics for Generation

Certain generation applications may profit from the use of stochastic methods. In developing stochastic methods, it is crucial to be able to quickly assess the relative merits of different approaches or models. In this paper, we present several types of intrinsic (system internal) metrics which we have used for baseline quantitative assessment. This quantitative assessment should then be augmented to a fuller evaluation that examines qualitative aspects. To this end, we describe an experiment that tests correlation between the quantitative metrics and human qualitative judgment. The experiment confirms that intrinsic metrics cannot replace human evaluation, but some correlate significantly with human judgments of quality and understandability and can be used for evaluation during development.

[1] Srinivas Bangalore,et al. Exploiting a Probabilistic Hierarchical Model for Generation , 2000, COLING.

[2] Srinivas Bangalore,et al. Supertagging: An Approach to Almost Parsing , 1999, CL.

[3] Kevin Knight,et al. The Practical Value of N-Grams Is in Generation , 1998, INLG.

[4] Karen Kukich,et al. Knowledge-based report generation : a knowledge engineering approach to natural language report generation , 1983 .

[5] James C. Lester,et al. Developing and Empirically Evaluating Robust Explanation Generators: The KNIGHT Experiments , 1997, Comput. Linguistics.

[6] Igor Mel’čuk,et al. Dependency Syntax: Theory and Practice , 1987 .

[7] Marilyn A. Walker,et al. PARADISE: A Framework for Evaluating Spoken Dialogue Agents , 1997, ACL.

[8] H. Alshawi,et al. Automatic Acquisition of Hierarchical Transduction Models for Machine Translation , 2022, COLING.

[9] Kevin Knight,et al. Generation that Exploits Corpus-Based Statistical Knowledge , 1998, ACL.

[10] Irene Langkilde-Geary,et al. Forest-Based Statistical Sentence Generation , 2000, ANLP.

[11] Chris Mellish,et al. Evaluation in the context of natural language generation , 1998, Comput. Speech Lang..

[12] Anne Abeillé,et al. A Lexicalized Tree Adjoining Grammar for English , 1990 .

[13] XTAG Research Group,et al. A Lexicalized Tree Adjoining Grammar for English , 1998, ArXiv.