论文信息 - Shared Task on Evaluating Accuracy

Shared Task on Evaluating Accuracy

We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts, specifically summaries of basketball games produced from basketball box score and other game data. We welcome submissions based on protocols for human evaluation, automatic metrics, as well as combinations of human evaluations and metrics.

Ehud Reiter | Craig Thomson

[1] Ehud Reiter,et al. A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems , 2020, INLG.

[2] Hongmin Wang,et al. Revisiting Challenges in Data-to-Text Generation with Fact Grounding , 2020, INLG.

[3] Patrick Gallinari,et al. A Hierarchical Model for Data-to-Text Generation , 2019, ECIR.

[4] Xiaocheng Feng,et al. Table-to-Text Generation with Effective Hierarchical Encoder on Three Dimensions (Row, Column and Time) , 2019, EMNLP.

[5] Alexander M. Rush,et al. Challenges in Data-to-Document Generation , 2017, EMNLP.

[6] Mirella Lapata,et al. Data-to-Text Generation with Content Selection and Planning , 2018, AAAI.

[7] Yusuke Miyao,et al. Learning to Select, Track, and Generate for Data-to-Text , 2019, ACL.

[8] Mirella Lapata,et al. Data-to-text Generation with Entity Modeling , 2019, ACL.

[9] Albert Gatt,et al. Best practices for the human evaluation of automatically generated text , 2019, INLG.