FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization
暂无分享,去创建一个
[1] Inderjeet Mani,et al. The Tipster Summac Text Summarization Evaluation , 1999, EACL.
[2] Daniel Marcu,et al. Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..
[3] Christopher D. Manning,et al. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.
[4] Mirella Lapata,et al. Discourse Constraints for Document Compression , 2010, CL.
[5] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.
[6] Christopher D. Manning,et al. Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.
[7] Jianfeng Gao,et al. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.
[8] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.
[9] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.
[10] Jian Zhang,et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.
[11] Joelle Pineau,et al. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.
[12] Yang Liu,et al. Modeling Coverage for Neural Machine Translation , 2016, ACL.
[13] Jianfeng Gao,et al. A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.
[14] Bowen Zhou,et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.
[15] Verena Rieser,et al. Why We Need New Evaluation Metrics for NLG , 2017, EMNLP.
[16] Alexander M. Rush,et al. Challenges in Data-to-Document Generation , 2017, EMNLP.
[17] Bowen Zhou,et al. SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.
[18] Yang Liu,et al. Neural Machine Translation with Reconstruction , 2016, AAAI.
[19] Joelle Pineau,et al. Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses , 2017, ACL.
[20] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.
[21] Zhen-Hua Ling,et al. Enhanced LSTM for Natural Language Inference , 2016, ACL.
[22] Mirella Lapata,et al. Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization , 2018, EMNLP.
[23] Furu Wei,et al. Faithful to the Original: Fact Aware Neural Abstractive Summarization , 2017, AAAI.
[24] Yen-Chun Chen,et al. Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting , 2018, ACL.
[25] Percy Liang,et al. Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.
[26] Mor Naaman,et al. Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies , 2018, NAACL.
[27] Alexander M. Rush,et al. Bottom-Up Abstractive Summarization , 2018, EMNLP.
[28] Percy Liang,et al. Transforming Question Answering Datasets Into Natural Language Inference Datasets , 2018, ArXiv.
[29] Dan Klein,et al. Constituency Parsing with a Self-Attentive Encoder , 2018, ACL.
[30] Luke S. Zettlemoyer,et al. AllenNLP: A Deep Semantic Natural Language Processing Platform , 2018, ArXiv.
[31] Fangfang Zhang,et al. On the Abstractiveness of Neural Document Summarization , 2018, EMNLP.
[32] Percy Liang,et al. The price of debiasing automatic metrics in natural language evalaution , 2018, ACL.
[33] Fei Wu,et al. A Semantic QA-Based Approach for Text Summarization Evaluation , 2017, AAAI.
[34] Percy Liang,et al. Unifying Human and Statistical Evaluation for Natural Language Generation , 2019, NAACL.
[35] R. Thomas McCoy,et al. Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference , 2019, ACL.
[36] Ido Dagan,et al. Ranking Generated Summaries by Correctness: An Interesting but Challenging Application for Natural Language Inference , 2019, ACL.
[37] Mirella Lapata,et al. Text Summarization with Pretrained Encoders , 2019, EMNLP.
[38] Ben Goodrich,et al. Assessing The Factual Accuracy of Generated Text , 2019, KDD.
[39] Jonathan Berant,et al. Question Answering is a Format; When is it Useful? , 2019, ArXiv.
[40] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .
[41] Franck Dernoncourt,et al. Analyzing Sentence Fusion in Abstractive Summarization , 2019, EMNLP.
[42] Lihong Li,et al. Neural Approaches to Conversational AI , 2019, Found. Trends Inf. Retr..
[43] Richard Socher,et al. Neural Text Summarization: A Critical Evaluation , 2019, EMNLP.
[44] Sylvain Lamprier,et al. Answers Unite! Unsupervised Metrics for Reinforced Summarization Models , 2019, EMNLP.
[45] Michael Elhadad,et al. Question Answering as an Automatic Evaluation Metric for News Article Summarization , 2019, NAACL.
[46] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[47] Kilian Q. Weinberger,et al. BERTScore: Evaluating Text Generation with BERT , 2019, ICLR.
[48] Omer Levy,et al. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension , 2019, ACL.
[49] Yejin Choi,et al. The Curious Case of Neural Text Degeneration , 2019, ICLR.
[50] J. Weston,et al. Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training , 2019, ACL.