SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis
暂无分享,去创建一个
[1] Jinglu Hu,et al. Improving Image Captioning Evaluation by Considering Inter References Variance , 2020, ACL.
[2] Serge J. Belongie,et al. Learning to Evaluate Image Captioning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[3] Omri Abend,et al. Reference-less Measure of Faithfulness for Grammatical Error Correction , 2018, NAACL.
[4] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.
[5] Matt Post,et al. Ground Truth for Grammatical Error Correction Metrics , 2015, ACL.
[6] Kristina Lerman,et al. A Survey on Bias and Fairness in Machine Learning , 2019, ACM Comput. Surv..
[7] Thomas Wolf,et al. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter , 2019, ArXiv.
[8] Cyrus Rashtchian,et al. Collecting Image Annotations Using Amazon’s Mechanical Turk , 2010, Mturk@HLT-NAACL.
[9] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[10] Yu Cheng,et al. Patient Knowledge Distillation for BERT Model Compression , 2019, EMNLP.
[11] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.
[12] Thomas M. Cover,et al. Elements of Information Theory , 2005 .
[13] Basura Fernando,et al. SPICE: Semantic Propositional Image Caption Evaluation , 2016, ECCV.
[14] Trevor Darrell,et al. Women also Snowboard: Overcoming Bias in Captioning Models , 2018, ECCV.
[15] Kumiko Tanaka-Ishii,et al. Cross Entropy of Neural Language Models at Infinity—A New Bound of the Entropy Rate , 2018, Entropy.
[16] Fei-Fei Li,et al. Deep visual-semantic alignments for generating image descriptions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Zhe Gan,et al. Distilling Knowledge Learned in BERT for Text Generation , 2019, ACL.
[18] Marcin Junczys-Dowmunt,et al. Human Evaluation of Grammatical Error Correction Systems , 2015, EMNLP.
[19] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[20] Kilian Q. Weinberger,et al. BERTScore: Evaluating Text Generation with BERT , 2019, ICLR.
[21] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[22] Ted Briscoe,et al. Towards a standard evaluation method for grammatical error detection and correction , 2015, NAACL.
[23] Kentaro Inui,et al. Reference-based Metrics can be Replaced with Reference-less Metrics in Evaluating Grammatical Error Correction Systems , 2017, IJCNLP.
[24] Jing Gu,et al. Perception Score, A Learned Metric for Open-ended Text Generation Evaluation , 2020, ArXiv.
[25] Julia Hockenmaier,et al. Focused Evaluation for Image Description with Binary Forced-Choice Tasks , 2016, VL@ACL.
[26] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[27] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[28] Xinlei Chen,et al. Microsoft COCO Captions: Data Collection and Evaluation Server , 2015, ArXiv.
[29] Mert Kilickaya,et al. Re-evaluating Automatic Metrics for Image Captioning , 2016, EACL.
[30] Lexing Xie,et al. SemStyle: Learning to Generate Stylised Image Captions Using Unaligned Text , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[31] Thibault Sellam,et al. BLEURT: Learning Robust Metrics for Text Generation , 2020, ACL.
[32] Siqi Liu,et al. Improved Image Captioning via Policy Gradient optimization of SPIDEr , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[33] Radu Soricut,et al. Quality Estimation for Image Captions Based on Large-scale Human Evaluations , 2021, NAACL.
[34] Hwee Tou Ng,et al. A Beam-Search Decoder for Grammatical Error Correction , 2012, EMNLP.
[35] Zhe Gan,et al. TIGEr: Text-to-Image Grounding for Image Caption Evaluation , 2019, EMNLP.
[36] Fei Liu,et al. MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance , 2019, EMNLP.
[37] Peter Young,et al. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics , 2013, J. Artif. Intell. Res..
[38] Rita Cucchiara,et al. Meshed-Memory Transformer for Image Captioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Tao Mei,et al. X-Linear Attention Networks for Image Captioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Mohammed Bennamoun,et al. LCEval: Learned Composite Metric for Caption Evaluation , 2019, International Journal of Computer Vision.
[41] อนิรุธ สืบสิงห์,et al. Data Mining Practical Machine Learning Tools and Techniques , 2014 .
[42] Joel R. Tetreault,et al. There’s No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction , 2016, EMNLP.
[43] Peter Young,et al. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions , 2014, TACL.
[44] Ali Abdalla,et al. NUBIA: NeUral Based Interchangeability Assessor for Text Generation , 2020, EVALNLGEVAL.
[45] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Chitta Baral,et al. Image Understanding using vision and reasoning through Scene Description Graph , 2018, Comput. Vis. Image Underst..
[47] Raymond Hendy Susanto,et al. The CoNLL-2014 Shared Task on Grammatical Error Correction , 2014 .
[48] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Omer Levy,et al. What Does BERT Look at? An Analysis of BERT’s Attention , 2019, BlackboxNLP@ACL.