论文信息 - A Paragraph-level Multi-task Learning Model for Scientific Fact-Verification - 字舞流文

A Paragraph-level Multi-task Learning Model for Scientific Fact-Verification

Even for domain experts, it is a non-trivial task to verify a scientific claim by providing supporting or refuting evidence rationales. The situation worsens as misinformation is proliferated on social media or news websites, manually or programmatically, at every moment. As a result, an automatic factverification tool becomes crucial for combating the spread of misinformation. In this work, we propose a novel, paragraphlevel, multi-task learning model for the SCIFACT task by directly computing a sequence of contextualized sentence embeddings from a BERT model and jointly training the model on rationale selection and stance prediction.

Nanyun Peng | Xiangci Li | Gully Burns | Gully A. Burns | Nanyun Peng | Xiangci Li

[1] Andreas Vlachos,et al. FEVER: a Large-scale Dataset for Fact Extraction and VERification , 2018, NAACL.

[2] Andreas Vlachos,et al. Emergent: a novel data-set for stance classification , 2016, NAACL.

[3] Byron C. Wallace,et al. ERASER: A Benchmark to Evaluate Rationalized NLP Models , 2020, ACL.

[4] Preslav Nakov,et al. Overview of the CLEF-2018 CheckThat! Lab on Automatic Identification and Verification of Political Claims. Task 1: Check-Worthiness , 2018, CLEF.

[5] Gerhard Weikum,et al. Where the Truth Lies: Explaining the Credibility of Emerging Claims on the Web and Social Media , 2017, WWW.

[6] Dan Roth,et al. TwoWingOS: A Two-Wing Optimization Strategy for Evidential Claim Verification , 2018, EMNLP.

[7] Yifan Peng,et al. BioSentVec: creating sentence embeddings for biomedical texts , 2018, 2019 IEEE International Conference on Healthcare Informatics (ICHI).

[8] Smaranda Muresan,et al. DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking , 2020, ACL.

[9] Wenpeng Yin,et al. Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms , 2017, TACL.

[10] William Yang Wang. “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection , 2017, ACL.

[11] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[12] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[13] Matteo Pagliardini,et al. Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features , 2017, NAACL.

[14] Hannaneh Hajishirzi,et al. Fact or Fiction: Verifying Scientific Claims , 2020, EMNLP.

[15] Andreas Vlachos,et al. Fact Checking: Task definition and dataset construction , 2014, LTCSS@ACL.

[16] Preslav Nakov,et al. Integrating Stance Detection and Fact Checking in a Unified Corpus , 2018, NAACL.

[17] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.

[18] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.

[19] Nanyun Peng,et al. Multi-task Domain Adaptation for Sequence Tagging , 2016, Rep4NLP@ACL.

[20] Zhiyuan Liu,et al. End-to-End Neural Ad-hoc Ranking with Kernel Pooling , 2017, SIGIR.

[21] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[22] Maosong Sun,et al. GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification , 2019, ACL.

[23] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.

[24] Chris Callison-Burch,et al. Seeing Things from a Different Angle:Discovering Diverse Perspectives about Claims , 2019, NAACL.

[25] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[26] Iryna Gurevych,et al. A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking , 2019, CoNLL.

[27] Arkaitz Zubiaga,et al. SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours , 2017, *SEMEVAL.

[28] Colin Raffel,et al. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[29] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[30] Jimmy J. Lin,et al. Scientific Claim Verification with VerT5erini , 2020, LOUHI.

[31] Haonan Chen,et al. Combining Fact Extraction and Verification with Neural Semantic Matching Networks , 2018, AAAI.

[32] Jianfeng Gao,et al. A Human Generated MAchine Reading COmprehension Dataset , 2018 .

[33] Nanyun Peng,et al. Multi-task Multi-domain Representation Learning for Sequence Tagging , 2016, ArXiv.