UPV-UMA at CheckThat! Lab: Verifying Arabic Claims using a Cross Lingual Approach

In this paper we present our team participation at CheckThat!2019 lab Task 2 on Arabic claim verification. We propose a cross-lingual approach to detect the factuality of claims using three main steps, evidence retrieval, evidence ranking, and textual entailment. Our approach achieves the best performance in subtask-D, with a value of 0.62 as F1.

[1]  Goran Glavas,et al.  How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions , 2019, ACL.

[2]  Preslav Nakov,et al.  Fully Automated Fact Checking Using External Sources , 2017, RANLP.

[3]  Paolo Rosso,et al.  ARAP: Arabic Author Profiling Project for Cyber-Security , 2018, Proces. del Leng. Natural.

[4]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[5]  Preslav Nakov,et al.  CheckThat! at CLEF 2019: Automatic Identification and Verification of Claims , 2019, ECIR.

[6]  Ossama Emam,et al.  Language Model Based Arabic Word Segmentation , 2003, ACL.

[7]  Samuel L. Smith,et al.  Offline bilingual word vectors, orthogonal transformations and the inverted softmax , 2017, ICLR.

[8]  Gerhard Weikum,et al.  Leveraging Joint Interactions for Credibility Analysis in News Communities , 2015, CIKM.

[9]  Vitalii Zhelezniak,et al.  Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors , 2019, ICLR.

[10]  Zhen-Hua Ling,et al.  Enhanced LSTM for Natural Language Inference , 2016, ACL.

[11]  Guillaume Lample,et al.  XNLI: Evaluating Cross-lingual Sentence Representations , 2018, EMNLP.

[12]  Motaz Saad,et al.  WikiDocsAligner: An Off-the-Shelf Wikipedia Documents Alignment Tool , 2017, 2017 Palestinian International Conference on Information and Communication Technology (PICICT).

[13]  Preslav Nakov,et al.  Overview of the CLEF-2019 CheckThat! Lab: Automatic Identification and Verification of Claims. Task 2: Evidence and Factuality , 2019, CLEF.

[14]  Samuel R. Bowman,et al.  A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.

[15]  Farah Benamara,et al.  SOUKHRIA: Towards an Irony Detection System for Arabic in Social Media , 2017, ACLING.