JU_CSE_NLP: Language Independent Cross-lingual Textual Entailment System

This article presents the experiments carried out at Jadavpur University as part of the participation in Cross-lingual Textual Entailment for Content Synchronization (CLTE) of task 8 @ Semantic Evaluation Exercises (SemEval-2012). The work explores cross-lingual textual entailment as a relation between two texts in different languages and proposes different measures for entailment decision in a four way classification tasks (forward, backward, bidirectional and no-entailment). We set up different heuristics and measures for evaluating the entailment between two texts based on lexical relations. Experiments have been carried out with both the text and hypothesis converted to the same language using the Microsoft Bing translation system. The entailment system considers Named Entity, Noun Chunks, Part of speech, N-Gram and some text similarity measures of the text pair to decide the entailment judgments. Rules have been developed to encounter the multi way entailment issue. Our system decides on the entailment judgment after comparing the entailment scores for the text pairs. Four different rules have been developed for the four different classes of entailment. The best run is submitted for Italian -- English language with accuracy 0.326.

[1]  Ido Dagan,et al.  The Sixth PASCAL Recognizing Textual Entailment Challenge , 2009, TAC.

[2]  Shuming Shi,et al.  Overview of NTCIR-9 RITE: Recognizing Inference in TExt , 2011, NTCIR.

[3]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[4]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[5]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[6]  Sivaji Bandyopadhyay,et al.  JU_CSE_TAC: Textual Entailment Recognition System at TAC RTE-6 , 2010, TAC.

[7]  Sivaji Bandyopadhyay,et al.  A Textual Entailment System using Anaphora Resolution , 2011, TAC.

[8]  Deniz Yuret,et al.  SemEval-2010 Task 12: Parser Evaluation Using Textual Entailments , 2010, *SEMEVAL.

[9]  Peter Clark,et al.  The Seventh PASCAL Recognizing Textual Entailment Challenge , 2011, TAC.

[10]  Sivaji Bandyopadhyay,et al.  Lexical based two-way RTE System at RTE-5 , 2009, TAC.

[11]  Marcello Federico,et al.  Towards Cross-Lingual Textual Entailment , 2010, NAACL.

[12]  Ido Dagan,et al.  The Fourth PASCAL Recognizing Textual Entailment Challenge , 2008, TAC.

[13]  Roy Bar-Haim,et al.  The Second PASCAL Recognising Textual Entailment Challenge , 2006 .

[14]  Sivaji Bandyopadhyay,et al.  A Textual Entailment System using Web based Machine Translation System , 2011, NTCIR.

[15]  Matteo Negri,et al.  Creating a Bi-lingual Entailment Corpus through Translations with Mechanical Turk: $100 for a 10-day Rush , 2010, Mturk@HLT-NAACL.

[16]  Johan Bos,et al.  Textual Entailment at EVALITA 2009 , 2009 .

[17]  Marcello Federico,et al.  Using Bilingual Parallel Corpora for Cross-Lingual Textual Entailment , 2011, ACL.

[18]  Ido Dagan,et al.  The Fourth PASCAL Recognizing Textual Entailment Challenge | NIST , 2009 .