A novel semantic and logical-based approach integrating RTE technique in the Arabic question–answering

In this paper, we propose a new approach for the Arabic question-answering system. This approach is based on the automatic understanding of Arabic texts to convert them into semantic and logical representations. Our approach is designed to determine the relation of the textual entailment between the logical representations of the question and the text passage in order to select the text passage that contains an answer to the question and find the precise answer. For a logical representation, we referred to a semantic representation. The idea is to convert the Arabic texts into conceptual graphs that allow the modelling of textual information by concepts and relations. From these graphs, we proposed an algorithm that transforms each conceptual graph to a logical representation. Finally, we extracted three features and combined them to determine the textual entailment relation between two logical representations of the question and its passage answer. Our approach has been validated through a question-answering system, called NArQAS. For the evaluation, we used questions and text passages from our corpus of questions-texts, AQA-WebCorp. The performance of our approach has reached an accuracy of 74%.

[1]  Ghassan Mourad,et al.  Analyse informatique des signes typographiques pour la segmentation de textes et l'extraction automatique de citations : réalisation des applications informatiques : SegATex et CitaRE , 2001 .

[2]  Mahmoud Neji,et al.  An Arabic Question-Answering System Combining a Semantic and Logical Representation of Texts , 2017, ISDA.

[3]  Kenneth Magel,et al.  Software localization: the challenging aspects of Arabic to the localization process (Arabization) , 2008, ICSE 2008.

[4]  Mohanaad Shakir,et al.  Domain-specific ontology-based approach for Arabic question answering , 2016 .

[5]  Iyad AlAgha,et al.  AR2SPARQL: An Arabic Natural Language Interface for the Semantic Web , 2015 .

[6]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[7]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[8]  Nizar Habash,et al.  Arabic Tokenization, Part-of-Speech Tagging and Morphological Disambiguation in One Fell Swoop , 2005, ACL.

[9]  Josef Steinberger,et al.  Multilingual Statistical News Summarization , 2013, Multi-source, Multilingual Information Extraction and Summarization.

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[11]  Mahmoud Neji,et al.  AQA-WebCorp: Web-based Factual Questions for Arabic , 2016, KES.

[12]  Sameerchand Pudaruth,et al.  An intelligent question answering system for ICT , 2016, 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT).

[13]  Diego Mollá Aliod,et al.  Named Entity Recognition for Question Answering , 2006, ALTA.

[14]  Paolo Rosso,et al.  Arabic QA4MRE at CLEF 2012: Arabic Question Answering for Machine Reading Evaluation , 2012, CLEF.

[15]  Bogdan Babych,et al.  Improving Machine Translation Quality with Automatic Named Entity Recognition , 2003, Proceedings of the 7th International EAMT workshop on MT and other Language Technology Tools, Improving MT through other Language Technology Tools Resources and Tools for Building MT - EAMT '03.

[16]  Paolo Rosso,et al.  IDRAAQ: New Arabic Question Answering System Based on Query Expansion and Passage Retrieval , 2012, CLEF.

[17]  Manika Nanda The Named Entity Recognizer Framework , 2014 .

[18]  Gayatri Chavan,et al.  Design of the Effective Question Answering System by Performing Question Analysis using the Classifier , 2016 .

[19]  Stephen J. Payne,et al.  Constructing structure maps of multiple on-line texts , 2006, Int. J. Hum. Comput. Stud..

[20]  Kenneth Magel,et al.  QArabPro: A Rule Based Question Answering System for Reading Comprehension Tests in Arabic , 2011 .

[21]  Natheer K. Gharaibeh,et al.  Development of Yes/No Arabic Question Answering System , 2013, ArXiv.

[22]  Zoubeir Mouelhi AraSeg : un segmenteur semi-automatique des textes arabes , 2008 .

[23]  Khalid J. Al Daimi,et al.  The syntactic analysis of arabic by machine , 1994, Comput. Humanit..

[24]  W. Bruce Croft,et al.  Analysis of Statistical Question Classification for Fact-Based Questions , 2005, Information Retrieval.

[25]  María-Dolores Olvera-Lobo,et al.  Question Answering Track Evaluation in TREC, CLEF and NTCIR , 2015, WorldCIST.

[26]  Leah S. Larkey,et al.  Arabic Information Retrieval at UMass in TREC-10 , 2001, TREC.