An Effective Corpus-Based Question Answering Pipeline for Italian

Question Answering is a longevous field in computer science, aimed at realizing systems able to answer questions expressed in natural language. However, building Question Answering systems for Italian and able to extract answers from a corpus pertaining a closed domain is still an open research problem. Indeed, extracting clues from a question to generate a query for the information retrieval engine as well as determining the likelihood that a candidate answer is correct are two very thorny tasks. To face these issues, the paper presents a Question Answering pipeline for Italian and based on a corpus of documents pertaining a closed domain. In particular, this pipeline exhibits functionalities for: (i) analyzing natural language questions in Italian by using lexical features; (ii) handling both factoid and description answer types and, depending on them, filtering contextual stop words from questions; (iii) scoring and selecting candidate answers with respect to their type in order to determine the best one. The proposed solution has been subject to an evaluation of its performance using standard metrics, showing promising results.

[1]  Ronald G. Dreslinski,et al.  Designing Future Warehouse-Scale Computers for Sirius, an End-to-End Voice and Vision Personal Assistant , 2016, TOCS.

[2]  Wlodek Zadrozny,et al.  Watsonsim: Overview of a Question Answering Engine , 2014, ArXiv.

[3]  Flora Amato,et al.  Exploiting Cloud and Workflow Patterns for the Analysis of Composite Cloud Services , 2017, Future Gener. Comput. Syst..

[4]  Tingting He,et al.  Knowledge Base Question Answering Based on Deep Learning Models , 2016, NLPCC/ICCPOL.

[5]  Xian Zhang,et al.  A Chinese Question Answering System for Specific Domain , 2014, WAIM.

[6]  Flora Amato,et al.  Pattern-based orchestration and automatic verification of composite cloud services , 2016, Comput. Electr. Eng..

[7]  Petr Baudi,et al.  YodaQA: A Modular Question Answering System Pipeline , 2015 .

[8]  Karl-Heinz Weis A Case Based Reasoning Approach for Answer Reranking in Question Answering , 2013, GI-Jahrestagung.

[9]  Miltiadis D. Lytras,et al.  AQUA: A Closed-Domain Question Answering System , 2010, Inf. Syst. Manag..

[10]  Lei Yu,et al.  Deep Learning for Answer Sentence Selection , 2014, ArXiv.

[11]  Bowen Zhou,et al.  Applying deep learning to answer selection: A study and an open task , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).

[12]  Avinash J. Agrawal,et al.  Keywords based Closed Domain Question Answering System for Indian Penal Code Sections and Indian Amendment Laws , 2015 .

[13]  Aditya Kalyanpur,et al.  A framework for merging and ranking of answers in DeepQA , 2012, IBM J. Res. Dev..

[14]  Vitor Rocio,et al.  IdSay: Question Answering for Portuguese , 2008, CLEF.

[15]  Manuel Montes-y-Gómez,et al.  A Language Independent Method for Question Classification , 2004, COLING.

[16]  Flora Amato,et al.  Model transformations of MapReduce Design Patterns for automatic development and verification , 2017, J. Parallel Distributed Comput..

[17]  Roberto Pirrone,et al.  QuASIt: A Cognitive Inspired Approach to Question Answering for the Italian Language , 2016, AI*IA.

[18]  Giuseppe De Pietro,et al.  Towards a Framework for Closed-Domain Question Answering in Italian , 2016, 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[19]  Dragomir R. Radev,et al.  Question-answering by predictive annotation , 2000, SIGIR '00.

[20]  Chang Wang,et al.  Relation extraction and scoring in DeepQA , 2012, IBM J. Res. Dev..

[21]  Siddharth Patwardhan,et al.  Question analysis: How Watson reads a clue , 2012, IBM J. Res. Dev..

[22]  Boris Katz,et al.  Learning to Answer Questions from Wikipedia Infoboxes , 2016, EMNLP.

[23]  Alexander H. Waibel,et al.  A Pattern Learning Approach to Question Answering Within the Ephyra Framework , 2006, TSD.

[24]  Peter Clark,et al.  Automatic Coupling of Answer Extraction and Information Retrieval , 2013, ACL.

[25]  Annalina Caputo,et al.  Exploiting Distributional Semantic Models in Question Answering , 2012, 2012 IEEE Sixth International Conference on Semantic Computing.