Semi-automatic Knowledge Base Expansion for Question Answering

In this paper we present a hybrid semi-automatic methodology for the construction of a Lexical Knowledge Base. The purpose of the work is to face the challenges related to synonymy in Question Answering systems. We talk about a “hybrid” method for two main reasons: firstly, it includes both a manual annotation process and an automatic expansion phase; secondly, the Knowledge Base data refers, a the same time, to both the syntactic and the semantic properties borrowed from the Lexicon-Grammar theoretical framework. The resulting Knowledge Base allows the automatic recognition of those nouns and adjectives which are not typically related into synonyms databases. In detail, we refer to nouns and adjectives that enter into a morph-phonological relation with verbs, in addition to the classic matching between words based on synset from the MultiWordNet synonym database.

[1]  Kenneth C. Litkowski,et al.  Syntactic Clues and Lexical Resources in Question-Answering , 2000, TREC.

[2]  Christopher Meek,et al.  Semantic Parsing for Single-Relation Question Answering , 2014, ACL.

[3]  Annibale Elia,et al.  Lessico-grammatica dell'italiano : metodi, descrizioni e applicazioni , 2004 .

[4]  Flora Amato,et al.  Analyse digital forensic evidences through a semantic-based methodology and NLP techniques , 2019, Future Gener. Comput. Syst..

[5]  Flora Amato,et al.  Correlation of Digital Evidences in Forensic Investigation through Semantic Technologies , 2017, 2017 31st International Conference on Advanced Information Networking and Applications Workshops (WAINA).

[6]  Flora Amato,et al.  An Application of Semantic Techniques for Forensic Analysis , 2018, 2018 32nd International Conference on Advanced Information Networking and Applications Workshops (WAINA).

[7]  John McCarthy,et al.  Notes on Formalizing Context , 1993, IJCAI.

[8]  Andreas Reuter,et al.  Principles of transaction-oriented database recovery , 1983, CSUR.

[9]  Annibale Elia,et al.  Lexicon-Grammar, Electronic Dictionaries and Local Grammars of Italian , 2004 .

[10]  Tim Berners-Lee,et al.  Realising the Full Potential of the Web , 1999 .

[11]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[12]  Alessandro Maisto,et al.  Automatic Text Preprocessing for Intelligent Dialog Agents , 2019, AINA Workshops.

[13]  Max Silberztein,et al.  NooJ: a Linguistic Annotation System for Corpus Processing , 2005, HLT.

[14]  Massimiliano Polito,et al.  An Industrial Multi-Agent System (MAS) Platform , 2019, 3PGCIC.

[15]  Maurice Gross,et al.  Le verbe italien : les completives dans les phrases a un complement , 1984 .

[16]  Flora Amato,et al.  ABC: A knowledge Based Collaborative framework for e-health , 2015, 2015 IEEE 1st International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI).

[17]  Rahul Gupta,et al.  Knowledge base completion via search-based question answering , 2014, WWW.