A Comparison of Natural Language Understanding Services to build a chatbot in Italian

All leading IT companies have developed cloud-based platforms that allow building a chatbot in few steps and most times without knowledge about programming languages. These services are based on Natural Language Understanding (NLU) engines which deal with identifying information such as entities and intents from the sentences provided as input. In order to integrate a chatbot on an e-learning platform, we want to study the performance in intent recognition task of major NLU platforms available on the market through a deep and severe comparison, using an Italian dataset which is provided by the owner of the e-learning platform. We focused on the intent recognition task because we believe that it is the core part of an efficient chatbot, which is able to operate in a complex context with thousands of users who have different language skills. We carried out different experiments and collected performance information about F-score, error rate, response time and robustness of all selected NLU platforms.

[1]  Ivan Serina,et al.  Leveraging Multi-task Learning for Biomedical Named Entity Recognition , 2019, AI*IA.

[2]  Rishin Haldar,et al.  Applying Chatbots to the Internet of Things: Opportunities and Architectural Elements , 2016, ArXiv.

[3]  Luigi De Russis,et al.  A Comparison and Critique of Natural Language Understanding Tools , 2018 .

[4]  Ivan Serina,et al.  Combining Multi-task Learning with Transfer Learning for Biomedical Named Entity Recognition , 2020, KES.

[5]  Matteo Zubani,et al.  Evaluating different Natural Language Understanding services in a real business case for the Italian language , 2020, KES.

[6]  Verena Rieser,et al.  Benchmarking Natural Language Understanding Services for building Conversational Agents , 2019, IWSDS.

[7]  Taweesak Samanchuen,et al.  Chatbot Utilization for Medical Consultant System , 2018, 2018 3rd Technology Innovation Management and Engineering Science International Conference (TIMES-iCON).

[8]  Ivan Serina,et al.  Automatic Classification of Radiological Reports for Clinical Care , 2017, AIME.

[9]  Ashok K. Goel,et al.  Jill Watson , 2018, Learning Engineering for Online Education.

[10]  Ivan Serina,et al.  Applying Self-interaction Attention for Extracting Drug-Drug Interactions , 2019, AI*IA.

[11]  Adrian Hernandez-Mendez,et al.  Evaluating Natural Language Understanding Services for Conversational Question Answering Systems , 2017, SIGDIAL Conference.

[12]  Ivan Serina,et al.  Multi-task Learning Applied to Biomedical Named Entity Recognition Task , 2019, CLiC-it.

[13]  Ivan Serina,et al.  Deep Learning for Classification of Radiology Reports with a Hierarchical Schema , 2020, KES.

[14]  Ivan Serina,et al.  The Impact of Self-Interaction Attention on the Extraction of Drug-Drug Interactions , 2019, CLiC-it.