Focused domain contextual AI chatbot framework for resource poor languages

ABSTRACT In today’s business world, providing reliable customer service is equally important as delivering better products for maintaining a sustainable business model. As providing customer service requires human resource and money, businesses are often shifting towards artificial intelligence system for necessary customer interaction. However, these traditional chatbot architectures depend heavily on natural language processing (NLP), it is not feasible to implement for the languages with little to no prior NLP backbone. In this work, we propose a semi-supervised artificially intelligent chatbot framework that can automate parts of primary interaction and customer service. The primary focus of this work is to build a chatbot which can generate contextualized responses in any language without depending much on rich NLP background and a vast number of a prior data set. This system is designed in such a way that with a dictionary of a language and regular customer interaction dataset, it can provide customer services for any business in any language. This architecture has been used to build a customer service bot for an electric shop, and different analysis has been done to evaluate the performance of individual components of the framework to show its competence to provide reliable response generation in comparison with other approaches.

[1]  Lorenzo Rosasco,et al.  Are Loss Functions All the Same? , 2004, Neural Computation.

[2]  Levent Özgür,et al.  Text Categorization with Class-Based and Corpus-Based Keyword Selection , 2005, ISCIS.

[3]  Rong Jin,et al.  Understanding bag-of-words model: a statistical framework , 2010, Int. J. Mach. Learn. Cybern..

[4]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[5]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[6]  Martin T. Hagan,et al.  Neural network design , 1995 .

[7]  Juan Luis Castro,et al.  A multi-agent conversational system with heterogeneous data sources access , 2016, Expert Syst. Appl..

[8]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Tara N. Sainath,et al.  Improving deep neural networks for LVCSR using rectified linear units and dropout , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Huidong Jin,et al.  CenKNN: a scalable and effective text classifier , 2014, Data Mining and Knowledge Discovery.

[11]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[12]  Anjali Ganesh Jivani,et al.  A Comparative Study of Stemming Algorithms , 2011 .

[13]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[14]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[15]  Kirthevasan Kandasamy,et al.  Batch Policy Gradient Methods for Improving Neural Conversation Models , 2017, ICLR.

[16]  Chayan Chakrabarti,et al.  Artificial conversations for customer service chatter bots: Architecture, algorithms, and evaluation metrics , 2015, Expert Syst. Appl..

[17]  Ming Li,et al.  Enhanced question understanding with dynamic memory networks for textual question answering , 2017, Expert Syst. Appl..

[18]  Ruofei Zhang,et al.  DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks , 2017, KDD.

[19]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[20]  Edward Fredkin,et al.  Trie memory , 1960, Commun. ACM.

[21]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[22]  Huaiqin Wu Global stability analysis of a general class of discontinuous neural networks with linear growth activation functions , 2009, Inf. Sci..

[23]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[24]  Ladislav Lenc,et al.  Two-Level Neural Network for Multi-label Document Classification , 2017, ICANN.

[25]  Mile Pavlic,et al.  Question answering with a conceptual framework for knowledge-based system development "Node of Knowledge" , 2015, Expert Syst. Appl..

[26]  Honglak Lee,et al.  Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units , 2016, ICML.

[27]  Jianfeng Gao,et al.  Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[28]  Ferry Wahyu Wibowo,et al.  Chatbot Using a Knowledge in Database: Human-to-Machine Conversation Modeling , 2016, 2016 7th International Conference on Intelligent Systems, Modelling and Simulation (ISMS).