Towards Open Domain Chatbots - A GRU Architecture for Data Driven Conversations

Understanding of textual content, such as topic and intent recognition, is a critical part of chatbots, allowing the chatbot to provide relevant responses. Although successful in several narrow domains, the potential diversity of content in broader and more open domains renders traditional pattern recognition techniques inaccurate. In this paper, we propose a novel deep learning architecture for content recognition that consists of multiple levels of gated recurrent units (GRUs). The architecture is designed to capture complex sentence structure at multiple levels of abstraction, seeking content recognition for very wide domains, through a distributed scalable representation of content. To evaluate our architecture, we have compiled 10 years of questions and answers from a youth information service, \(200\ 083\) questions spanning a wide range of content, altogether 289 topics, involving law, health, and social issues. Despite the relatively open domain data set, our architecture is able to accurately categorize the 289 intents and topics. Indeed, it provides roughly an order of magnitude higher accuracy compared to more classical content recognition techniques, such as SVM, Naive Bayes, random forest, and K-nearest neighbor, which all seem to fail on this challenging open domain dataset.

[1]  Andreas Holzinger,et al.  Data Mining with Decision Trees: Theory and Applications , 2015, Online Inf. Rev..

[2]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[3]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[4]  Supratip Ghose,et al.  Toward the implementation of a topic specific dialogue based natural language chatbot as an undergraduate advisor , 2013, 2013 International Conference on Informatics, Electronics and Vision (ICIEV).

[5]  John Woods,et al.  Survey on Chatbot Design Techniques in Speech Conversation Systems , 2015 .

[6]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Michael J. Watts,et al.  IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Manisha Biswas,et al.  Microsoft Bot Framework , 2018 .

[9]  Anbang Xu,et al.  A New Chatbot for Customer Service on Social Media , 2017, CHI.

[10]  S. Rigatti Random Forest. , 2017, Journal of insurance medicine.

[11]  Eric Atwell,et al.  ALICE Chatbot: Trials and Outputs , 2015, Computación y Sistemas.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  Edson P. Pimentel,et al.  Artificial Intelligence MArkup Language: A Brief Tutorial , 2013, ArXiv.

[14]  Ho-Jin Choi,et al.  The chatbot feels you - a counseling service using emotional response generation , 2017, 2017 IEEE International Conference on Big Data and Smart Computing (BigComp).

[15]  J. Weizenbaum Computer Power And Human Reason: From Judgement To Calculation , 1978 .

[16]  David A. Ferrucci,et al.  Introduction to "This is Watson" , 2012, IBM J. Res. Dev..