In bot we trust: A new methodology of chatbot performance measures

Abstract Chatbots are used frequently in business to facilitate various processes, particularly those related to customer service and personalization. In this article, we propose novel methods of tracking human-chatbot interactions and measuring chatbot performance that take into consideration ethical concerns, particularly trust. Our proposed methodology links neuroscientific methods, text mining, and machine learning. We argue that trust is the focal point of successful human-chatbot interaction and assess how trust as a relevant category is being redefined with the advent of deep learning supported chatbots. We propose a novel method of analyzing the content of messages produced in human-chatbot interactions, using the Condor Tribefinder system we developed for text mining that is based on a machine learning classification engine. Our results will help build better social bots for interaction in business or commercial environments.

[1]  J. Weizenbaum From Computer Power and Human Reason From Judgment to Calculation , 2007 .

[2]  Collette Curry,et al.  The Implementation of a Story Telling Chatbot , 2012 .

[3]  Eric Steven Atwell,et al.  Different measurement metrics to evaluate a chatbot system , 2007, HLT-NAACL 2007.

[4]  Andrea Fronzetti Colladon,et al.  Identifying Tribes on Twitter Through Shared Context , 2019, Studies on Entrepreneurship, Structural Change and Industrial Dynamics.

[5]  H. Francis Song,et al.  Machine Theory of Mind , 2018, ICML.

[6]  J. Weizenbaum Computer Power And Human Reason: From Judgement To Calculation , 1978 .

[7]  Alun D. Preece,et al.  Asking 'Why' in AI: Explainability of intelligent systems - perspectives and challenges , 2018, Intell. Syst. Account. Finance Manag..

[8]  Kellie Morrissey,et al.  'Realness' in Chatbots: Establishing Quantifiable Criteria , 2013, HCI.

[9]  Justine Cassell,et al.  External manifestations of trustworthiness in the interface , 2000, CACM.

[10]  M. Tomasello,et al.  Does the chimpanzee have a theory of mind? 30 years later , 2008, Trends in Cognitive Sciences.

[11]  Eric Atwell,et al.  Using corpora in machine-learning chatbot systems , 2005 .

[12]  K. Dautenhahn,et al.  A Survey of Socially Interactive Robots : Concepts , Design , and Applications , 1992 .

[13]  H. Wimmer,et al.  Beliefs about beliefs: Representation and constraining function of wrong beliefs in young children's understanding of deception , 1983, Cognition.

[14]  Joseph Weizenbaum,et al.  and Machine , 1977 .

[15]  Jeffrey M. Voas,et al.  “Alexa, Can I Trust You?” , 2017, Computer.

[16]  N. Epley,et al.  The mind in the machine: Anthropomorphism increases trust in an autonomous vehicle , 2014 .

[17]  Justine Cassell,et al.  Relational agents: a model and implementation of building user trust , 2001, CHI.

[18]  Anik Das,et al.  Introduction to Chatbots , 2018 .

[19]  André L. M. Santos,et al.  Developing a Corporate Chatbot for a Customer Engagement Program: A Roadmap , 2018, ICIC.

[20]  Aleksandra Przegalinska,et al.  Wearable Technologies in Organizations , 2018 .

[21]  J. J. Dijkstra,et al.  On the use of computerised decision aids , 2006 .

[22]  Morgan C. Benton,et al.  Evaluating Quality of Chatbots and Intelligent Conversational Agents , 2017, ArXiv.

[23]  Krzysztof Wegner,et al.  The Necessity of New Paradigms in Measuring Human-Chatbot Interaction , 2017 .

[24]  GalaxyScope: Finding the “Truth of Tribes” on Social Media , 2018 .

[25]  Mariarosaria Taddeo,et al.  Trust in Technology: A Distinctive and a Problematic Relation , 2010 .

[26]  Slawomir Zadrozny,et al.  Computing With Words Is an Implementable Paradigm: Fuzzy Queries, Linguistic Data Summaries, and Natural-Language Generation , 2010, IEEE Transactions on Fuzzy Systems.

[27]  Andreas M. Kaplan,et al.  Siri, Siri, in my hand: Who’s the fairest in the land? On the interpretations, illustrations, and implications of artificial intelligence , 2019, Business Horizons.

[28]  Jordi Vallverdú,et al.  Can machines talk? Comparison of Eliza with modern dialogue systems , 2016, Comput. Hum. Behav..

[29]  Itamar Arel,et al.  Beyond the Turing Test , 2009, Computer.

[30]  Raja Parasuraman,et al.  The World is not Enough: Trust in Cognitive Agents , 2012 .

[31]  M. Schweitzer,et al.  Feeling and believing: the influence of emotion on trust. , 2003, Journal of personality and social psychology.

[32]  Digital anthropomorphism , 2015 .

[33]  A. Heinzl,et al.  Human Versus Machine: Contingency Factors of Anthropomorphism as a Trust-Inducing Design Strategy for Conversational Agents , 2018 .

[34]  J. Baron Rationality and Intelligence , 1985 .

[35]  Luis A. Guerrero,et al.  Alexa vs. Siri vs. Cortana vs. Google Assistant: A Comparison of Speech-Based Natural User Interfaces , 2017 .

[36]  Luiz Moutinho,et al.  Surf tribal behaviour: a sports marketing application , 2007 .

[37]  Asbjørn Følstad,et al.  An Initial Model of Trust in Chatbots for Customer Service - Findings from a Questionnaire Study , 2019, Interact. Comput..

[38]  Grzegorz Pochwatko,et al.  Polish Version of the Negative Attitude Toward Robots Scale (NARS-PL) , 2015, J. Autom. Mob. Robotics Intell. Syst..

[39]  Karl F. MacDorman,et al.  The Uncanny Valley [From the Field] , 2012, IEEE Robotics Autom. Mag..

[40]  James H. Moor,et al.  The Status and Future of the Turing Test , 2001, Minds and Machines.

[41]  Wlodek Zadrozny,et al.  Natural language dialogue for personalized interaction , 2000, CACM.

[42]  Peter A. Gloor,et al.  In the shades of the uncanny valley: An experimental study of human-chatbot interaction , 2018, Future Gener. Comput. Syst..

[43]  Bernard Cova,et al.  Tribal marketing: The tribalisation of society and its impact on the conduct of marketing , 2002 .

[44]  Natali Asher A Warmer Welcome : Application of a Chatbot as a Facilitator for New Hires Onboarding , 2017 .

[45]  Heloir,et al.  The Uncanny Valley , 2019, The Animation Studies Reader.

[46]  Amy J. C. Cuddy,et al.  A model of (often mixed) stereotype content: competence and warmth respectively follow from perceived status and competition. , 2002, Journal of personality and social psychology.

[47]  Dominique Méda The future of work: The meaning and value of work in Europe , 2016 .

[48]  Tina Klüwer,et al.  From Chatbots to Dialog Systems , 2011 .

[49]  Eric Atwell,et al.  Chatbots: Are they Really Useful? , 2007, LDV Forum.

[50]  Christopher Ré,et al.  Machine learning and deep analytics for biocomputing: Call for better explainability , 2018, PSB.

[51]  Sherwyn P. Morreale,et al.  Building the High-Trust Organization: Strategies for Supporting Five Key Dimensions of Trust , 2010 .

[52]  Robert M. French Moving beyond the Turing test , 2012, CACM.