Evaluating Variable-Length Multiple-Option Lists in Chatbots and Mobile Search
[1] Jaana Kekäläinen, et al. IR evaluation methods for retrieving highly relevant documents, 2000, SIGIR Forum.
[2] Charles L. A. Clarke, et al. Modeling user variance in time-biased gain, 2012, HCIR '12.
[3] Ellen M. Voorhees, et al. Overview of the TREC 2002 Question Answering Track, 2003, TREC.
[4] Anselmo Peñas, et al. A Simple Measure to Assess Non-response, 2011, ACL.
[5] Zahra Ashktorab, et al. Resilient Chatbots: Repair Strategy Preferences for Conversational Breakdowns, 2019, CHI.
[6] Timothy Baldwin, et al. Quit While Ahead: Evaluating Truncated Rankings, 2016, SIGIR.
[7] Fabrizio Sebastiani, et al. An Axiomatically Derived Measure for the Evaluation of Classification Algorithms, 2015, ICTIR.
[8] Tony Russell-Rose, et al. Designing the search experience - the information architecture of discovery, 2012.
[9] Tetsuya Sakai, et al. Summaries, ranked retrieval and sessions: a unified framework for information access evaluation, 2013, SIGIR.
[10] Alistair Moffat, et al. Seven Numeric Properties of Effectiveness Metrics, 2013, AIRS.
[11] Enrique Amigó, et al. An Axiomatic Analysis of Diversity Evaluation Metrics: Introducing the Rank-Biased Utility Metric, 2018, SIGIR.
[12] Marco Aurélio Gerosa, et al. How Should My Chatbot Interact? A Survey on Social Characteristics in Human–Chatbot Interaction Design, 2019, Int. J. Hum. Comput. Interact.
[13] Francesco Caltagirone, et al. Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces, 2018, ArXiv.
[14] Tetsuya Sakai, et al. New Performance Metrics Based on Multigrade Relevance: Their Application to Question Answering, 2004, NTCIR.
[15] Paul Thomas, et al. Measuring the Utility of Search Engine Result Pages: An Information Foraging Based Measure, 2018, SIGIR.
[16] Geoffrey Zweig, et al. Fast and easy language understanding for dialog systems with Microsoft Language Understanding Intelligent Service (LUIS), 2015, SIGDIAL Conference.
[17] Nick Pawlowski, et al. Rasa: Open Source Language Understanding and Dialogue Management, 2017, ArXiv.
[18] Julio Gonzalo, et al. A general evaluation measure for document organization tasks, 2013, SIGIR.
[19] Alistair Moffat, et al. Rank-biased precision for measurement of retrieval effectiveness, 2008, TOIS.
[20] Alistair Moffat, et al. Desirable Properties for Diversity and Truncated Effectiveness Metrics, 2018, ADCS.
[21] Ruhi Sarikaya, et al. Exploiting shared information for multi-intent natural language sentence classification, 2013, INTERSPEECH.
[22] Asbjørn Følstad, et al. Chatbots and the new world of HCI, 2017, Interactions.
[23] Andrei Z. Broder, et al. The New Frontier of Web Search Technology: Seven Challenges, 2010, SeCO Workshop.
[24] Varvara Logacheva, et al. DeepPavlov: Open-Source Library for Dialogue Systems, 2018, ACL.
[25] Adrian Hernandez-Mendez, et al. Evaluating Natural Language Understanding Services for Conversational Question Answering Systems, 2017, SIGDIAL Conference.