From VoiceXML to multimodal mobile Apps: development of practical conversational interfaces

Speech Technologies and Language Processing have made possible the development of a number of new applications which are based on conversational interfaces. In this paper, we describe two approaches to bridge the gap between the academic and industrial perspectives in order to develop conversational interfaces using an academic paradigm for dialog management while employing the industrial standards. The advances in these technologies have made possible to extend the initial applications of conversational interfaces from only spoken interaction (for instance, by means of VoiceXML-based systems) to multimodal services by means of mobile devices (for instance, using the facilities provided by the Android OS). Our proposal has been evaluated with the successful development of different spoken and multimodal conversational interfaces.

[1]  Rakesh Gupta,et al.  Situated language understanding for a spoken dialog system within vehicles , 2015, Comput. Speech Lang..

[2]  James R. Glass,et al.  Developments and Directions in Speech Recognition and Understanding , Part 1 T , 2022 .

[3]  Ramón López-Cózar,et al.  A domain-independent statistical methodology for dialog management in spoken dialog systems , 2014, Comput. Speech Lang..

[4]  José Rouillard Web services and speech-based applications around VoiceXML , 2007, J. Networks.

[5]  David Griol,et al.  A statistical simulation technique to develop and evaluate conversational agents , 2013, AI Commun..

[6]  Sophie Rosset,et al.  Natural Interaction with Robots, Knowbots and Smartphones, Putting Spoken Dialog Systems into Practice , 2013 .

[7]  Juliane Hahn Human Centric Interfaces For Ambient Intelligence , 2016 .

[8]  Roberto Pieraccini The Voice in the Machine: Building Computers That Understand Speech , 2012 .

[9]  Arthur C. Graesser,et al.  Improving the Efficiency of Dialogue in Tutoring. , 2012 .

[10]  David Traum,et al.  The Information State Approach to Dialogue Management , 2003 .

[11]  Javier Bajo,et al.  A multi-agent architecture for distributed services and applications , 2009 .

[12]  Javier Bajo,et al.  REPLANNING MECHANISM FOR DELIBERATIVE AGENTS IN DYNAMIC CHANGING ENVIRONMENTS , 2008, Comput. Intell..

[13]  Wolfgang Minker,et al.  Stochastic versus rule-based speech understanding for information retrieval , 1998, Speech Commun..

[14]  Min-Jen Tsai The VoiceXML dialog system for the e-commerce ordering service , 2005, Proceedings of the Ninth International Conference on Computer Supported Cooperative Work in Design, 2005..

[15]  Grzegorz Pochwatko,et al.  From demonstration to theory in embodied language comprehension: A review , 2014, Cognitive Systems Research.

[16]  David Griol,et al.  A statistical approach to spoken dialog systems design and evaluation , 2008, Speech Commun..

[17]  Alexander I. Rudnicky,et al.  Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda , 2003, INTERSPEECH.

[18]  Zoraida Callejas,et al.  Voice Application Development for Android , 2013 .

[19]  Laila Dybkjær,et al.  Recent trends in discourse and dialogue , 2008 .

[20]  Oliver Lemon,et al.  REINFORCEMENT LEARNING OF DIALOGUE STRATEGIES WITH HIERARCHICAL ABSTRACT MACHINES , 2006, 2006 IEEE Spoken Language Technology Workshop.

[21]  Florian Metze,et al.  Language independent search in MediaEval's Spoken Web Search task , 2014, Comput. Speech Lang..

[22]  David Escudero Mancebo,et al.  From HTML to VoiceXML: A First Approach , 2002, TSD.

[23]  Darpa Speech Speech and natural language : proceedings of a workshop held at Cape Cod, Massachusetts, October 15-18, 1989 , 1990 .

[24]  Timothy W. Bickmore,et al.  Maintaining reality: Relational agents for antipsychotic medication adherence , 2010, Interact. Comput..

[25]  Wolfgang Minker,et al.  Spoken Dialogue Systems for Intelligent Environments , 2010, AmI 2010.

[26]  Jeremy Peckham,et al.  A new generation of spoken dialogue systems: results and lessons from the sundial project , 1993, EUROSPEECH.

[27]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[28]  Daniel Jurafsky,et al.  Generating Recommendation Dialogs by Extracting Information from User Reviews , 2013, ACL.

[29]  David Griol,et al.  An Agent-Based Dialog Simulation Technique to Develop and Evaluate Conversational Agents , 2011, PAAMS.