Open-World Dialog: Challenges, Directions, and a Prototype

We present an investigation of open-world dialog, centering on systems that can carry on conversational dialog in an open-world context, where multiple people with different needs, goals, and long-term plans may enter, interact, and leave an environment. We outline and discuss a set of challenges and core competencies required for supporting the kind of fluid multiparty interaction that people expect when conversing and collaborating with other people. We then take as a concrete example the challenges faced by receptionists who field requests at the entrances to corporate buildings. We review the subtleties and difficulties of creating an automated receptionist that can work with people to address their needs with the ease and etiquette expected from a human receptionist. Finally, we review details of the construction and operation of a working prototype.
