Robust Speech Understanding for Multi-Modal Human-Robot Communication

Modeling complex human-robot interaction requires researchers not only to consider different tasks but also to handle the interplay among the modules of a single robot system. In our work, we built a robot assistant that is integrated into a home or office environment. We allow a fairly natural communication style: users communicate by speech, but may also use gestures and draw on contextual scene knowledge. Against this background, this paper presents a robust speech understanding component for situated human-robot communication. It serves as the interface between speech recognition and dialog management. To increase the robustness of speech processing, it rates the speech recognition output by its semantic coherence. Even if the recognized word stream is not grammatically correct, the speech understanding component provides dialog management with semantic interpretations in the context of multi-modal input. For the understanding process, we designed special semantic concepts grounded in the domain of situated communication. These concepts also provide additional information about the dialog act. A processing mechanism uses these concept units to generate the most likely semantic interpretation of the utterances.
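To make the idea of rating recognizer output by semantic coherence more concrete, the following is a minimal sketch and not the paper's actual algorithm. Everything in it is an assumption made for illustration: the `Concept` structure, the toy `LEXICON`, the `coherence` scoring formula (lexical coverage multiplied by the fraction of filled argument slots), and the `pointed_object` parameter standing in for gesture input.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Concept:
    """Hypothetical semantic concept unit (not the paper's definition)."""
    name: str
    category: str                      # e.g. "action", "object", "deictic"
    expects: tuple = ()                # categories this concept wants filled
    dialog_act: Optional[str] = None   # optional hint about the dialog act


# Toy lexicon, invented for this example only.
LEXICON = {
    "take": Concept("grasp", "action", ("object",), "instruction"),
    "show": Concept("show", "action", ("object",), "instruction"),
    "cup": Concept("cup", "object"),
    "book": Concept("book", "object"),
    "this": Concept("this", "deictic"),
}


def coherence(words: Sequence[str], pointed_object: Optional[str] = None) -> float:
    """Score a word sequence by how well its concepts fit together.

    No grammar is consulted, so ungrammatical input can still score well.
    """
    if not words:
        return 0.0
    concepts = [LEXICON[w] for w in words if w in LEXICON]
    if not concepts:
        return 0.0
    # Share of recognized words that map to a known concept.
    coverage = len(concepts) / len(words)
    available = {c.category for c in concepts}
    # A deictic word counts as an object only if a gesture resolves it.
    if pointed_object is not None and "deictic" in available:
        available.add("object")
    expected = [e for c in concepts for e in c.expects]
    filled = sum(e in available for e in expected)
    slot_score = filled / len(expected) if expected else 1.0
    return coverage * slot_score


def best_hypothesis(n_best, pointed_object: Optional[str] = None):
    """Pick the recognizer hypothesis with the highest coherence score."""
    return max(n_best, key=lambda ws: coherence(ws, pointed_object))


if __name__ == "__main__":
    hypotheses = [["tape", "the", "cap"], ["take", "the", "cup"]]
    print(best_hypothesis(hypotheses))
    print(coherence(["take", "this"]), coherence(["take", "this"], "cup"))
```

In this sketch, an utterance such as "take this" only becomes coherent once a pointing gesture supplies the missing object, which mirrors the role the abstract assigns to multi-modal input.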
