Development of coffee maker service robot using speech and face recognition systems using POMDP

There are many development of intelligent service robot in order to interact with user naturally. This purpose can be done by embedding speech and face recognition ability on specific tasks to the robot. In this research, we would like to propose Intelligent Coffee Maker Robot which the speech recognition is based on Indonesian language and powered by statistical dialogue systems. This kind of robot can be used in the office, supermarket or restaurant. In our scenario, robot will recognize user’s face and then accept commands from the user to do an action, specifically in making a coffee. Based on our previous work, the accuracy for speech recognition is about 86% and face recognition is about 93% in laboratory experiments. The main problem in here is to know the intention of user about how sweetness of the coffee. The intelligent coffee maker robot should conclude the user intention through conversation under unreliable automatic speech in noisy environment. In this paper, this spoken dialog problem is treated as a partially observable Markov decision process (POMDP). We describe how this formulation establish a promising framework by empirical results. The dialog simulations are presented which demonstrate significant quantitative outcome.

[1]  Kazuhiro Nakadai,et al.  Sound source separation of moving speakers for robot audition , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Widodo Budiharto,et al.  Designing of Humanoid Robot with Voice Recognition Capability , 2015, ICIS 2015.

[3]  Sakari,et al.  Social Service Robots in Wellness and Restaurant Applications , 2013 .

[4]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[5]  Martin Hägele,et al.  Robotic home assistant Care-O-bot® 3 - product vision and innovation platform , 2009, 2009 IEEE Workshop on Advanced Robotics and its Social Impacts.

[6]  Milica Gasic,et al.  POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.

[7]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..