Emotional Communication Robot Based on 3D Face Model and ASR Technology

Some robots can now express emotions, which makes human-robot interaction (HRI) more realistic. However, most current robots lack a realistic face, even though facial expressions play an important role in human communication. This paper therefore presents an emotional communication humanoid robot system built on a 3D face model and an automatic speech recognition (ASR) system. The ASR system performs Chinese speech recognition, and the collected audio data can also be reused in other research. To render facial expressions and pronunciation more accurately and realistically, real-life data captured with the OptiTrack system is used as a reference, and weighted Dirichlet free-form deformation (DFFD) is applied to deform the 3D face model. The ASR system uses an HMM-GMM acoustic model and an N-gram language model, with perceptual linear prediction (PLP) coefficients as the acoustic features.
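As an illustration of the N-gram language model component, the following minimal sketch trains a bigram model with add-one smoothing and scores token sequences. It is not the system described above: the function names `train_bigram` and `bigram_logprob`, the toy training sentences, and the choice of add-one smoothing are all assumptions made for this example.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count unigrams and bigrams over tokenized sentences (hypothetical helper)."""
    unigrams, bigrams = Counter(), Counter()
    for tokens in sentences:
        # Pad with sentence-boundary markers so start and end are modeled too.
        padded = ["<s>"] + tokens + ["</s>"]
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams


def bigram_logprob(tokens, unigrams, bigrams):
    """Log-probability of a token sequence under add-one smoothing (an assumption)."""
    vocab_size = len(unigrams)  # vocabulary includes the boundary markers
    padded = ["<s>"] + tokens + ["</s>"]
    total = 0.0
    for prev, cur in zip(padded, padded[1:]):
        # P(cur | prev) = (count(prev, cur) + 1) / (count(prev) + V)
        total += math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
    return total


# Toy corpus, invented for illustration only.
corpus = [["你", "好"], ["你", "好", "吗"]]
unigrams, bigrams = train_bigram(corpus)
seen_order = bigram_logprob(["你", "好"], unigrams, bigrams)
swapped_order = bigram_logprob(["好", "你"], unigrams, bigrams)
```

In a recognizer, a score of this kind is combined with the acoustic model score so that word sequences resembling the training text are preferred; here `seen_order` is higher than `swapped_order` because the first sequence appears in the toy corpus.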
