Comparison of sensibilities of Japanese and Koreans in recognizing emotions from speech by using Bayesian networks

The paper describes a comparison of the sensibility of recognizing emotions from human voices speaking Japanese and Korean. Our study focuses on the emotional elements included in the human voice, and our method uses Bayesian networks of prosodic features as models of Japanese's and Korean's sensibilities in recognizing emotions. The training datasets are prosodic features extracted from emotionally expressive voice data in the two languages. Our method makes the Bayesian network learn the dependence and its strength between nonverbal voice features and its emotion. We compare the sensibilities of emotion recognition from Japanese and Koreans speech by examining the cross-inference through two Bayesian networks with speech in the other language.

[1]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[2]  K. Scherer,et al.  Emotion Inferences from Vocal Expression Correlate Across Languages and Cultures , 2001 .

[3]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[4]  Shohei Kato,et al.  Bayesian-Based Inference of Dialogist's Emotion for Sensitivity Robots , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[5]  K. Scherer,et al.  Vocal expression of emotion. , 2003 .

[6]  Hozumi Tanaka,et al.  A Bayesian Approach for User Modeling in Dialogue Systems , 1994, COLING.

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Gregory F. Cooper,et al.  A Bayesian Method for Constructing Bayesian Belief Networks from Databases , 1991, UAI.

[9]  Wolfram Burgard,et al.  Experiences with an Interactive Museum Tour-Guide Robot , 1999, Artif. Intell..

[10]  Cynthia Breazeal,et al.  Designing sociable robots , 2002 .

[11]  R. Brooks,et al.  The cog project: building a humanoid robot , 1999 .

[12]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[13]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[14]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[15]  Say Wei Foo,et al.  Speech emotion recognition using hidden Markov models , 2003, Speech Commun..

[16]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[17]  Wolfram Burgard,et al.  The Interactive Museum Tour-Guide Robot , 1998, AAAI/IAAI.

[18]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[19]  Max Henrion,et al.  Propagating uncertainty in bayesian networks by probabilistic logic sampling , 1986, UAI.

[20]  P. Ekman Unmasking The Face , 1975 .

[21]  Yoshinori Kitahara,et al.  Prosodic Control to Express Emotions for Man-Machine Speech Interaction , 1992 .

[22]  Albino Nogueiras,et al.  Speech emotion recognition using hidden Markov models , 2001, INTERSPEECH.

[23]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[24]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..