Paper information - Zara The Supergirl: An Empathetic Personality Recognition System

Zara the Supergirl is an interactive system that converses with a user and uses its built-in sentiment analysis, emotion recognition, and facial and speech recognition modules to respond in a human-like way by sharing emotions. In addition, at the end of a 5-10 minute conversation, it can give a comprehensive personality analysis based on the user’s interaction with Zara. It is a first prototype that incorporates a full empathy module, covering both the recognition of human emotions and the response to them, into a spoken-language interactive system that enhances human-robot understanding. Zara was shown at the World Economic Forum in Dalian in September 2015.
