As a practical information guidance system, we have been developing a speech-oriented system named "Takemaru-kun". The system has been operated on a public space since Nov. 2002. The system answers to user's question about the hall facilities, sightseeing, transportation, weather information around the city, etc. All triggered inputs to the system have been recorded since the operation started. And all system inputs during 22 months are manually transcribed and labelled for speakers gender and age category. In this paper, we conduct a long-term prosody analysis of user speech to find a clue to obtain users attitude from a users speech. In this preliminary analysis, it is observed that F0 decreases regardless of age and gender category when the stability of the dialogue system is not established.
[1]
Kiyohiro Shikano,et al.
Public speech-oriented guidance system with adult and child discrimination capability
,
2004,
2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[2]
Roy D. Patterson,et al.
Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
,
1999,
EUROSPEECH.
[3]
Kiyohiro Shikano,et al.
Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs
,
2004,
INTERSPEECH.
[4]
Kiyohiro Shikano,et al.
Operating a public spoken guidance system in real environment
,
2005,
INTERSPEECH.