Paper information - Prosodic focus control in reply speech generation for a spoken dialogue system of information retrieval

A spoken dialogue system for information retrieval on academic documents has been developed with special attention to reply speech generation. To produce reply speech whose prosodic features are properly controlled to express dialogue focus, a scheme was developed for generating the reply speech directly from the reply content. In the initial development, priority was placed on automatic processing, and prosodic focus was controlled by rather simple rules (the original rules). New rules were then developed based on a listening test of reply speech generated with the original rules. After a further listening test, these rules were revised (the revised rules). The validity of the revised rules was verified through an evaluation experiment. The results also indicated that users have preferences regarding the intonation of the reply speech.
