A question-and-answer classification technique for constructing and managing spoken dialog system

To recognize user speech accurately and respond to it appropriately, a spoken dialog system usually uses a question-and-answer database (QADB) which contains many question-and-answer pairs. The systems first select a question example which is the most similar to the recognition result for the input voice from the database. An answer sentence which is then paired with the selected question example is output to the user. Many systems have a large database to enable a more appropriate answer to be output. However, when such a database is used, the waiting time increases because the system needs to find the most appropriate question example from a vast number of question examples. We propose a method of classifying the queries in the QADB. By classifying question examples into some clusters using pLSA, an appropriate question example can be found more quickly than when using the conventional method. We evaluated the validity of our proposed method by changing various parameters.

[1]  Tatsuya Kawahara,et al.  Speech-Based Interactive Information Guidance System using Question-Answering Technique , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[2]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[3]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[4]  Toshiyuki Takezawa,et al.  An interaction mechanism of multimodal dialogue systems , 2002, Systems and Computers in Japan.

[5]  Anton Leuski,et al.  Practical Language Processing for Virtual Humans , 2010, IAAI.

[6]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[7]  Kiyohiro Shikano,et al.  Public speech-oriented guidance system with adult and child discrimination capability , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Kiyohiro Shikano,et al.  Question and answer database optimization using speech recognition results , 2008, INTERSPEECH.