User-Centered Design for a Voice Portal

After a brief overview of voice portal technology, with special attention paid to Polish, we discuss some aspects of user-centered design and its influence on usability of the proposed solution. We describe the issues of voice portal preparation on the example of Warsaw city transportation hotline and main components of the voice portal. The system is effective and supports users in their needs: about 30% of users complete their requests through the automated system without talking to human operators.

[1]  Krzysztof Marasek,et al.  Recognizing Connected Digit Strings Using Neural Networks , 2006, TSD.

[2]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[3]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[4]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[5]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Dilek Z. Hakkani-Tür,et al.  Spoken language understanding , 2008, IEEE Signal Processing Magazine.

[7]  Ryszard Gubrynowicz,et al.  Wizard of Oz Experiment for a Telephony-Based City Transport Dialog System , 2008 .

[8]  Alan W. Black,et al.  Generating F/sub 0/ contours from ToBI labels using linear regression , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[9]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[10]  Ryszard Gubrynowicz,et al.  Multi-level Annotation in SpeeCon Polish Speech Database , 2004, IMTCI.

[11]  Malgorzata Marciniak,et al.  Annotated Corpus of Polish Spoken Dialogues , 2007, LTC.

[12]  Ryszard Gubrynowicz,et al.  Design and Data Collection for Spoken Polish Dialogs Database , 2008, LREC.

[13]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[14]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[15]  Jon R. W. Yi,et al.  Corpus-based unit selection for natural-sounding speech synthesis , 2003 .

[16]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[17]  Zbigniew Michalewicz,et al.  Intelligent Media Technology for Communicative Intelligence, Second International Workshop, IMTCI 2004, Warsaw, Poland, September 13-14, 2004, Revised Selected Papers , 2005, IMTCI.