Very large vocabulary voice dictation for mobile devices

This paper deals with optimization techniques that make very large vocabulary voice dictation applications deployable on recent mobile devices. We focus in particular on optimizing signal parameterization (frame rate, FFT calculation, fixed-point representation) and on efficient pruning techniques applied at the state and Gaussian mixture levels. We demonstrate the applicability of the proposed techniques on the practical design of an embedded 255K-word discrete dictation program developed for Czech. Its real performance is comparable to that of a client-server version of the fluent dictation program implemented on the same mobile device.

Jan Nouza | Petr Cerva | Jindrich Zdánský

[1] Jan Silovský, et al. Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak), 2009, COST 2102 Training School.

[2] Alexander I. Rudnicky, et al. Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices, 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[3] Jan Nouza, et al. Design and development of voice controlled aids for motor-handicapped persons, 2007, INTERSPEECH.

[4] Richard C. Rose, et al. Efficient client-server based implementations of mobile speech recognition services, 2006, Speech Commun.

[5] Guo-Hong Ding, et al. A decoder for large vocabulary continuous short message dictation on embedded devices, 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6] Xia Wang, et al. Mandarin short message dictation on Symbian series 60 mobile phones, 2007, Mobility '07.

[7] Shuang Xu, et al. User Expectations from Dictation on Mobile Devices, 2007, HCI.

[8] Jordan Cohen, et al. Embedded speech recognition applications in mobile phones: Status, trends, and challenges, 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9] Sebastian Stüker, et al. Rapid porting of ASR-systems to mobile devices, 2005, INTERSPEECH.