Speech-to-text translation by a non-word lexical unit based system

Speech understanding applications where a word based output of the uttered sentence is not needed, can benefit from the use of alternative lexical units. Experimental results from these systems show that the use of non-word lexical units bring us a new degree of freedom in order to improve the system performance (better recognition rate and lower size can be obtained in comparison to word based models). However, if the aim of the system is a speech-to-text translation, a post-processing stage must be included in order to convert the non-word sequences into word sentences. In this paper a technique to perform this conversion as well as an experimental test carried out over a task oriented Spanish corpus are reported. As a conclusion, we see that the whole speech-to-text system neatly outperforms the word-constrained baseline system.

[1]  Karmele López de Ipiña,et al.  Using non-word lexical units in automatic speech understanding , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[2]  M. Inés Torres,et al.  K-TLSS(S) language models for speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Klaus Ries,et al.  An automatic method for learning a Japanese lexicon for recognition of spontaneous speech , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[4]  Kyuwoong Hwang Vocabulary optimization based on perplexity , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Petra Geutner,et al.  Using morphology towards better large-vocabulary speech recognition systems , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.