Mandarin digit recognition assisted by selective tone distinction

Continuous Mandarin digit recognition is an important function to provide a useful user interface for in-car applications. In this paper, as opposed to the conventional N-best rescoring, we propose a direct modification approach on the 1-best hypothesis of recognition results using selective tone distinction. Experiments were performed on noisy speech at SNRs of 20dB and 9dB. Over the baseline without using tone information, our proposal achieved error reductions of 24%~27% for both SNRs, which is significantly better than the error reduction of 10-best rescoring. Moreover, the relatively constant error reduction seen in wide-ranging SNR demonstrates the robustness of our proposal.

[1]  Sadaoki Furui,et al.  Noise robust speech recognition using F0 contour extracted by hough transform , 2002, INTERSPEECH.

[2]  Sin-Horng Chen,et al.  Tone recognition of continuous Mandarin speech based on neural networks , 1995, IEEE Trans. Speech Audio Process..

[3]  Jiahong Yuan,et al.  A corpus study of the 3rd tone sandhi in standard Chinese , 2007, INTERSPEECH.

[4]  Yi Xu Contextual tonal variations in Mandarin , 1997 .

[5]  Jinyu Li,et al.  A study on knowledge source integration for candidate rescoring in automatic speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6]  Wayne H. Ward,et al.  Dialog-context dependent language modeling combining n-grams and stochastic context-free grammars , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  Keikichi Hirose,et al.  Tone nucleus modeling for Chinese lexical tone recognition , 2004, Speech Commun..

[8]  Keikichi Hirose,et al.  Latent Prosody Model of Continuous Mandarin Speech , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[9]  Tan Lee,et al.  Using tone information in Cantonese continuous speech recognition , 2002, TALIP.