Paper information - On the performance of ITU-T G.723.1 and AMR-NB codecs for large vocabulary distributed speech recognition in Brazilian Portuguese

In this paper, we report the recognition accuracy of large vocabulary distributed continuous speech recognition systems operating over the ITU-T G.723.1 and AMR-NB speech codecs. Experiments were conducted using speech recognition features derived from LPC and LSF parameters, CDHMM acoustic models, triphone units, and trigram language models for Brazilian Portuguese.

Abraham Alcaim | Vladimir Fabregas Surigue de Alencar

[1] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences , 1980 .

[2] Hwang Soo Lee,et al. On approximating line spectral frequencies to LPC cepstral coefficients , 2000, IEEE Trans. Speech Audio Process..

[3] Steve Young,et al. The HTK book , 1995 .

[4] A. Alcaim,et al. Digital filter interpolation of decoded LSFs for distributed continuous speech recognition , 2008 .

[5] Yoshiaki Ohshima,et al. Environmental robustness in speech recognition using physiologically-motivated signal processing , 1993 .

[6] Thomas W. Parks,et al. New results in the design of digital interpolators , 1975, IEEE Trans. Acoust. Speech Signal Process..

[7] Redwan Salami,et al. A toll quality 8 kb/s speech codec for the personal communications system (PCS) , 1994 .

[8] Kuldip K. Paliwal,et al. Speech Coding and Synthesis , 1995 .

[9] Hong Kook Kim,et al. A bitstream-based front-end for wireless speech recognition on IS-136 communications system , 2001, IEEE Trans. Speech Audio Process..

[10] Bishnu S. Atal,et al. A new model of LPC excitation for producing natural-sounding speech at low bit rates , 1982, ICASSP.

[11] Abraham Alcaim,et al. Transformations of LPC and LSF Parameters to Speech Recognition Features , 2005, ICAPR.

[12] Hwang Soo Lee,et al. Speech recognition using quantized LSP parameters and their transformations in digital communication , 2000, Speech Commun..

[13] Abraham Alcaim,et al. Features interpolation domain for distributed speech recognition and performance for ITU-T G.723.1 codec , 2007, INTERSPEECH.