On the performance of ITU-T G.723.1 and AMR-NB codecs for large vocabulary distributed speech recognition in Brazilian Portuguese

In this paper, we present the accuracy for large vocabulary distributed continuous speech recognition systems over ITU-T G.723.1 and AMRNB speech codecs. Experiments were conducted using LPC and LSF-derived speech recognition features, CDHMM acoustic models, triphone units and trigram language models for the Brazilian Portuguese.

[1]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[2]  Hwang Soo Lee,et al.  On approximating line spectral frequencies to LPC cepstral coefficients , 2000, IEEE Trans. Speech Audio Process..

[3]  Steve Young,et al.  The HTK book , 1995 .

[4]  A. Alcaim,et al.  Digital filter interpolation of decoded LSFs for distributed continuous speech recognition , 2008 .

[5]  Yoshiaki Ohshima,et al.  Environmental robustness in speech recognition using physiologically-motivated signal processing , 1993 .

[6]  Thomas W. Parks,et al.  New results in the design of digital interpolators" ieee trans , 1975 .

[7]  Redwan Salami,et al.  A toll quality 8 kb/s speech codec for the personal communications system (PCS) , 1994 .

[8]  Kuldip K. Paliwal,et al.  Speech Coding and Synthesis , 1995 .

[9]  Hong Kook Kim,et al.  A bitstream-based front-end for wireless speech recognition on IS-136 communications system , 2001, IEEE Trans. Speech Audio Process..

[10]  Bishnu S. Atal,et al.  A new model of LPC excitation for producing natural-sounding speech at low bit rates , 1982, ICASSP.

[11]  Abraham Alcaim,et al.  Transformations of LPC and LSF Parameters to Speech Recognition Features , 2005, ICAPR.

[12]  Hwang Soo Lee,et al.  Speech recognition using quantized LSP parameters and their transformations in digital communication , 2000, Speech Commun..

[13]  Abraham Alcaim,et al.  Features interpolation domain for distributed speech recognition and performance for ITU-t g.723.1 CODEC , 2007, INTERSPEECH.