Assessment of CELP codecs quality in multi-lingual environment

This paper investigates the performance of CELP speech codecs over different languages. The English language has had a dominating influence in the advance of telecommunications. With many of the major developments coming from primarily English speaking areas there is the risk that these advances may not be linguistically robust. It is noted that quality of a speech produced by voice codecs mainly is assessed using samples of English language. Some investigation show that language influence to codecs performance could be noticed. In order to judge the performance of the most popular CELP voice codecs (Speex and AMR), we encoded and decoded the speech samples from three different languages: English, Arabic and Lithuanian. The quality of transformed speech signals was estimated using two quality estimation algorithms 3SQM (ITU recommendations P.563) and PESQ (ITU recommendations P.862). The experiments results showed quality bias toward the English language-the scores were higher and the performance was more stable.

[1]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Ian S. Burnett,et al.  On the effects of accent and language on low rate speech coders , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3]  Vijay Parsa,et al.  Interaction of speech coders and atypical speech I: effects on speech intelligibility. , 2002, Journal of speech, language, and hearing research : JSLHR.

[4]  Joe F. Chicharo,et al.  Language-specific phonetic structure and the quantisation of the spectral envelope of speech , 2000, Speech Commun..

[5]  Šarūnas Paulikas,et al.  INFLUENCE OF LANGUAGES ON CELP CODECS PERFORMANCE , 2008 .

[6]  Vijay Parsa,et al.  Interaction of speech coders and atypical speech I: effects on speech intelligibility. , 2002 .

[7]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[8]  A. Uvliden,et al.  Adaptive multi-rate. A speech service adapted to cellular radio network quality , 1998, Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284).