Design of a Variable Rate Algorithm for the CS-ACELP Coder

In 1995, 8 kb/s CS-ACELP coder of G.729 is standardized by ITU-T SG15 and it has been reported that the speech quality of G.729 is better than or equal to that of 32 kb/s ADPCM (G.726). However G.729 is the fixed rate speech coder, and it does not consider the property of voice activity in mutual conversation. If we use the voice activity, we can reduce the average bit rate in half without any degradations of the speech quality. In this paper, we propose an efficient variable rate algorithm for G.729. The variable rate algorithm consists of two main subjects, the rate determination algorithm and the design of sub rate coders. For the robust VAD algorithm, we combine the energy-thresholding method, the phonetic segmentation method by integration of various feature parameters obtained through the analysis procedure, and the variable hangover period method. Through the analysis of noise features, the 1 kb/s sub rate coder is designed for coding the background noise signal. Also, we design the 4 kb/s sub rate coder for the unvoiced parts. The performance of the variable rate algorithm is evaluated by the comparison of speech quality and average bit rate with G.729. Subjective quality test is also done by MOS test. Conclusively, it is verified that the proposed variable rate CS-ACELP coder produces the same speech quality as G.729, at the average bit rate of 4.4 kb/s. key words: speech coding, variable rate coding, CELP, VAD, conjugate structure, algebraic code

[1]  Bishnu S. Atal,et al.  Stochastic coding of speech signals at very low bit rates: The importance of speech perception , 1985, Speech Commun..

[2]  Kuldip K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1993, IEEE Trans. Speech Audio Process..

[3]  Nariman Farvardin,et al.  Quantizer design in LSP speech analysis-synthesis , 1988, IEEE J. Sel. Areas Commun..

[4]  P. Usai Subjective performance of the proposed ITU-T 8kb/s speech coding standard , 1995 .

[5]  Allen Gersho,et al.  Variable rate speech coding with phonetic segmentation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Shinji Hayashi,et al.  LSP and Gain Quantization for the Proposed ITU-T 8-kb/s Speech Coding , 1995, Proceedings. IEEE Workshop on Speech Coding for Telecommunications.

[7]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[8]  P. Mabilleau,et al.  16 kbps wideband speech coding technique based on algebraic CELP , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[9]  E. Shlomot,et al.  ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications , 1997, IEEE Commun. Mag..

[10]  S. Hayashi,et al.  Conjugate structure celp coder for the ccitt 8-kbit/s standardization candidate , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[11]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[12]  Redwan Salami,et al.  ITU-T G.729 Annex A: reduced complexity 8 kb/s CS-ACELP codec for digital simultaneous voice and data , 1997, IEEE Commun. Mag..