A Low-Delay CELP Coder for the CCITT 16 kb/s Speech Coding Standard

A low-delay code-excited linear prediction (LD-CELP) speech coder which is expected to be standardized in 1992 as a CCITT G Series Recommendation for universal applications of speech coding at 16 kb/s is presented. The coder achieves a one-way coding delay of less than 2 ms by making both the LPC predictor and the excitation gain backward-adaptive and by using a small excitation vector size of five samples. The official CCITT laboratory tests revealed that the speech quality of this 16 kb/s LD-CELP coder is either equivalent to or better than that of the CCITT G.721 standard 32-kb/s ADPCM coder for almost all conditions tested. A description of the LD-CELP algorithm, its implementation on the DSP32C for CCITT testing, and performance results from these tests are presented. >

[1]  N. Jayant Adaptive quantization with a one-word memory , 1973 .

[2]  David J. Goodman,et al.  A Robust Adaptive Quantizer , 1975, IEEE Trans. Commun..

[3]  P. Noll,et al.  Adaptive transform coding of speech signals , 1977 .

[4]  Thomas P. Barnwell,et al.  Recursive windowing for generating autocorrelation coefficients for LPC analysis , 1981 .

[5]  Bishnu S. Atal,et al.  A new model of LPC excitation for producing natural-sounding speech at low bit rates , 1982, ICASSP.

[6]  Bishnu S. Atal Predictive Coding of Speech at Low Bit Rates , 1982, IEEE Trans. Commun..

[7]  V. Ramamoorthy,et al.  Enhancement of ADPCM speech by adaptive postfiltering , 1984, AT&T Bell Laboratories Technical Journal.

[8]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Allen Gersho,et al.  Gain-adaptive vector quantization for medium-rate speech coding , 1985 .

[10]  Joseph P. Campbell,et al.  Voiced/Unvoiced classification of speech with applications to the U.S. government LPC-10E algorithm , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Y. Yatsuzuka,et al.  A variable rate coding by APC with maximum likelihood quantization from 4.8 kbits/s to 16 kbits/s , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[13]  Nuggehally Sampath Jayant,et al.  Adaptive postfiltering of 16 kb/s-ADPCM speech , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Isabel Trancoso,et al.  Efficient procedures for finding the optimum innovation in stochastic coders , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Frank K. Soong,et al.  A high quality subband speech coder with backward adaptive predictor and optimal time-frequency bit assignment , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  Yukou Mochida,et al.  A 16 kbps ADPCM with multi-quantizer (ADPCM-MQ) codec and its implementation by digital signal processor , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  K. Zeger,et al.  Zero redundancy channel coding in vector quantisation , 1987 .

[18]  Allen Gersho,et al.  Gain-Adaptive Vector Quantization with Application to Speech Coding , 1987, IEEE Trans. Commun..

[19]  J.-H. Chen Low-bit-rate predictive coding of speech waveforms based on vector quantization , 1987 .

[20]  Man Mohan Sondhi,et al.  Enhancement of ADPCM speech coding with backward-adaptive algorithms for postfiltering and noise feedback , 1988, IEEE J. Sel. Areas Commun..

[21]  Jerry D. Gibson,et al.  Backward adaptive tree coding of speech at 16 kbps , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[22]  V. Cuperman,et al.  A vector ADPCM analysis-by-synthesis configuration for 16 kbit/s speech coding , 1988, IEEE Global Telecommunications Conference and Exhibition. Communications for the Information Age.

[23]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[24]  Michael W. Marcellin,et al.  Predictive trellis coded quantization of speech , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[25]  Yair Shoham,et al.  New directions in subband coding , 1988, IEEE J. Sel. Areas Commun..

[26]  P. Kabal,et al.  A low delay 16 kbits/sec speech coder , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[27]  Joseph P. Campbell,et al.  An expandable error-protected 4800 bps CELP coder (US Federal Standard 4800 bps voice coder) , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[28]  V. Cuperman,et al.  Backward pitch prediction for low-delay speech coding , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[29]  Jerry D. Gibson,et al.  Fractional rate multi-tree speech coding , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[30]  Jianfeng Chen,et al.  A robust low-delay CELP speech coder at 16 kbits/s , 1989 .

[31]  J.J. Shynk,et al.  Backward adaptation for low delay vector excitation coding of speech at 16 kbit/s , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[32]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[33]  Claude Galand,et al.  A 2 ms-delay adaptive code excited linear predictive coder , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[34]  Juin-Hwey Chen,et al.  Real-time implementation and performance of a 16 kb/s low-delay CELP speech coder , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[35]  Juin-Hwey Chen,et al.  High-quality 16 kb/s speech coding with a one-way delay less than 2 ms , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[36]  I. A. Gerson,et al.  Vector sum excited linear prediction (VSELP) speech coding at 8 kbps , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[37]  Y.-C. Lin,et al.  A fixed-point 16 kb/s LD-CELP algorithm , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[38]  T. Fischer,et al.  A Trellis-Searched 16 Kbit/Sec Speech Coder with Low-Delay , 1991 .

[39]  Peter Kabal,et al.  Low-delay CELP and tree coders: comparison and performance improvements , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[40]  V. Cuperman,et al.  Variable-rate low-delay analysis-by-synthesis speech coding at 8-16 kb/s , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[41]  Nuggehally Sampath Jayant,et al.  Improving the performance of the 16 kb/s LD-CELP speech coder , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.