CELP and sinusoidal coders: Two solutions for speech coding at 4.8–9.6 kbps

Abstract: This paper has two purposes: to review the coding methods introduced over the past decade for the 4.8–9.6 kbps range, and to discuss the most recent research trends. The bulk of the paper is devoted to CELP-based coding, which forms the basis of several emerging standards. The rest is a brief review of an alternative class of coders based on sinusoidal modelling of speech. Comparing these two contrasting techniques allows us to draw some conclusions and to identify areas for future research.
