PREDICTIVE CODING OF SPEECH USING ANALYSIS-BY-SYNTHESIS TECHNIQUES

This paper presents an overview of analysis-by-synthesis techniques used for low bit rate coding of speech signals. Analysis-by-synthesis procedures use linear predictors to remove the redundancies in the speech signal. The remaining difference signal is not quantized directly, but is replaced by an excitation signal that can be represented with a small number of bits. The selection of this signal is typically based on an exhaustive search procedure, in which the corresponding speech signal is constructed for each prototype excitation. The mean-squared error between the original and the reconstructed signal is used as the criterion for choosing the best excitation signal. In this paper, different excitation signals are discussed, as well as procedures for determining the various coder parameters. In addition, the paper discusses some recently proposed speech coding standards that are based on analysis-by-synthesis techniques.
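
The exhaustive search described above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the names `synthesize` and `search_codebook`, the random Gaussian codebook, the second-order predictor coefficients, the 40-sample frame, and the per-entry optimal gain. Practical coders typically also apply perceptual weighting to the error and account for filter memory from the previous frame, which this sketch omits.

```python
import random


def synthesize(excitation, lpc_coeffs):
    """All-pole synthesis filter: s[n] = e[n] + sum_k a[k] * s[n-1-k]."""
    out = []
    for n, e in enumerate(excitation):
        s = e
        for k, a in enumerate(lpc_coeffs):
            if n - 1 - k >= 0:
                s += a * out[n - 1 - k]
        out.append(s)
    return out


def search_codebook(target, codebook, lpc_coeffs):
    """Try every prototype excitation and keep the one whose synthesized
    output has the smallest mean-squared error against the target."""
    best_index, best_gain, best_err = None, 0.0, float("inf")
    for i, code in enumerate(codebook):
        y = synthesize(code, lpc_coeffs)
        energy = sum(v * v for v in y)
        if energy == 0.0:
            continue  # an all-zero entry cannot match anything
        # Assumption: each entry is scaled by its least-squares optimal gain.
        gain = sum(t * v for t, v in zip(target, y)) / energy
        err = sum((t - gain * v) ** 2 for t, v in zip(target, y)) / len(target)
        if err < best_err:
            best_index, best_gain, best_err = i, gain, err
    return best_index, best_gain, best_err


# Hypothetical setup: 64 random prototype excitations of 40 samples each.
rng = random.Random(0)
lpc = [0.9, -0.2]  # illustrative stable predictor coefficients
codebook = [[rng.gauss(0.0, 1.0) for _ in range(40)] for _ in range(64)]

# Build a target from entry 17 scaled by 2, then check the search recovers it.
target = synthesize([2.0 * v for v in codebook[17]], lpc)
index, gain, err = search_codebook(target, codebook, lpc)
print(index, gain, err)
```

Because the target is constructed from a codebook entry, the search should return that entry with a gain close to 2 and an error close to zero. With real speech the minimum error is nonzero, and the selected index and gain are the quantities transmitted in place of the difference signal.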
