Paper information - An efficient stochastically excited linear predictive coding algorithm for high quality low bit rate transmission of speech

Abstract: The Stochastically Excited Linear Prediction (SELP) algorithm for speech coding offers good performance at bit rates as low as 4.8 kbit/s. Linear Predictive Coding (LPC) techniques remove the short-term correlation from the speech. A pitch loop removes long-term correlation, producing a noise-like residual, which is vector quantized. Information describing the LPC filter coefficients, the long-term predictor, and the vector quantization is transmitted. In this paper, we describe improvements to the SELP algorithm which result in better speech quality and higher computational efficiency. In its closed-loop form, the pitch loop can be interpreted as a vector quantization of the desired excitation signal with an adaptive codebook populated by previous excitation sequences. To better model the non-stationarity of speech, we extend this adaptive codebook with a special set of candidate vectors which are transforms of other codebook entries. The second-stage vector quantization is performed using a fixed stochastic codebook. In its original form, the SELP algorithm requires excessive computational effort. We employ a new recursive algorithm which performs a very fast search through the adaptive codebook. In this method, we modify the error criterion and exploit the resulting symmetries. The same fast vector quantization procedure is applied to the stochastic codebook.
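The closed-loop interpretation in the abstract (the pitch loop as a vector quantization over an adaptive codebook of previous excitation sequences) can be illustrated with a minimal sketch. This is a plain exhaustive search, not the fast recursive algorithm or the modified error criterion the paper describes. All names (`filtered`, `candidate_for_lag`, `search_adaptive_codebook`), the use of a truncated impulse response `h` for the synthesis filter, and the periodic repetition for lags shorter than the subframe are assumptions made for illustration.

```python
def filtered(candidate, h):
    """Zero-state response of a filter with impulse response h (assumed,
    truncated), limited to the length of the candidate vector."""
    n = len(candidate)
    return [
        sum(h[k] * candidate[i - k] for k in range(min(i, len(h) - 1) + 1))
        for i in range(n)
    ]


def candidate_for_lag(past, lag, n):
    """Adaptive codebook entry: n samples starting `lag` samples back in the
    past excitation. For lag < n the segment is repeated periodically, which
    is one common convention and an assumption here."""
    start = len(past) - lag
    return [past[start + (i % lag)] for i in range(n)]


def search_adaptive_codebook(past, target, h, min_lag, max_lag):
    """Exhaustive search: return (lag, gain, score) for the entry whose
    filtered, optimally scaled version is closest to the target."""
    n = len(target)
    best = (None, 0.0, 0.0)
    for lag in range(min_lag, max_lag + 1):
        y = filtered(candidate_for_lag(past, lag, n), h)
        energy = sum(v * v for v in y)
        if energy == 0.0:
            continue  # an all-zero candidate cannot reduce the error
        corr = sum(t * v for t, v in zip(target, y))
        # With the optimal gain corr / energy, the squared error equals
        # |target|^2 - corr^2 / energy, so maximizing the score minimizes it.
        score = corr * corr / energy
        if score > best[2]:
            best = (lag, corr / energy, score)
    return best
```

Under the same assumptions, a second stage over a fixed stochastic codebook could reuse the scoring loop by iterating over stored codebook vectors instead of lagged segments of the past excitation.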

W. Bastiaan Kleijn | Daniel J. Krasinski | Richard H. Ketchum

[1] Allen Gersho, et al. Complexity reduction methods for vector excitation coding, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] B. Atal, et al. Quantization procedures for the excitation in CELP coders, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Yair Shoham. Vector predictive quantization of the spectral parameters for low rate speech coding, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] Takehiro Moriya, et al. Transform coding of speech with weighted vector quantization, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Bishnu S. Atal, et al. A new model of LPC excitation for producing natural-sounding speech at low bit rates, 1982, ICASSP.

[6] V. Ramamoorthy, et al. Enhancement of ADPCM speech by adaptive postfiltering, 1984, AT&T Bell Laboratories Technical Journal.

[7] Antonio Cantoni, et al. Properties of the Eigenvectors of Persymmetric Matrices with Applications to Communication Theory, 1976, IEEE Trans. Commun.

[8] B. Atal, et al. Predictive coding of speech signals and subjective error criteria, 1979.

[9] Ed F. Deprettere, et al. Regular-pulse excitation - A novel approach to effective and efficient multipulse coding of speech, 1986, IEEE Trans. Acoust. Speech Signal Process.

[10] Isabel Trancoso, et al. Efficient procedures for finding the optimum innovation in stochastic coders, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.