Adaptive transform coding of speech signals

This paper discusses speech coding systems based upon transform coding (TC). It compares several transforms and shows that the cosine transform leads to a nearly optimum performance for almost all speech sounds. Various adaptive coding strategies are then investigated, and a coding scheme is proposed that is based on a nonadaptive discrete cosine transform (DCT), on an adaptive bit assignment, and on adaptive quantization. The adaptation is controlled by a short-term basis spectrum that is derived from the transform coefficients prior to coding and transmission and that is transmitted as side information to the receiver. The main result is that this adaptive transform coder performs better than all known nonpitch-tracking coding schemes; it extends the range of speech waveform coding to lower bit rates and closes the gap between vocoders and predictive waveform coders.

[1]  Richard Bellman,et al.  Introduction to Matrix Analysis , 1972 .

[2]  P. Schultheiss,et al.  Block Quantization of Correlated Gaussian Random Variables , 1963 .

[3]  L. M. Goodman Channel encoders , 1967 .

[4]  Barry J. Bunin Rate-distortion functions for Gaussian Markov processes , 1969 .

[5]  P. Wintz,et al.  Image Coding by Adaptive Block Quantization , 1971 .

[6]  G. Anderson,et al.  Piecewise Fourier Transformation for Picture Bandwidth Compression , 1971 .

[7]  S. Campanella,et al.  A Comparison of Orthogonal Transformations for Digital Speech Processing , 1971 .

[8]  L. Davisson Rate-distortion theory and application , 1972 .

[9]  P. Wintz Transform picture coding , 1972 .

[10]  F. Shum,et al.  Speech processing with Walsh-Hadamard transforms , 1973 .

[11]  Judea Pearl,et al.  On coding and filtering stationary signals by discrete Fourier transforms (Corresp.) , 1973, IEEE Trans. Inf. Theory.

[12]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[13]  Wen-Hsiung Chen,et al.  Slant Transform Image Coding , 1974, IEEE Trans. Commun..

[14]  P. Noll,et al.  Effects of channel errors on the signal-to-noise performance of speech-encoding systems , 1975, The Bell System Technical Journal.

[15]  P. Noll A comparative study of various quantization schemes for speech encoding , 1975, The Bell System Technical Journal.

[16]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, The Bell System Technical Journal.

[17]  José Tribolet,et al.  Frequency domain techniques for speech coding , 1978 .