A rationale is advanced for digitally coding speech signals in terms of sub-bands of the total spectrum. The approach provides a means for controlling and reducing quantizing noise in the coding. Each sub-band is quantized with an accuracy (bit allocation) based upon perceptual criteria. As a result, the quality of the coded signal is improved over that obtained from a single full-band coding of the total spectrum. In one implementation, the individual sub-bands are low-pass translated before coding. In another, “integer-band” sampling is employed to alias the signal in an advantageous way before coding. Other possibilities extend to complex demodulation of the sub-bands, and to representing the sub-band signals in terms of envelopes and phase-derivatives. In all techniques, adaptive quantization is used for the coding, and a parsimonious allocation of bits is made across the bands. Computer simulations are made to demonstrate the signal qualities obtained for codings at 16 and 9.6 kb/s.
[1]
L.L. Beranek,et al.
The Design of Speech Communication Systems
,
1947,
Proceedings of the IRE.
[2]
N. Jayant.
Digital coding of speech waveforms: PCM, DPCM, and DM quantizers
,
1974
.
[3]
N. S. Jayant.
Step-size transmitting differential coders for mobile telephony
,
1975,
The Bell System Technical Journal.
[4]
L. Rabiner,et al.
Optimum FIR Digital Filter Implementations for Decimation, Interpolation, and Narrow-Band Filtering
,
1975
.
[5]
K. D. Kryter.
Methods for the Calculation and Use of the Articulation Index
,
1962
.
[6]
N. Jayant.
Adaptive quantization with a one-word memory
,
1973
.
[7]
James L. Flanagan,et al.
Adaptive quantization in differential PCM coding of speech
,
1973
.
[8]
J. L. Flanagan,et al.
PHASE VOCODER
,
2008
.
[9]
W. R. Bennett,et al.
Band width and transmission performance
,
1949,
Bell Syst. Tech. J..