A low delay 16 kb/s speech coder

A code tree generated by a stochastically populated innovations tree with a backward adaptive gain and backward adaptive synthesis filters is considered. The synthesis configuration uses a cascade of two all-pole filters: a pitch (long time delay) filter followed by a formant (short time delay) filter. Both filters are updated using backward adaptation. The formant predictor is updated using an adaptive lattice algorithm. The multipath (M, L) search algorithm is used to encode the speech. A frequency-weighted error measure is used to reduce the perceptual loudness of the quantization noise. The addition of the pitch filter gives 2-10-dB increase in segSNR (segmental signal-to-noise ratio) in the voiced segments. Subjective testing has shown that the coder attains a subjective quality equivalent to 7 b/sample log-PCM (pulse code modulation) with an encoding delay of eight samples (1 ms with an 8-kHz sampling rate). >

[1]  John B. Anderson,et al.  Instrumentable tree encoding of information sources (Corresp.) , 1971, IEEE Trans. Inf. Theory.

[2]  N. Jayant Adaptive quantization with a one-word memory , 1973 .

[3]  John B. Anderson,et al.  Tree encoding of speech , 1975, IEEE Trans. Inf. Theory.

[4]  Nuggehally Sampath Jayant,et al.  Tree-Encoding of Speech Using the (M, L)-Algorithm and Adaptive Quantization , 1978, IEEE Trans. Commun..

[5]  Stephen G. Wilson,et al.  Adaptive Tree Encoding of Speech at 8000 Bits/s with a Frequency-Weighted Error Criterion , 1979, IEEE Trans. Commun..

[6]  B. Atal,et al.  Predictive coding of speech signals and subjective error criteria , 1979 .

[7]  John Makhoul,et al.  Adaptive lattice analysis of speech , 1981 .

[8]  P. Noll,et al.  Multipath Search Coding of Stationary Signals with Applications to Speech , 1982, IEEE Trans. Commun..

[9]  Robert M. Gray,et al.  The Design of Predictive Trellis Waveform Coders Using the Generalized Lloyd Algorithm , 1986, IEEE Trans. Commun..

[10]  Allen Gersho,et al.  Real-time vector APC speech coding at 4800 bps with adaptive postfiltering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  P. Kabal,et al.  A low delay 16 kbits/sec speech coder , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[12]  Peter Kabal,et al.  Pitch prediction filters in speech coding , 1989, IEEE Trans. Acoust. Speech Signal Process..

[13]  Michael W. Marcellin,et al.  Predictive trellis coded quantization of speech , 1990, IEEE Trans. Acoust. Speech Signal Process..