Very low complexity interpolative speech coding at 1.2 to 2.4 kbps

The recently-introduced waveform interpolation (WI) coders provide good-quality speech at low rates but may be too complex for commercial use. This paper proposes new approaches to low-complexity WI speech coding at rates of 1.2 and 2.4 kbps. The proposed coders are 4 to 5 times faster than the previously reported ones. At 2.4 kbps, the complexity is about 7.5 and 2.5 MFLOPS for the encoder and decoder, respectively. At 1.2 kbps, the complexity is about 6 and 2.3 MFLOPS for the encoder and decoder, respectively. Informal subjective evaluation shows that, at 2.4 kbps, the quality is close to that of the high-complexity coders. The quality does not significantly degrade at 1.2 kbps and it is considered sufficient for messaging applications.

[1]  Ian S. Burnett,et al.  New techniques for multi-prototype waveform coding at 2.84 kb/s , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[2]  Allen Gersho,et al.  Adaptive postfiltering for quality enhancement of coded speech , 1995, IEEE Trans. Speech Audio Process..

[3]  Michael Unser,et al.  B-spline signal processing. I. Theory , 1993, IEEE Trans. Signal Process..

[4]  Hsieh Hou,et al.  Cubic splines for image interpolation and digital filtering , 1978 .

[5]  Ian S. Burnett,et al.  Quantisation Techniques for Prototype Waveforms , 1996, Fourth International Symposium on Signal Processing and Its Applications.

[6]  Yair Shoham,et al.  A low-complexity waveform interpolation coder , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  W. Bastiaan Kleijn,et al.  A speech coder based on decomposition of characteristic waveforms , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Ali N. Akansu,et al.  Simple fast vector quantization of the line spectral frequencies , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[9]  Yair Shoham High-quality speech coding at 2.4 to 4.0 kbit/s based on time-frequency interpolation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Akram Aldroubi,et al.  B-SPLINE SIGNAL PROCESSING: PART I-THEORY , 1993 .

[11]  Yair Shoham High-quality speech coding at 2.4 kbps based on time-frequency interpolation , 1993, EUROSPEECH.

[12]  Akram Aldroubi,et al.  B-spline signal processing. II. Efficiency design and applications , 1993, IEEE Trans. Signal Process..