A speech coder based on decomposition of characteristic waveforms

For low-rate speech coding it is advantageous to represent the speech signal as an evolving characteristic waveform (CW). The CW evolves slowly when the speech signal is clearly voiced and rapidly when the speech signal is clearly unvoiced. The voiced (periodic) and unvoiced (nonperiodic) components of the speech signal can be separated by a simple nonadaptive filter in the CW domain. Because of perceptual effects, a significant increase in coding efficiency is obtained by coding these two components separately. A 2.4 kb/s coder using these principles was developed. In an independent evaluation, the performance of the 2.4 kb/s waveform interpolation (WI) coder was found to be at least equivalent to the 4.8 kb/s FS1016 standard for all of the many tests.

[1]  R. J. Holbeche,et al.  A mixed prototype waveform/CELP coder for sub 3 kbit/s , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Allen Gersho,et al.  Real-time vector APC speech coding at 4800 bps with adaptive postfiltering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Yoshinori Tanaka,et al.  Low-bit-rate speech coding using a two-dimensional transform of residual signals and waveform interpolation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  W. Bastiaan Kleijn,et al.  Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[5]  Joseph P. Campbell,et al.  The Dod 4.8 Kbps Standard (Proposed Federal Standard 1016) , 1991 .

[6]  Gernot Kubin,et al.  Performance of noise excitation for unvoiced speech , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[7]  W.B. Kleijn,et al.  Transformation and decomposition of the speech signal for coding , 1994, IEEE Signal Processing Letters.

[8]  Petter Knagenhjelm,et al.  How good is your index assignment? , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Yair Shoham High-quality speech coding at 2.4 kbps based on time-frequency interpolation , 1993, EUROSPEECH.

[10]  Dik J. Hermes,et al.  Synthesis of breathy vowels: Some research methods , 1991, Speech Commun..

[11]  K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990 .

[12]  Willem Bastiaan Kleijn,et al.  Continuous representations in linear predictive coding , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[13]  S. D. Hansen,et al.  Improvements in 2.4 kbps high-quality speech coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  M. A. Kohler,et al.  Progress towards a new government standard 2400 bps voice coder , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.