Non-linear techniques for pitch and waveform enhancement in PWI coders

Two non-linear interpolation techniques are introduced for enhancing speech reproduction in prototype waveform interpolation (PWI) and similar encoders. A temporal differential rate (TDR) vector is used to characterise the non-uniform evolution of pitch cycle temporal structure during interpolation. Experimental results show a clear improvement in the accuracy of decoded pitch cycle lengths and in the reproduction of periodicity in general. It is also shown that waveform reproduction can be significantly improved by vector quantising sets of optimal combination coefficients (OCC) aimed at maximising the similarity between interpolated and target signal segments. Both time domain waveform similarity and frequency domain spectral envelope similarity derived OCC are tested. Subjective assessment suggests a general preference for non-linear interpolation methods and the scheme using frequency domain derived OCC with perceptual weighting provided the best subjective preference.

[1]  W. Bastiaan Kleijn,et al.  A speech coder based on decomposition of characteristic waveforms , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[2]  G. Lockhart,et al.  Non-linear prototype waveform interpolation for voiced speech encoding , 1995 .

[3]  W.B. Kleijn,et al.  Transformation and decomposition of the speech signal for coding , 1994, IEEE Signal Processing Letters.

[4]  Allen Gersho,et al.  Advances in speech and audio compression , 1994, Proc. IEEE.

[5]  G. B. Lockhart,et al.  Non-linear interpolation in prototype waveform interpolation (PWI) encoders , 1994 .

[6]  K. W. Tang,et al.  Variable frame length prototype waveform interpolation for low bit rate speech coding , 1993 .

[7]  W. Bastiaan Kleijn,et al.  Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[8]  W. Bastiaan Kleijn,et al.  Methods for waveform interpolation in speech coding , 1991, Digit. Signal Process..

[9]  Willem Bastiaan Kleijn,et al.  Continuous representations in linear predictive coding , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[10]  Kuldip K. Paliwal,et al.  Speech coding at 4 kb/s and lower using single-pulse and stochastic models of LPC excitation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[11]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.