This correspondence uses a pitch synchronous wavelet transform (PSWT) as an alternative characteristic waveform decomposition method for the waveform interpolation (WI) paradigm. The proposed method has the benefit of providing additional scalability in quantization than the existing WI decomposition to meet desired quality requirements. The PSWT is implemented as a quadrature mirror filter bank and decomposes the characteristic waveform surface into a series of reduced time resolution surfaces. Efficient quantization of these surfaces is achieved by exploiting their perceptual importance and inherent transmission rate requirements. The multiresolution representation has the additional benefit of more flexible parameter quantization, allowing a more accurate description of perceptually important scales, especially at higher coding rates. The proposed PSWT-WI coder is very well suited to high quality speech storage applications.
[1]
Joe F. Chicharo,et al.
Low delay multi-level decomposition and quantisation techniques for WI coding
,
1999,
1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[2]
W.B. Kleijn,et al.
Transformation and decomposition of the speech signal for coding
,
1994,
IEEE Signal Processing Letters.
[3]
Allen Gersho,et al.
Variable dimension spectral coding of speech at 2400 bps and below with phonetic classification
,
1995,
1995 International Conference on Acoustics, Speech, and Signal Processing.
[4]
Gianpaolo Evangelista,et al.
Pitch-synchronous wavelet representations of speech and music signals
,
1993,
IEEE Trans. Signal Process..