Audio coding using steady state harmonics and residuals

In this paper, we address the problem of inter- and intra-frame frequency leakage in harmonic-based audio coding approaches. Specifically, we first use an FFT to estimate the harmonic parameters. Because of the limited frequency resolution of the windowed FFT, each measured frequency is uncertain to within one frequency bin. To reduce this uncertainty, we use the FFT estimates as seeds for a least-squares minimization problem and solve for the optimal harmonic parameters. The resulting residual should be essentially free of harmonics and consist only of attacks and noise. We preserve the attacks and code them separately. The remaining residual noise is modeled and coded using wavelet transforms and noise modeling. We also present a solution to the problem of inter-frame frequency fading and highlight the benefit of using a rectangular coordinate representation of the harmonic data. The improved coder that we propose can be viewed as an object-based approach to audio coding and leads to perceptually pleasant progressive transmission and representation schemes for multimedia applications.
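The seeding and refinement step can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: a single stationary sinusoid per frame, a Hann analysis window, and a shrinking grid search standing in for whatever least-squares solver the authors actually use. The function names `fit_rectangular` and `refine_frequency` are hypothetical. For a fixed frequency, writing the partial in rectangular coordinates (a*cos + b*sin) rather than amplitude and phase makes the fit linear in the unknowns, which is one plausible reading of the benefit mentioned above.

```python
import numpy as np

def fit_rectangular(x, t, freq):
    """Linear least-squares fit of x ~ a*cos(2*pi*f*t) + b*sin(2*pi*f*t)
    at a fixed frequency. Returns (a, b) and the residual signal."""
    basis = np.column_stack([np.cos(2 * np.pi * freq * t),
                             np.sin(2 * np.pi * freq * t)])
    coeffs, *_ = np.linalg.lstsq(basis, x, rcond=None)
    residual = x - basis @ coeffs
    return coeffs, residual

def refine_frequency(x, fs, n_rounds=6, n_grid=21):
    """Seed the frequency from the FFT peak, then search within one bin
    of the current estimate for the frequency with the smallest
    least-squares residual energy (assumed search strategy)."""
    n = len(x)
    t = np.arange(n) / fs
    spectrum = np.abs(np.fft.rfft(x * np.hanning(n)))
    seed = np.argmax(spectrum) * fs / n   # resolution limited to fs / n
    best = seed
    half_width = fs / n                   # one FFT bin
    for _ in range(n_rounds):
        candidates = np.linspace(best - half_width, best + half_width, n_grid)
        errors = [np.sum(fit_rectangular(x, t, f)[1] ** 2) for f in candidates]
        best = candidates[int(np.argmin(errors))]
        half_width /= (n_grid - 1) / 2    # shrink to the grid spacing
    coeffs, residual = fit_rectangular(x, t, best)
    return seed, best, coeffs, residual

# Synthetic frame: one partial whose frequency falls between FFT bins.
fs, n, f_true = 8000.0, 1024, 440.3
t = np.arange(n) / fs
frame = 0.7 * np.cos(2 * np.pi * f_true * t) - 0.2 * np.sin(2 * np.pi * f_true * t)
seed, refined, coeffs, residual = refine_frequency(frame, fs)
print(seed, refined, coeffs, np.sum(residual ** 2))
```

In this toy setting the FFT seed can be off by up to half a bin, while the refined estimate leaves a residual that is close to zero. A real frame would contain several partials, attacks, and noise, so the residual would carry those components instead.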
