Rate-distortion optimized hybrid sound coding

This paper presents a rate-distortion optimization approach to hybrid sound coding. A rate-distortion optimization mechanism combines the advantages of sinusoidal coding and transform coding, using a perceptually relevant distortion measure based on spectral auditory masking. As a result, the coder can adapt both to the input signal and to constraints such as the bit rate. Listening tests show that the hybrid coder performs better than either coding paradigm used on its own. The improvement reported by the listeners correlates well with the differences in distortion given by the perceptually relevant distortion measure. This confirms that the distortion measure used in the optimization is useful; moreover, it shows the feasibility of the rate-distortion optimization approach for hybrid sound coding.
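
The kind of rate-distortion trade-off described above can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a generic Lagrangian formulation (minimize D + lambda * R per segment, then search lambda to meet a bit budget), and the function names, the candidate labels, and all rate and distortion numbers are hypothetical. In a real coder the distortion values would come from a perceptual measure such as the masking-based one mentioned in the abstract.

```python
from typing import List, Tuple

# One candidate encoding of a segment: (label, rate in bits, perceptual distortion).
# Labels and numbers are illustrative assumptions, not values from the paper.
Candidate = Tuple[str, float, float]


def pick_candidates(segments: List[List[Candidate]], lam: float) -> List[Candidate]:
    """For each segment, pick the candidate minimizing the cost D + lam * R."""
    return [min(cands, key=lambda c: c[2] + lam * c[1]) for cands in segments]


def total_rate(choice: List[Candidate]) -> float:
    """Sum of the rates of the chosen candidates."""
    return sum(c[1] for c in choice)


def optimize_for_budget(
    segments: List[List[Candidate]], budget_bits: float, iters: int = 50
) -> List[Candidate]:
    """Bisect on lam to find a low-distortion choice whose total rate fits the budget.

    The total rate is non-increasing in lam, so bisection applies. If even the
    cheapest candidates exceed the budget, the cheapest choice is returned anyway.
    """
    lo, hi = 0.0, 1e6  # hi is assumed large enough to force the cheapest candidates
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if total_rate(pick_candidates(segments, mid)) > budget_bits:
            lo = mid  # too many bits: penalize rate more strongly
        else:
            hi = mid  # fits the budget: try a smaller rate penalty
    return pick_candidates(segments, hi)


# Hypothetical candidates for two segments.
example_segments: List[List[Candidate]] = [
    [("sinusoidal", 100, 4.0), ("transform", 300, 1.0), ("hybrid", 200, 1.5)],
    [("sinusoidal", 80, 1.0), ("transform", 250, 0.8), ("hybrid", 160, 0.9)],
]

if __name__ == "__main__":
    for label, rate, dist in optimize_for_budget(example_segments, budget_bits=300):
        print(label, rate, dist)
```

With these made-up numbers and a 300-bit budget, the search selects the hybrid candidate for the first segment and the sinusoidal candidate for the second, using 280 bits in total. With no rate penalty (lambda = 0) it would pick the transform candidate for both. This mirrors, in a very reduced form, how such a coder can adapt to both the signal and the bit rate constraint.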
