论文信息 - Open loop rate-distortion optimized audio coding

Open loop rate-distortion optimized audio coding

The paper addresses complexity reduced rate-distortion optimized audio coding under rate constraint. A technique where distortion minimizing coding templates, chosen from a set of templates, are jointly selected for a set of segments. This optimization requires knowledge of rate-distortion pairs for all segments, and for each coding template, which is often costly to obtain. The proposed framework exchanges true rate-distortion pairs with predicted ones, thereby allowing for complexity reduction. The prediction is based on a property vector extracted for each segment, from which distortion predictions, using Gaussian mixture models, are performed. Here, we evaluate the proposed framework in a sinusoidal coding context. The results show that the proposed framework can increase the distortion performance, compared to a fixed sinusoidal coding scheme.

Søren Holdt Jensen | Mads Græsbøll Christensen | Fredrik Nordén

[1] W. Bastiaan Kleijn,et al. Towards optimal quantization in multistage audio coding , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] P. Prandoni. Optimal segmentation techniques for piecewise stationary signals , 1999 .

[3] R. Vafin,et al. Sinusoidal modeling using psychoacoustic-adaptive matching pursuits , 2002, IEEE Signal Processing Letters.

[4] S. van de Par,et al. Rate-distortion efficient amplitude modulated sinusoidal audio coding , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[5] Richard Heusdens,et al. A new psychoacoustical masking model for audio coding applications , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] Douglas Keislar,et al. Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[7] S.H. Jensen,et al. Property vector based distortion estimation , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[8] Richard Heusdens,et al. Rate-distortion optimal sinusoidal modeling of audio and speech using psychoacoustical matching pursuits , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.