High resolution spherical quantization of sinusoidal parameters using a perceptual distortion measure

Sinusoidal modelling is a key technology in low rate audio coding, and methods for efficient quantization of sinusoidal parameters are therefore of high importance. We derive analytical formulas for the optimal entropy constrained unrestricted spherical quantizers for amplitude, phase and frequency, using a perceptual distortion measure. This is done both for a single sinusoid, and for multiple sinusoids distributed over multiple segments. The quantizers minimize a high-resolution approximation of the expected distortion, while the corresponding quantization indices satisfy an entropy constraint. The quantizers turn out to be flexible and of low complexity, in the sense that they can be determined easily for varying bit rate requirements, without any sort of retraining or iterative procedures. In objective and subjective comparison tests, the proposed method is shown to outperform an existing state-of-the-art sinusoidal quantization scheme, where quantization of frequency parameters is done independently.

[1]  Richard Heusdens,et al.  A new psychoacoustical masking model for audio coding applications , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Pim Korten,et al.  High rate spherical quantization of sinusoidal parameters , 2004, 2004 12th European Signal Processing Conference.

[3]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[4]  R. Vafin,et al.  Sinusoidal modeling using psychoacoustic-adaptive matching pursuits , 2002, IEEE Signal Processing Letters.

[5]  Martin Vetterli,et al.  Optimal time segmentation for signal modeling and compression , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  W. Bastiaan Kleijn,et al.  Entropy-constrained polar quantization and its application to audio coding , 2005, IEEE Transactions on Speech and Audio Processing.

[7]  Heiko Purnhagen Advances in parametric audio coding , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[8]  Teresa H. Y. Meng,et al.  A 6Kbps to 85Kbps scalable audio coder , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).