论文信息 - Audio coding based on rate distortion and perceptual optimization

Audio coding based on rate distortion and perceptual optimization

The time-frequency tiling, bit allocation and the quantizer of most perceptual coding algorithms is either fixed or controlled by a perceptual mode. The large variety of existing audio signals, each exhibiting different coding requirements due to their different temporal and spectral fine-structure suggests to use a signal-adaptive algorithm. The framework which is described in this is paper makes use of a signal-adaptive wavelet filterbank which allows to switch any node of the wavelet-packet tree individually. Therefore each subband can have an individual time- segmentation and the overall time-frequency tiling can be adapted to the signal using optimization techniques. A rate- distortion optimality can be defined which will minimize the distortion for a given rate in every subband, based on a perceptual model. Due to the additivity of the rate and distortion measure over disjoint covers of the input signal, an overall cost function including the switching cost for the filterbank switching can be defined. By the use of dynamic programming techniques, the wavelet-packet tree can be pruned base don a top-down or bottom-up 'split-merge' decision in every node of the wavelet-tree. Additionally we can profit form temporal masking due to the fact that each subband can have an individual segmentation in time without introducing time domain artifacts such as pre-echo distortion.

George S. Moschytz | Markus Erne

[1] William A. Pearlman,et al. A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[2] Ronald R. Coifman,et al. Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[3] K Ramchandran,et al. Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[4] Pierrick Philippe,et al. Wavelet packet filterbanks for low time delay audio coding , 1999, IEEE Trans. Speech Audio Process..

[5] George S. Moschytz,et al. Perceptual and Near-Lossless Audio Coding Based on a Signal-Adaptive Wavelet Filterbank , 1999 .