Flexible frequency decompositions for cosine-modulated filter banks

We investigate the use of nonuniform cosine-modulated filter banks for audio coding. A rate-distortion framework is employed, similar to the work in Herley et al. (1994), to select the filter bank structure from a large library of possible frequency decompositions. A new flexible frequency decomposition algorithm is proposed that jointly optimizes the filter bank structure and the bit allocation over the subband channels. Experimental results for both synthetic and real audio signals are provided. The new algorithm shows significant improvements in comparison with fixed uniform frequency decompositions, but special care has to be taken to reduce the size of the decomposition overhead.

[1]  Dimitri P. Bertsekas,et al.  Dynamic Programming: Deterministic and Stochastic Models , 1987 .

[2]  Bernd Edler Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen , 1989 .

[3]  R. Heusdens,et al.  Subband merging in cosine-modulated filter banks , 2003, IEEE Signal Processing Letters.

[4]  Michael T. Orchard,et al.  Flexible tree-structured signal expansions using time-varying wavelet packets , 1997, IEEE Trans. Signal Process..

[5]  P. P. Vaidyanathan,et al.  Cosine-modulated FIR filter banks satisfying perfect reconstruction , 1992, IEEE Trans. Signal Process..

[6]  P. Noll,et al.  A new orthonormal wavelet packet decomposition for audio coding using frequency-varying modulated lapped transforms , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[7]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[8]  Marcus Purat,et al.  Audio coding with a dynamic wavelet packet decomposition based on frequency-varying modulated lapped transforms , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  K. Ramchandran,et al.  Flexible time segmentations for time-varying wavelet packets , 1994, Proceedings of IEEE-SP International Symposium on Time- Frequency and Time-Scale Analysis.

[10]  K Ramchandran,et al.  Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[11]  Kannan Ramchandran,et al.  Tilings of the time-frequency plane: construction of arbitrary orthogonal bases and fast tiling algorithms , 1993, IEEE Trans. Signal Process..

[12]  Richard Heusdens,et al.  Rate-distortion optimal sinusoidal modeling of audio and speech using psychoacoustical matching pursuits , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Seymour Shlien,et al.  The modulated lapped transform, its time-varying forms, and its applications to audio coding standards , 1997, IEEE Trans. Speech Audio Process..