A non-uniform modulation transform for audio coding with increased time resolution

Perceptual audio coders exploit two properties to achieve coding gain: perceptual irrelevancy and source redundancy. Recently, a two-dimensional modulation transform was introduced which efficiently extracts perceptual irrelevancy and source redundancy not accessible in a one-dimensional transform. In this paper, we propose an alternative modulation transform design with an octave-band non-uniform modulation dimension. This non-uniform modulation dimension approximately mimics the spacing of modulation filter subbands of the human auditory system, while simultaneously increasing the time resolution of the modulation transform providing improved temporal control of coding noise.

[1]  Deepen Sinha,et al.  Low bit rate transparent audio compression using adapted wavelets , 1993, IEEE Trans. Signal Process..

[2]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[3]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[4]  John Princen,et al.  Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[5]  T. Dau,et al.  Characterizing frequency selectivity for envelope fluctuations. , 2000, The Journal of the Acoustical Society of America.

[6]  Steven Greenberg,et al.  The modulation spectrogram: in pursuit of an invariant representation of speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  T. Houtgast Frequency selectivity in amplitude-modulation detection. , 1989, The Journal of the Acoustical Society of America.

[8]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[9]  Les E. Atlas,et al.  Scalable and progressive audio codec , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[10]  Henrique S. Malvar Enhancing the performance of subband audio coders for speech signals , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).