Paper information - Perceptual Matching Pursuit for Audio Coding

Perceptual Matching Pursuit for Audio Coding

This paper introduces a Perceptual Matching Pursuit (PMP) algorithm for audio coding. A masking model has been developed and integrated into the matching pursuit algorithm to account for the characteristics of the human auditory system. As a result, only an audible kernel is extracted at each iteration. Moreover, unlike the conventional matching pursuit algorithm, PMP stops decomposing an audio signal once no audible component is left in the residual. We used ITU-R PEAQ to compare audio material decomposed by PMP and by matching pursuit. Objective scores for PMP are higher by up to 1 unit. A semi-formal listening test confirmed the objective scores and showed the perceptual superiority of PMP over the matching pursuit algorithm.
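The two ideas in the abstract (extract only audible kernels, and stop once nothing audible remains in the residual) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not describe the masking model, so it is replaced here by a fixed, hypothetical per-atom audibility threshold (`thresholds`), and the function name, the toy dictionary, and the toy signal are assumptions chosen for illustration only.

```python
def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def perceptual_matching_pursuit(signal, atoms, thresholds, max_iters=100):
    """Greedy decomposition that extracts only 'audible' atoms.

    atoms      : list of unit-norm vectors (the dictionary).
    thresholds : hypothetical audibility threshold per atom; a stand-in
                 for a real masking model, which is NOT reproduced here.
    Returns the list of (atom_index, coefficient) picks and the residual.
    """
    residual = list(signal)
    picks = []
    for _ in range(max_iters):
        best_k, best_c = None, 0.0
        for k, atom in enumerate(atoms):
            c = dot(residual, atom)
            # Consider an atom only if its projection exceeds its threshold.
            if abs(c) > thresholds[k] and abs(c) > abs(best_c):
                best_k, best_c = k, c
        if best_k is None:
            # No audible component left in the residual: stop early.
            break
        picks.append((best_k, best_c))
        residual = [r - best_c * a for r, a in zip(residual, atoms[best_k])]
    return picks, residual


# Toy dictionary: the standard basis of R^3 (assumed for illustration).
atoms = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
signal = [1.0, 0.5, 0.01]

# With a threshold of 0.1 the weak third component is never extracted.
pmp_picks, pmp_residual = perceptual_matching_pursuit(signal, atoms, [0.1] * 3)

# With zero thresholds the loop behaves like plain matching pursuit
# and keeps extracting until the residual is empty.
mp_picks, mp_residual = perceptual_matching_pursuit(signal, atoms, [0.0] * 3)

print(pmp_picks)  # [(0, 1.0), (1, 0.5)]
print(len(mp_picks))  # 3
```

In this toy case the thresholded run stops after two kernels, while the unthresholded run spends a third iteration on a component that the (assumed) threshold marks as inaudible. A real coder would presumably derive the thresholds from the signal itself rather than fix them in advance.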

Hossein Najaf-Zadeh | Ramin Pichevar | Louis Thibault | Hassan Lahdili
