Paper Information - Direct MDCT Domain Psychoacoustic Modeling

Direct MDCT Domain Psychoacoustic Modeling

We extend the recently proposed spectral-integration-based psychoacoustic model for sinusoidal distortions to the MDCT domain. The estimated masking threshold additionally depends on the sub-band spectral flatness measure of the signal, which accounts for the non-sinusoidal distortion introduced by masking. The expressions for the masking threshold are derived, and the validity of the proposed model is established through a perceptual transparency test on audio clips. Test results indicate that the new model achieves transparent-quality reconstruction. The model is compared with the MPEG psychoacoustic models in terms of the estimated perceptual entropy (PE). The results show that the proposed model predicts a lower PE than the other models.
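
To make the "sub-band spectral flatness measure" concrete, here is a minimal sketch in Python that computes MDCT coefficients for one frame and then the spectral flatness (geometric mean over arithmetic mean of the coefficient power) in each sub-band. It is an illustration only, not the paper's masking threshold expressions: the function names, the uniform band edges, the `eps` floor, and the absence of an analysis window are all assumptions made for this example.

```python
import math


def mdct(frame):
    """Direct O(N^2) MDCT: 2N input samples -> N coefficients (no window applied)."""
    two_n = len(frame)
    n = two_n // 2
    n0 = (n + 1) / 2.0  # standard MDCT phase offset, 1/2 + N/2
    return [
        sum(frame[t] * math.cos(math.pi / n * (t + n0) * (k + 0.5))
            for t in range(two_n))
        for k in range(n)
    ]


def spectral_flatness(coeffs, eps=1e-12):
    """Geometric mean / arithmetic mean of coefficient power.

    Close to 1 for a flat (noise-like) band, close to 0 for a peaky (tonal) band.
    eps is an assumed floor that keeps log() defined for zero coefficients.
    """
    power = [c * c + eps for c in coeffs]
    geometric = math.exp(sum(math.log(p) for p in power) / len(power))
    arithmetic = sum(power) / len(power)
    return geometric / arithmetic


def subband_sfm(coeffs, band_edges):
    """Flatness per sub-band; band_edges are hypothetical coefficient indices."""
    return [
        spectral_flatness(coeffs[lo:hi])
        for lo, hi in zip(band_edges[:-1], band_edges[1:])
    ]


if __name__ == "__main__":
    # A 64-sample frame holding a single sinusoid (assumed test signal).
    frame = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]
    coeffs = mdct(frame)                       # 32 MDCT coefficients
    print(subband_sfm(coeffs, [0, 8, 16, 24, 32]))
```

A real coder would typically apply an analysis window before the transform and group coefficients into perceptually motivated bands; the uniform grouping above is used only to keep the sketch short.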

K. Suresh | T.V. Sreenivas

[1] John Princen, et al. Analysis/Synthesis filter bank design based on time domain aliasing cancellation, 1986, IEEE Trans. Acoust. Speech Signal Process.

[2] Peter Kabal, et al. Perceptual coding of narrow-band audio signals at low rates, 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3] G. Brink. Detection of Tone Pulse of Various Durations in Noise of Various Bandwidths, 1964.

[4] S. Buus, et al. Decision rules in detection of simple and complex tones, 1986, The Journal of the Acoustical Society of America.

[5] Bernd Edler, et al. Efficient Coding of Excitation Patterns Combined with a Transform Audio Coder, 2005.

[6] Roy D. Patterson, et al. The sound of a sinusoid: Spectral models, 1994.

[7] Miikka Vilermo, et al. Modified Discrete Cosine Transform: Its Implications for Audio Coding and Error Concealment, 2003.

[8] J. D. Johnston, et al. Estimation of perceptual entropy using noise masking criteria, 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.

[9] James D. Johnston, et al. Transform coding of audio signals using perceptual noise criteria, 1988, IEEE J. Sel. Areas Commun.

[10] Jesper Jensen, et al. A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration, 2005, EURASIP J. Adv. Signal Process.

[11] A. W. Johnson, et al. Adaptive transform coding incorporating Time Domain Aliasing Cancellation, 1987, Speech Commun.