Speech quality measure based on auditory scene analysis

Objective speech quality measures are essential for evaluating speech codecs and VoIP systems. Recent measures are based on psychoacoustic principles. In this paper, we incorporate ideas from auditory scene analysis into the computation of a speech quality measure. Specifically, we use the notions of harmonicity, dynamics, and onset times. Both relative and absolute metrics are proposed.
