Enhanced intrusive Voice Quality Estimation (EVQE)

ITU-T P.862, “Perceptual Evaluation of Speech Quality” (PESQ), is a well-known and widely used method of intrusive objective quality assessment, but it suffers from certain weaknesses. The standard is falling behind the latest coding technologies and, as some papers have shown, lacks sufficient accuracy for reliable estimation. The proposed method of Enhanced Voice Quality Estimation (EVQE) combines several effective approaches from the fields of speech signal analysis and voice quality estimation. Each approach contributes its own perspective to a comprehensive model of the human auditory system and perception. By adapting these perceptual models to fit the requirements of an intrusive speech quality evaluation algorithm and combining them with PESQ, we found a considerable improvement in performance, in terms of both the correlation and the absolute estimation error.
