论文信息 - Perceptual Objective Listening Quality Assessment ( POLQA ) , The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II

Perceptual Objective Listening Quality Assessment ( POLQA ) , The Third Generation ITU-T Standard for End-to-End Speech Quality Measurement Part II – Perceptual Model

In two closely related papers we present POLQA (Perceptual Objective Listening Quality Assessment), the third generation perceptual objective speech quality measurement algorithm, standardized by the International Telecommunication Union (ITU-T) as Recommendation P.863 in 2011. This measurement algorithm simulates subjects that rate the quality of a speech fragment in a listening test using a five-point opinion scale. The new standard provides a significantly improved performance in predicting the subjective speech quality in terms of Mean Opinion Scores when compared to PESQ (Perceptual Evaluation of Speech Quality), the second generation of objective speech quality measurements. The new POLQA algorithm allows for predicting speech quality over a wide range of distortions, from “High Definition” super-wideband speech (HD Voice, audio bandwidth up to 14 kHz) to extremely distorted narrowband telephony speech (audio bandwidth down to 2 kHz), using sample rates between 48 and 8 kHz. POLQA is suited for distortions that are outside the scope of PESQ such as linear frequency response distortions, time stretching/compression as found in Voice-over-IP, certain types of codec distortions, reverberations, and the impact of playback volume. POLQA outperforms PESQ in assessing any kind of degradation making it an ideal tool for all speech quality measurements in today’s and future mobile and IP based networks. This paper (Part II) outlines the core elements of the underlying perceptual model and presents the final results.

[1] Daniel F. Hoth. Room Noise Spectra at Subscribers' Telephone Locations , 1941 .

[2] B C Moore,et al. The shape of the ear's temporal window. , 1988, The Journal of the Acoustical Society of America.

[3] John G. Beerends,et al. A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .

[4] John G. Beerends,et al. Modeling a Cognitive Aspect in the Measurement of the Quality of Music Codecs , 1994 .

[5] Kamal Ahmed,et al. Degradation decomposition of the perceived quality of speech signals on the basis of a perceptual modeling approach , 2007 .

[6] Ronald W. Schafer,et al. EW-PESQ: A Quality Assessment Method for Speech Signals Sampled at 48 kHz , 2010 .

[7] METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .