Real-Time, Non-intrusive Evaluation of VoIP

Speech quality, as perceived by the users of Voice over Internet Protocol (VoIP) telephony, is critically important to the uptake of this service. VoIP quality can be degraded by network layer problems (delay, jitter, packet loss). This paper presents a method for real-time, non-intrusive speech quality estimation for VoIP that emulates the subjective listening quality measures based on Mean Opinion Scores (MOS). MOS provide the numerical indication of perceived quality of speech. We employ a Genetic Programming based symbolic regression approach to derive a speech quality estimation model. Our results compare favorably with the International Telecommunications Union-Telecommunication Standardization (ITU-T) PESQ algorithm which is the most widely accepted standard for speech quality estimation. Moreover, our model is suitable for real-time speech quality estimation of VoIP while PESQ is not. The performance of the proposed model was also compared to the new ITU-T recommendation P.563 for non-intrusive speech quality estimation and an improved performance was observed.

[1]  Lingfen Sun,et al.  Subjective and Objective Speech Quality Evaluation under Bursty Losses , 2002 .

[2]  Edmund K. Burke,et al.  On improving genetic programming for symbolic regression , 2005, 2005 IEEE Congress on Evolutionary Computation.

[3]  Scott Pennock,et al.  Accuracy of the Perceptual Evaluation of Speech Quality (PESQ) algorithm , 2002 .

[4]  Gerardo Rubino,et al.  A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques (in MESAQUIN'04, Prague, June 2004) , 2004 .

[5]  Lawrence Davis,et al.  Adapting Operator Probabilities in Genetic Algorithms , 1989, ICGA.

[6]  Gerardo Rubino,et al.  A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques , 2004 .

[7]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[8]  Hossam Afifi,et al.  Integrating networks measurements and speech quality subjective scores for control purposes , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[9]  Adam Wolisz,et al.  A Perceptual Quality Model for Adaptive VoIP Applications , 2004 .

[10]  Wai C. Chu,et al.  Speech Coding Algorithms: Foundation and Evolution of Standardized Coders , 2003 .

[11]  Wai-Chor So,et al.  Speech coding algorithms : foundation and evolution of standardized coders , 2003 .

[12]  Maarten Keijzer,et al.  Scaled Symbolic Regression , 2004, Genetic Programming and Evolvable Machines.

[13]  Lingfen Sun,et al.  Perceived speech quality prediction for voice over IP-based networks , 2002, 2002 IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333).

[14]  Sean Luke,et al.  Lexicographic Parsimony Pressure , 2002, GECCO.

[15]  Wai C. Chu,et al.  Speech Coding Algorithms , 2003 .