An Evolutionary Approach to Speech Quality Estimation

In this paper we have employed a genetic programming (GP) based symbolic regression approach to estimate the speech quality as a function of impairments due to IP network and low bitrate coding. A main advantage of GP is that it can produce human-readable results in the form of analytical expressions. Moreover, GP is capable of weeding out irrelevant parameters and concentrating on the most salient ones. These features of GP make our results superior to the past research based on Artificial Neural Networks (ANNs) by Sun and Ifeachor, Mohamed et. al. and on lookup tables by Hoene et. al. We have used PESQ as a reference for evolutionary modeling. The results of proposed models show a high correlation with PESQ. Moreover, our models are suitable for real-time and non-intrusive estimation of VoIP quality.

[1]  Hossam Afifi,et al.  Integrating networks measurements and speech quality subjective scores for control purposes , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[2]  Lawrence Davis,et al.  Adapting Operator Probabilities in Genetic Algorithms , 1989, ICGA.

[3]  Edmund K. Burke,et al.  On improving genetic programming for symbolic regression , 2005, 2005 IEEE Congress on Evolutionary Computation.

[4]  Gerardo Rubino,et al.  A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques , 2004 .

[5]  Maarten Keijzer,et al.  Scaled Symbolic Regression , 2004, Genetic Programming and Evolvable Machines.

[6]  Lingfen Sun,et al.  Perceived speech quality prediction for voice over IP-based networks , 2002, 2002 IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333).

[7]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[8]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[9]  Sean Luke,et al.  Lexicographic Parsimony Pressure , 2002, GECCO.

[10]  Adam Wolisz,et al.  A Perceptual Quality Model for Adaptive VoIP Applications , 2004 .