Prediction of perceived conversational speech quality and effects of playout buffer algorithms

Perceived conversational speech quality is a key quality of service (QoS) metric for voice over IP (VoIP) applications. Speech quality is mainly affected by network impairments, such as delay, jitter and packet loss. Playout buffer algorithms are used to compensate for jitter based on a tradeoff between delay and loss, but can have a significant effect on perceived quality. The main aim in this paper is to assess how buffer algorithms affect perceived speech quality and how to choose the best algorithm and its parameters to obtain optimum perceived speech quality (in terms of an objective mean opinion score). The contributions of the paper are three-fold. First, we introduce a new methodology for predicting conversational speech quality (conversational mean opinion score or MOSc) which combines the latest ITU-T speech quality measurement algorithm (PESQ) and the concepts of the E-model. Second, we assess different playout buffer algorithms using the new MOSc metric on Internet trace data. Our findings indicate that, in general, end-to-end delay has a major effect on the selection of a buffer algorithm and its parameters. For small end-to-end delays, an algorithm that seeks to minimise loss is preferred, whereas for large end-to-end delays, an algorithm that aims at a minimum buffer delay is best. Third, we propose a modified buffer algorithm together with an adaptive parameter adjustment scheme. Preliminary results show that this can achieve an "optimum" perceived speech quality for all the traces considered. The results are based on Internet trace data measurements between UK and USA, UK and China, UK and Germany.

[1]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[2]  Michael S. Borella Measurement and interpretation of Internet packet loss , 2000, Journal of Communications and Networks.

[3]  Adrian E. Conway A passive method for monitoring voice-over-IP call quality with ITU-T objective speech quality measurement methods , 2002, 2002 IEEE International Conference on Communications. Conference Proceedings. ICC 2002 (Cat. No.02CH37333).

[4]  Athina Markopoulou,et al.  Assessment of VoIP quality over Internet backbones , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[5]  Ashok K. Agrawala,et al.  Experimental assessment of end-to-end behavior on Internet , 1993, IEEE INFOCOM '93 The Conference on Computer Communications, Proceedings.

[6]  Wenyu Jiang,et al.  Analysis of on-off patterns in VoIP and their effect on voice traffic aggregation , 2000, Proceedings Ninth International Conference on Computer Communications and Networks (Cat.No.00EX440).

[7]  Alan Clark,et al.  Modeling the effects of burst packet loss and recency on subjective voice quality , 2001 .

[8]  Jean-Chrysostome Bolot,et al.  Characterizing End-to-End Packet Delay and Loss in the Internet , 1993, J. High Speed Networks.

[9]  Henning Schulzrinne,et al.  Integrating packet FEC into adaptive voice playout buffer algorithms on the Internet , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[10]  Wenyu Jiang,et al.  QoS Measurement of Internet Real-Time Multimedia Services , 1999 .

[11]  Donald F. Towsley,et al.  Packet audio playout delay adjustment: performance bounds and algorithms , 1998, Multimedia Systems.

[12]  Henning Schulzrinne,et al.  Adaptive playout mechanisms for packetized audio applications in wide-area networks , 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[13]  Paul T. Brady,et al.  A technique for investigating on-off patterns of speech , 1965 .