Exploring the Extent and Impact of Playout Adjustments within VoIP Applications on the MOS Scale

In coping with best-effort service, many VoIP applications employ adaptive playout strategies. Objective methods of speech quality assessment such as the ITU-T Recommendation P.862 (also known as Perceptual Evaluation of Speech Quality PESQ) typically do not capture distortion due to playout adjustments as they match up short segments prior to analysis. Similarly, the ITU-T E-Model does not capture the effect of delay variation and uses an average delay figure in its calculations. In this paper we explore in some detail, the extent of playout adjustments within VoIP applications and assess the likely impact on Mean Opinion Score MOS. We review the impact of various factors such as Voice Activity Detection (VAD) settings and hangover thresholds on talkspurt/silence period distribution. In this context we examine the distribution of playout adjustments resulting from various playout algorithms and assess the likely impact on MOS. We show that our hybrid playout strategy which utilises synchronised time to implement an informed fixed delay playout strategy wherever possible will significantly reduce playout adjustments and any consequent MOS degradation.

[1]  Donald F. Towsley,et al.  Packet audio playout delay adjustment: performance bounds and algorithms , 1998, Multimedia Systems.

[2]  Warren A. Montgomery,et al.  Techniques for Packet Voice Synchronization , 1983, IEEE J. Sel. Areas Commun..

[3]  JongWon Kim,et al.  Quality Enhancement of Packet Audio with Time-Scale Modification , 2002, SPIE ITCom.

[4]  Bernd Girod,et al.  Adaptive playout scheduling using time-scale modification in packet voice communications , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Liam Murphy,et al.  An evaluation of the potential of synchronized time to improve voice over IP quality , 2003, IEEE International Conference on Communications, 2003. ICC '03..

[6]  Liam Murphy,et al.  An Evaluation of Delay-Aware Receiver Playout Strategies for VoIP Applications , 2004, NETWORKING.

[7]  Henning Schulzrinne,et al.  Adaptive playout mechanisms for packetized audio applications in wide-area networks , 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[8]  Paul T. Brady,et al.  A technique for investigating on-off patterns of speech , 1965 .

[9]  Wenyu Jiang,et al.  Analysis of on-off patterns in VoIP and their effect on voice traffic aggregation , 2000, Proceedings Ninth International Conference on Computer Communications and Networks (Cat.No.00EX440).

[10]  Alan Clark,et al.  Modeling the effects of burst packet loss and recency on subjective voice quality , 2001 .

[11]  Henning Schulzrinne,et al.  Voice Communication Across the Internet: A Network Voice Terminal , 1992 .

[12]  S. Voran Perception of Temporal Discontinuity Impairments in Coded Speech - A Proposal for Objective Estimators and Some Subjective Test Results , 2003 .