A case study of perceived listening quality of temporally interrupted VoIP service

In modern VoIP services, we often observe temporary speech interruptions during ordinary conversations. This is caused by the mobility facility and the best effort nature of data transport networks. This impairment factor has been classically de-emphasized by existing subjective and objective quality assessment techniques because it is rarely observed over legacy landline telephone systems. This paper explores the perceptual effects of discontinuity in speech communications. A series of lab-based subjective tests has been carried-out in order to understand the perceptual quality variation with respect to diverse patterns of temporal service discontinuity. In parallel, impairment conditions have been evaluated using the standardized active and passive signal-layer SQA (Speech Quality Assessment) models described in ITU-T Rec. P.862 and P.563, respectively. Our exploration indicates that both strategies estimate poorly perceived quality of interrupted speech stimulus on a sample-by-sample basis. We found that the time-alignment algorithm of original and degraded speech sequences embedded in the ITU-T Rec. P.862 SQA model plays an essential role in the observed unpredictable quality rating estimates. Moreover, the dichotomy treatment of discontinuity instances by the ITU-T Rec. P.563 SQA model constitutes a principal source of inaccuracy of estimated perceptual quality. A guideline for proper consideration of discontinuity distribution and context is presented and applied on the ITU-T Rec. P.862 SQA algorithm. This results in an improvement of its estimation performance in the context of interrupted speech sequences.

[1]  Anja Feldmann,et al.  Understanding Signal-Based Speech Quality Prediction in Future Mobile Communications , 2010, 2010 IEEE International Conference on Communications.

[2]  L. Humes,et al.  Factors influencing recognition of interrupted speech. , 2010, The Journal of the Acoustical Society of America.

[3]  Catherine Colomes,et al.  Perceived Quality of an Audio Signal Impaired by Signal Loss: Psychoacoustic Tests and Prediction Model , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[4]  Yoshiaki Tanaka,et al.  Interrupted voice quality evaluation and adaptive delay control in a voice packet communication system , 1991 .

[5]  Henning Schulzrinne,et al.  Protocols and system design, reliability and energy efficiency in peer-to-peer communication systems , 2011 .

[6]  Henning Schulzrinne,et al.  Application-layer mobility using SIP , 2000, MOCO.

[7]  G. A. Miller,et al.  The Intelligibility of Interrupted Speech , 1948 .

[8]  S. Voran Perception of Temporal Discontinuity Impairments in Coded Speech - A Proposal for Objective Estimators and Some Subjective Test Results , 2003 .

[9]  Charles Speaks,et al.  Intelligibility of temporally interrupted speech. , 1971, The Journal of the Acoustical Society of America.

[10]  J.I. Alonso,et al.  Effects of handover on Voice quality in wireless convergent networks , 2007, 2007 IEEE Radio and Wireless Symposium.

[11]  Deniz Başkent,et al.  Effects of envelope discontinuities on perceptual restoration of amplitude-compressed speech. , 2009, The Journal of the Acoustical Society of America.

[12]  Nicolas Côté Integral and Diagnostic Intrusive Prediction of Speech Quality , 2011, T-Labs Series in Telecommunication Services.