Corrupted Speech Data Considered Useful: Improving Perceived Speech Quality of VoIP over Error-Prone Channels

Summary The provisioning of an appropriate level of perceptual speech quality is crucial for the successful deployment of Vo ice over the Internet Protocol (VoIP). Today’s heterogeneous multimedia networks include links that introduce bit errors into the voice data stream. These errors are detected by the IP packet transport protocol and result in packet losses which eventually degrade the speech quality. However, modern speech coding algorithms can either conceal packet losses or tolerate corrupted packets. In this paper, we investigate to which extent it makes sense to keep corrupted speech data for the special case of uniformly distributed bit errors. We simulate different transport strategies that allow the incorporation of damaged speech data into the speech decoding process. The results from an instrumental speech quality evaluation show that keeping as much damaged data as possible leads to superior performance with regard to the perceptual speech quality compared to dropping packets and using packet loss concealment.

[1]  V. Hardman,et al.  A survey of packet loss recovery techniques for streaming audio , 1998, IEEE Network.

[2]  R. Blahut Theory and practice of error control codes , 1983 .

[3]  Don Towsley,et al.  Theories and models for Internet quality of service , 2002, Proc. IEEE.

[4]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[5]  Donald F. Towsley,et al.  Packet audio playout delay adjustment: performance bounds and algorithms , 1998, Multimedia Systems.

[6]  Jens Berger Instrumentelle Verfahren zur Sprachqualitätsschätzung: Modelle auditiver Tests , 1998 .

[7]  Ghyslain Pelletier,et al.  Network Working Group Robust Header Compression (rohc): Profiles for User Datagram Protocol (udp) Lite , 2005 .

[8]  Carsten Bormann,et al.  RObust Header Compression (ROHC): Framework and four profiles: RTP, UDP, ESP, and uncompressed , 2001, RFC.

[9]  Kang G. Shin,et al.  Evolution of the Internet QoS and support for soft real-time applications , 2003, Proc. IEEE.

[10]  Alexander Raake Predicting speech quality under random packet loss: Individual impairment and additivity with other network impairments , 2004 .

[11]  Lars-Åke Larzon,et al.  The UDP-Lite Protocol , 2001 .

[12]  Søren Vang Andersen Internet Low Bit Rate Codec, IETF Internet-draft draft-ietf-avt-ilbc-codec-04.txt , 2003 .

[13]  Henning Schulzrinne,et al.  Adaptive playout mechanisms for packetized audio applications in wide-area networks , 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[14]  J.L. Massey,et al.  Theory and practice of error control codes , 1986, Proceedings of the IEEE.

[15]  Wenyu Jiang,et al.  Comparison and optimization of packet loss repair methods on VoIP perceived quality under bursty loss , 2002, NOSSDAV '02.

[16]  Anthony D. Joseph,et al.  Performance evaluation of UDP lite for cellular video , 2001, NOSSDAV '01.

[17]  Jon Postel,et al.  User Datagram Protocol , 1980, RFC.

[18]  Peter Reichl,et al.  Corrupted Speech Data Considered Useful , 2003 .

[19]  Steven Andersen,et al.  Internet Low Bit Rate Codec , 2002 .