Real-time voice communication over the Internet using packet path diversity

The quality of real-time voice communication over best-effort networks is mainly determined by the delay and loss characteristics observed along the network path. Because excessive playout buffering at the receiver is prohibitive, packets that arrive significantly late have to be discarded and are counted as late loss. We propose to improve the tradeoff among delay, late loss rate, and speech quality using multi-stream transmission of real-time voice over the Internet, where multiple redundant descriptions of the voice stream are sent over independent network paths. Playout of the received voice packets is scheduled by a novel multi-stream adaptive playout scheduling technique that uses a Lagrangian cost function to trade delay against loss. Experiments over the Internet suggest that packet erasure and delay jitter characteristics are largely uncorrelated across different network paths, which leads to a noticeable path diversity gain. We observe significant reductions in mean end-to-end latency and loss rates, as well as improved speech quality, when compared to FEC-protected single-path transmission at the same data rate. In addition to our Internet measurements, we analyze the performance of the proposed multi-path voice communication scheme using the ns network simulator for different network topologies, including shared network links.
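
To make the delay-versus-loss tradeoff concrete, the following is a minimal sketch of a Lagrangian playout-deadline choice for two paths. It is an illustration under stated assumptions, not the scheduler proposed in the paper: the cost J = deadline + lam * late_loss_rate, the function names, the candidate deadlines, and the delay samples are all hypothetical, and a packet is treated as lost only when both descriptions miss the deadline (the quality penalty for receiving a single description is ignored).

```python
def late_loss_rate(delays_a, delays_b, deadline):
    """Fraction of packets whose descriptions on BOTH paths arrive after the deadline."""
    late = sum(1 for a, b in zip(delays_a, delays_b) if min(a, b) > deadline)
    return late / len(delays_a)


def choose_deadline(delays_a, delays_b, lam, candidates):
    """Pick the candidate deadline (ms) minimizing J = deadline + lam * late_loss_rate.

    lam is a hypothetical Lagrange multiplier: the delay in ms one is
    willing to pay to reduce the late loss rate from 1.0 to 0.0.
    """
    return min(
        candidates,
        key=lambda d: d + lam * late_loss_rate(delays_a, delays_b, d),
    )


# Made-up one-way delays (ms) for five packets on two paths.
# Each path has one delay spike, but the spikes hit different packets.
path_a = [40, 45, 120, 50, 42]
path_b = [60, 55, 48, 130, 58]
candidates = [40, 45, 50, 60]

best_multi = choose_deadline(path_a, path_b, lam=100, candidates=candidates)
best_single = choose_deadline(path_a, path_a, lam=100, candidates=candidates)
loss_multi = late_loss_rate(path_a, path_b, best_multi)
loss_single = late_loss_rate(path_a, path_a, best_single)
```

In this toy data both schemes settle on the same deadline, but the single-path receiver still loses the packet caught in the delay spike, while the two-path receiver recovers it from the other path. That is the kind of diversity gain the abstract attributes to uncorrelated jitter. A larger `lam` pushes the chosen deadline later in exchange for fewer late losses.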
