Assessment of objective voice quality over best-effort networks

VoIP calls transferred over dedicated bandwidth or QoS capable networks is a cost-effective alternative for PSTN in large enterprises. However, the calls made over the best effort network, such as the global Internet, suffer packet loss and jitter. In some VoIP-codecs, such as ITU G.723.1 and G.729a, there are built-in recovery mechanisms for concealing packet-based errors in the audio/speech stream. These recovery mechanisms can conceal up to 5% packet losses without significant quality degradation, as shown in this article. The 5% quality degradation is approximately within 0.5 MOS scale when compared to the original signal. Beyond 5%, the speech quality will drop gradually. The overall quality of MOS scale 3 can be maintained even with 14-17% packet loss rates. The influence of delay variation or jitter cannot be eliminated with the concealment algorithms unless the jitter time exceeds packet loss indication delay. The influence of jitter is not critical below 20ms but beyond 20ms limit its influence will decrease the speech quality very steeply. This suggests that packet losses can be recovered in normal conditions, but the influence of jitter must be eliminated somehow. The interleaving and piggybacking-based stream manipulation enhances speech quality in packet dropout situations. The guaranteed delay over the whole Internet would enhance the possibilities of VoIP to achieve success.

[1]  Xiao Su,et al.  A survey of error-concealment schemes for real-time audio and video transmissions over the Internet , 2000, Proceedings International Symposium on Multimedia Software Engineering.

[2]  Vladimir Cuperman,et al.  Classification and spectral extrapolation based packet reconstruction for low-delay speech coding , 1994, 1994 IEEE GLOBECOM. Communications: The Global Bridge.

[3]  Gilbert Held Voice over data networks , 1998 .

[4]  Guido H. Petit,et al.  Assessing Voice Quality in Packet-Based Telephony , 2002, IEEE Internet Comput..

[5]  Ming-Syan Chen,et al.  Adaptive recovery techniques for real-time audio streams , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[6]  Takao Kaneko,et al.  Robust speech coding under packet-loss conditions using recovery sub-codec for broad-band IP network , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Jian Wang,et al.  Parameter interpolation to enhance the frame erasure robustness of CELP coders in packet networks , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[8]  Piet Demeester,et al.  On the influence of best-effort network conditions on the perceived speech quality of VoIP connections , 2001, Proceedings Tenth International Conference on Computer Communications and Networks (Cat. No.01EX495).

[9]  Kathryn Momtahan,et al.  Linear prediction based packet loss concealment algorithm for PCM coded speech , 2001, IEEE Trans. Speech Audio Process..

[10]  V. Zagursky,et al.  Speech signal recovery in packet-switched communication networks , 2000, 2000 10th Mediterranean Electrotechnical Conference. Information Technology and Electrotechnology for the Mediterranean Countries. Proceedings. MeleCon 2000 (Cat. No.00CH37099).

[11]  Antonio Ortega,et al.  Multiple description speech coding for robust communication over lossy packet networks , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[12]  Lawrence Wai-Choong Wong,et al.  Waveform substitution techniques for recovering missing speech segments in packet voice communications , 1986, IEEE Trans. Acoust. Speech Signal Process..

[13]  Jhing-Fa Wang,et al.  A voicing-driven packet loss recovery algorithm for analysis-by-synthesis predictive speech coders over Internet , 2001, IEEE Trans. Multim..

[14]  Tyseer Aboulnasr,et al.  Improving the performance of ITU-T G.729A for VoIP , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[15]  Geoff Huston,et al.  Quality of Service: Delivering QoS on the Internet and in Corporate Networks , 1998 .

[16]  Bor-Sen Chen,et al.  Model-based multirate representation of speech signals and its application to recovery of missing speech packets , 1997, IEEE Trans. Speech Audio Process..

[17]  Fred Halsall,et al.  Multimedia Communications , 2000 .