Perceptual codec and interaction aware playout algorithms and quality measurements for VoIP systems

To reduce the effect of network jitter, the playout algorithm for voice streams should correctly adjust the playout delay. Conventional playout algorithms were based on network delay only: they did not consider the perceptual quality, and were not aware the codec and interactive mode. Therefore, we present two novel approaches: codec aware adaptive playout (CAAP) algorithm and interactive aware adaptive playout (IAAP) algorithm, which intent to optimize the voice quality based on codec and interactive mode respectively. The performance of CAAP and IAAP are superior to the prior algorithms in our substantial evaluation. Since no objective mechanisms for measuring the speech quality of two-way communication exist, we also propose a new quality measurement for it.

[1]  Giovanni Pau,et al.  Design and Experimental Evaluation of an Adaptive Playout Delay Control Mechanism for Packetized Audio for Use over the Internet , 2004, Multimedia Tools and Applications.

[2]  Nikolaos Laoutaris,et al.  Adaptive playout strategies for packet video receivers with finite buffer capacity , 2001, ICC 2001. IEEE International Conference on Communications. Conference Record (Cat. No.01CH37240).

[3]  Bernd Girod,et al.  Adaptive playout scheduling using time-scale modification in packet voice communications , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[4]  Henning Schulzrinne,et al.  Integrating packet FEC into adaptive voice playout buffer algorithms on the Internet , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[5]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[6]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[7]  Henning Sanneck,et al.  Concealment of lost speech packets using adaptive packetization , 1998, Proceedings. IEEE International Conference on Multimedia Computing and Systems (Cat. No.98TB100241).

[8]  Henning Schulzrinne,et al.  Adaptive playout mechanisms for packetized audio applications in wide-area networks , 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[9]  Carmen García-Mateo,et al.  Adaptive multi-rate speech coder for VoIP transmission , 2000 .

[10]  Cormac J. Sreenan,et al.  An adaptive predictor for media playout buffering , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[11]  Moshe Zukerman,et al.  QoS in best-effort networks , 2002, IEEE Commun. Mag..

[12]  Boaz Patt-Shamir,et al.  Optimal smoothing schedules for real-time streams (extended abstract) , 2000, PODC.

[13]  Telecommunications and Internet Protocol Harmonization Over Networks (TIPHON); General aspects of Quality of Service (QoS) , 1998 .

[14]  Prashant J. Shenoy,et al.  A Reliable, Adaptive Network Protocol for Video Transport , 1996, INFOCOM.

[15]  Kenneth J. Christensen,et al.  A new adaptive FEC loss control algorithm for voice over IP applications , 2000, Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086).

[16]  Kenneth J. Christensen,et al.  An algorithm for playout of packet voice based on adaptive adjustment of talkspurt silence periods , 1999, Proceedings 24th Conference on Local Computer Networks. LCN'99.

[17]  Boaz Patt-Shamir,et al.  Optimal smoothing schedules for real-time streams , 2004, PODC '00.