Modelling Speech Quality for NB and WB SILK Codec for VoIP Applications

In the last decade, VoIP telephony has gained a tremendous popularity. Skype is one of the most successful and popular VoIP services which has inspired a new generation of VoIP and multimedia users. SILK speech codec is the latest development by Skype and has been integrated into the current version of Skype and is expected to be incorporated into new and emerging mobile devices such as iphone and soft phones. One of the major challenges in every VoIP service is to find an easily accessible objective quality model to predict/measure the perceived speech quality or the degree of user satisfaction. In this paper, we present a regression-based model to quantify the speech quality of the wideband (WB) and narrowband (NB) SILK codec for VoIP applications. The developed model uses the network level parameter (i.e., packet loss) and the application level parameter (i.e., send bit rate) to predict the perceived voice quality in terms of the Mean Opinion Score (MOS). Subjective tests were also carried out to validate the model and good accuracy was achieved (97% for wideband and 91% for narrowband). The developed model can be easily implemented in soft phones or mobile devices to predict voice quality for SILK codec in VoIP applications and can also be used for real-time adaptation and control of VoIP applications to further explore the adaptive feature of the SILK codec in future mobile devices or softphones.

[1]  Kuan-Ta Chen,et al.  Tuning Skype's Redundancy Control Algorithm for User Satisfaction , 2009, IEEE INFOCOM 2009.

[2]  Kuan-Ta Chen,et al.  Could Skype be more satisfying? a QoE-centric study of the FEC mechanism in an internet-scale VoIP system , 2010, IEEE Network.

[3]  Kuan-Ta Chen,et al.  OneClick: A Framework for Measuring Network Quality of Experience , 2009, IEEE INFOCOM 2009.

[4]  Dario Rossi,et al.  Tracking Down Skype Traffic , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[5]  Chun-Ying Huang,et al.  Quantifying Skype user satisfaction , 2006, SIGCOMM.

[6]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[7]  Anssi Rämö,et al.  Voice quality evaluation of recent open source codecs , 2010, INTERSPEECH.

[8]  Lingfen Sun,et al.  New models for perceived voice quality prediction and their applications in playout buffer optimization for VoIP networks , 2004, 2004 IEEE International Conference on Communications (IEEE Cat. No.04CH37577).

[9]  Hongli Zhang,et al.  Identification and Analysis of Skype Peer-to-Peer Traffic , 2010, 2010 Fifth International Conference on Internet and Web Applications and Services.