Voicing-aware parametric speech quality models over VoIP networks

This paper describes novel parametric speech quality models which subsume the effect of packet loss distribution and voicing feature of missing signal waves. Speech quality estimate models for voiced and unvoiced loss location patterns are developed following multiple statistical regression analysis of measurements gathered from a built speech quality assessment framework. The overall speech quality is estimated by combining voiced and unvoiced speech quality estimate scores using an expression calibrated using a large number of speech samples. The input parameters namely, mean loss durations and ratios for voiced and unvoiced packets, of speech quality estimate models are extracted at run-time using a new voicing-aware packet loss Markov model. This chain, calibrated at run-time, finely models bursty packet loss behavior over voiced and unvoiced missing speech waves. Performance evaluation study shows that our voicing-aware speech quality estimate models clearly outperform voicing-agnostic speech quality models in terms of accuracy over a wide range of conditions.

[1]  Henning Sanneck,et al.  Packet loss recovery and control for voice transmission over the Internet , 2000 .

[2]  Mark A. Greenwood,et al.  SUVING: AUTOMATIC SILENCE /UNVOICED/VOICED CLASSIFICATION OF SPEECH , 1999 .

[3]  Lingfen Sun,et al.  Impact of Packet Loss Location on Perceived Speech Quality , 2001 .

[4]  Akira Takahashi,et al.  QoE Estimation Method for Interconnected VoIP Networks Employing Different Codecs , 2007, IEICE Trans. Commun..

[5]  Zhuoqun Sun,et al.  Voice quality prediction models and their application in VoIP networks , 2006, IEEE Transactions on Multimedia.

[6]  Christian Hoene,et al.  Internet Telephony over Wireless Links , 2006 .

[7]  Ayman Radwan,et al.  Non-intrusive single-ended speech quality assessment in VoIP , 2007, Speech Commun..

[8]  Raj Jain,et al.  The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.

[9]  Peter Reichl,et al.  Where packet traces meet speech samples: an instrumental approach to perceptual QoS evaluation of VoIP , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[10]  Alan Clark,et al.  Modeling the effects of burst packet loss and recency on subjective voice quality , 2001 .

[11]  Oded Ghitza,et al.  Objective Assessment of Speech and Audio Quality - Technology and Applications , 2006, IEEE Trans. Speech Audio Process..

[12]  Takanori Hayashi,et al.  Non-intrusive Quality Monitoring Method of VoIP Speech Based on Network Performance Metrics , 2006, IEICE Trans. Commun..

[13]  S. R. Broom,et al.  VoIP Quality Assessment: Taking Account of the Edge-Device , 2006, IEEE Transactions on Audio, Speech, and Language Processing.