Perceived speech quality is the key metric for QoS in VoIP applications. Our primary aims are to carry out a fundamental investigation of the impact of packet loss and talkers on perceived speech quality using an objective method and, thus, to provide the basis for developing an artificial neural network (ANN) model to predict speech quality for VoIP. The impact on perceived speech quality of packet loss and of different talkers was investigated for three modern codecs (G.729, G.723.1 and AMR) using the new ITU PESQ algorithm. Results show that packet loss burstiness, loss locations/patterns and the gender of talkers have an impact. Packet size has, in general, no obvious influence on perceived speech quality for the same network conditions, but the deviation in speech quality depends on packet size and codec. Based on the investigation, we used talkspurt-based conditional and unconditional packet loss rates (which are perceptually more relevant than network packet loss rates), codec type and the gender of the talker (extracted from decoder) as inputs to an ANN model to predict speech quality directly from network parameters. Results show that high prediction accuracy was obtained from the ANN model (correlation coefficients for the test and validation datasets were 0.952 and 0.946 respectively). This work should help to develop efficient, nonintrusive QoS monitoring and control strategies for VoIP applications.
[1]
P. A. Barrett,et al.
Speech transmission over digital mobile radio channels
,
1996
.
[2]
Lingfen Sun,et al.
Impact of Packet Loss Location on Perceived Speech Quality
,
2001
.
[3]
Alan Clark,et al.
Modeling the effects of burst packet loss and recency on subjective voice quality
,
2001
.
[4]
Hossam Afifi,et al.
Audio quality assessment in packet networks: an "inter-subjective" neural network model
,
2001,
Proceedings 15th International Conference on Information Networking.
[5]
Hossam Afifi,et al.
Integrating networks measurements and speech quality subjective scores for control purposes
,
2001,
Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).
[6]
Wenyu Jiang,et al.
QoS Measurement of Internet Real-Time Multimedia Services
,
1999
.
[7]
L. R. Yamamoto,et al.
Impact of network performance parameters on the end-to-end perceived speech quality
,
1997
.