Evaluating perceived voice quality on packet networks using different random neural network architectures

Voice over Internet Protocol (VoIP) is one of the fastest growing technologies in the world. In VoIP speech signals are transmitted over the same network used for data communications. The internet is not a robust network and is subjected to delay, jitter, and packet loss. It is very important to measure and monitor the quality of service (QoS) the users experience in VoIP networks; this is not an easy task and usually requires subjective tests. In this paper we have analyzed three non-intrusive models to measure and monitor voice quality using Random Neural Networks (RNN). A RNN is an open queuing network with positive and negative signals. We have assessed the voice quality based on various parameters i.e. delay, jitter, packet loss, and codec. In our approach we have used the Mean Opinion Score (MOS) calculated using a Perceptual Evaluation of Speech Quality (PESQ) algorithm to generate data for training the RNN model. We have studied two feed-forward models and a recurrent architecture. We have found that the simple feed-forward architecture has produced the most accurate results compared to the other two architectures.

[1]  Athina Markopoulou,et al.  Assessing the quality of voice communications over internet backbones , 2003, TNET.

[2]  Erol Gelenbe,et al.  Random Neural Networks with Negative and Positive Signals and Product Form Solution , 1989, Neural Computation.

[3]  Stelios Timotheou,et al.  The Random Neural Network: A Survey , 2010, Comput. J..

[4]  Gerardo Rubino,et al.  Controlling Multimedia QoS in the Future Home Network Using the PSQA Metric , 2006, Comput. J..

[5]  Zhi-Hong Mao,et al.  Function approximation with spiked random networks , 1999, IEEE Trans. Neural Networks.

[6]  Doh-Suk Kim,et al.  Perceptual model for non-intrusive speech quality assessment , 2006, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Gerardo Rubino,et al.  A study of real-time packet video quality using random neural networks , 2002, IEEE Trans. Circuits Syst. Video Technol..

[8]  Hadi Larijani,et al.  Voice Quality in VoIP Networks Based on Random Neural Networks , 2010, 2010 Ninth International Conference on Networks.

[9]  B. Goode,et al.  Voice over Internet protocol (VoIP) , 2002, Proc. IEEE.

[10]  Georgios Loukas,et al.  A Denial of Service Detector based on Maximum Likelihood Detection and the Random Neural Network , 2007, Comput. J..

[11]  Erol Gelenbe,et al.  Analog hardware implementation of the random neural network model , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[12]  Gerardo Rubino,et al.  Quality assessment of interactive voice applications , 2008 .

[13]  Nobuhiko Kitawaki,et al.  Pure Delay Effects on Speech Quality in Telecommunications , 1991, IEEE J. Sel. Areas Commun..

[14]  Erol Gelenbe Stability of the Random Neural Network Model , 1990, EURASIP Workshop.

[15]  Timothy A. Hall Objective speech quality measures for Internet telephony , 2001, SPIE ITCom.

[16]  Guy Pujolle,et al.  Parametric speech quality models for measuring the perceptual effect of network delay jitter , 2009, 2009 IEEE 34th Conference on Local Computer Networks.

[17]  Lingfen Sun,et al.  New models for perceived voice quality prediction and their applications in playout buffer optimization for VoIP networks , 2004, 2004 IEEE International Conference on Communications (IEEE Cat. No.04CH37577).

[18]  Taskin Koçak,et al.  Survey of random neural network applications , 2000, Eur. J. Oper. Res..

[19]  Yan Bai,et al.  A Study for Providing Better Quality of Service to VoIP Users , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).

[20]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[21]  Hadi Larijani,et al.  A study on QoS of VoIP networks: a random neural network (RNN) approach , 2010, SpringSim.

[22]  Erol Gelenbe,et al.  Sensible decisions based on QoS , 2003, Comput. Manag. Sci..

[23]  Erol Gelenbe,et al.  Low bit-rate video compression with neural networks and temporal subsampling , 1996, Proc. IEEE.

[24]  Efstathios D. Sykas,et al.  Voice-quality monitoring and control for VoIP , 2005, IEEE Internet Computing.

[25]  Hadi Larijani,et al.  A non-intrusive method to assess voice quality over internet , 2010, Proceedings of the 2010 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS '10).

[26]  A. G. Constantinides,et al.  Effects of packet loss on 3 toll quality speech coders , 1989 .

[27]  Erol Gelenbe,et al.  Neural network methods for volumetric magnetic resonance imaging of the human brain , 1996 .

[28]  Erol Gelenbe,et al.  Self-aware networks and QoS , 2004, Proceedings of the IEEE.

[29]  Hossam Afifi,et al.  Audio quality assessment in packet networks: an "inter-subjective" neural network model , 2001, Proceedings 15th International Conference on Information Networking.

[30]  Gerardo Rubino,et al.  Performance evaluation of real-time speech through a packet network: a random neural networks-based approach , 2004, Perform. Evaluation.

[31]  Erol Gelenbe,et al.  Traffic and video quality with adaptive neural compression , 1996, Multimedia Systems.

[32]  Erol Gelenbe,et al.  Learning in the multiple class random neural network , 2002, IEEE Trans. Neural Networks.

[33]  Lingfen Sun,et al.  Speech quality prediction for voice over Internet protocol networks , 2004 .