A method for quantitative evaluation of audio quality over packet networks and its comparison with existing techniques

We have recently proposed a novel, non–intrusive, real– time approach to measuring the quality of an audio (or speech) stream transmitted over a packet network. The proposed approach takes into account the diversity of the factors which affect audio quality, including encoding parameters and network impairments. The goal of this method is to overcome the limitations of the quality assessment techniques currently available in the literature, such as the low correlation with subjective measurements, or the need to access the original signal, which precludes real–time applications. Our approach correlates well with human perception, it is not computationally intensive, does not need to access the original signal, and can work with any set of parameters that affect the perceived quality, including parameters such as FEC, which are usually not taken into account in other methods. It is based on the use of a Random Neural Network (RNN), which is trained to assess audio quality as an average human being. In this paper we compare the performance of the proposed method with that of other assessment techniques found in the literature.

[1]  Erol Gelenbe,et al.  Learning in the multiple class random neural network , 2002, IEEE Trans. Neural Networks.

[2]  Donald F. Towsley,et al.  Measurement and modelling of the temporal dependence in packet loss , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[3]  S. Voran,et al.  Estimation of perceived speech quality using measuring normalizing blocks , 1997, 1997 IEEE Workshop on Speech Coding for Telecommunications Proceedings. Back to Basics: Attacking Fundamental Problems in Speech Coding.

[4]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[5]  Gerardo Rubino,et al.  A study of real-time packet video quality using random neural networks , 2002, IEEE Trans. Circuits Syst. Video Technol..

[6]  Erol Gelenbe Stability of the Random Neural Network Model , 1990, EURASIP Workshop.

[7]  John G. Beerends,et al.  A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .

[8]  A.W. Rix,et al.  The perceptual analysis measurement system for robust end-to-end speech quality assessment , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  Peter G. Harrison,et al.  G-networks: new queueing models with additional control capabilities , 1995, SIGMETRICS '95/PERFORMANCE '95.

[10]  Wonho Yang,et al.  Enhanced modified bark spectral distortion (embsd): an objective speech quality measure based on audible distortion and cognition model , 1999 .

[11]  Erol Gelenbe,et al.  Random Neural Networks with Negative and Positive Signals and Product Form Solution , 1989, Neural Computation.

[12]  Gerardo Rubino,et al.  Performance evaluation of real-time speech through a packet network: a random neural networks-based approach , 2004, Perform. Evaluation.

[13]  A. Rix Advances in objective quality assessment of speech over analogue and packet based networks , 1999 .

[14]  Timothy A. Hall Objective speech quality measures for Internet telephony , 2001, SPIE ITCom.

[15]  Jean C. Bolot,et al.  The Case for FEC-based Error Control for Packet Audio in the Internet , 1997 .

[16]  David Hands,et al.  A Study of the Impact of Network Loss and Burst Size on Video Streaming Quality and Acceptability , 1999, IDMS.