Audio quality assessment in packet networks: an "inter-subjective" neural network model

Transmitting digital audio signals in real time over packet switched networks (e.g. the Internet) has set forth the need for developing signal processing algorithms that objectively evaluate audio quality. So far, the best way to assess audio quality are subjective listening tests, the most commonly used being the mean opinion score (MOS) recommended by the International Telecommunication Union (ITU). The goal of this paper is to show how artificial neural networks (ANNs) can be used to mimic the way human subjects estimate the quality of audio signals when distorted by changes in several parameters that affect the transmitted audio quality. To validate the approach, we carried out an MOS experiment for speech signals distorted by different values of IP-network parameters (e.g. loss rate, loss distribution, packetization interval, etc.), and changes in the encoding algorithm used to compress the original signal. Our results allow us to show that ANNs can capture the nonlinear mapping, between certain characteristics of audio signals and a subjective five points quality scale, "built" by a group of human subjects when participating in an MOS experiment, creating, in this way, an "inter-subjective" neural network (INN) model that might effectively "evaluate", in real time, the audio quality in packet switched networks.

[1]  S. Dimolitsas,et al.  Objective speech distortion measures and their relevance to speech quality assessments , 1989 .

[2]  Jean-Chrysostome Bolot,et al.  End-to-end packet delay and loss behavior in the internet , 1993, SIGCOMM '93.

[3]  M. Angela Sasse,et al.  Evaluating Audio and Video Quality in Low-Cost Multimedia Conferencing Systems , 1996, Interact. Comput..

[4]  M. Angela Sasse,et al.  Measuring perceived quality of speech and video in multimedia conferencing applications , 1998, MULTIMEDIA '98.

[5]  Bernard Widrow,et al.  Neural networks: applications in industry, business and science , 1994, CACM.

[6]  Mahbub Hassan,et al.  Internet telephony: services, technical challenges, and products , 2000, IEEE Commun. Mag..

[7]  Herwig Bruneel,et al.  An Accurate Closed-Form Formula to Calculate the Dejittering Delay in Packetised Voice Transport , 2000, NETWORKING.

[8]  Methods for the subjective assessment of small impairments in audio systems , 2015 .

[9]  S. Thomas Alexander,et al.  Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.

[10]  Guido M. Schuster,et al.  Real-time voice over packet-switched networks , 1998, IEEE Netw..

[11]  Donald F. Towsley,et al.  Adaptive FEC-based error control for Internet telephony , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[12]  Fulvio Babich,et al.  A novel wide-band audio transmission scheme over the Internet with a smooth quality degradation , 2000, CCRV.

[13]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[14]  V. Hardman,et al.  A survey of packet loss recovery techniques for streaming audio , 1998, IEEE Network.

[15]  Danny DeVleeschauwer,et al.  Delay bounds for low-bit-rate voice transport over IP networks , 1999, Optics East.

[16]  Vern Paxson End-to-end internet packet dynamics , 1999, TNET.

[17]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[18]  Guido Henri Marguerite Petit,et al.  DELAY AND DISTORTION BOUNDS FOR PACKETIZED VOICE CALLS OF TRADITIONAL PSTN QUALITY , 2000 .

[19]  Vern Paxson,et al.  End-to-end Internet packet dynamics , 1997, SIGCOMM '97.

[20]  D. J. Wright Voice over ATM: an evaluation of implementation alternatives , 1996 .

[21]  Henning Schulzrinne,et al.  Integrating packet FEC into adaptive voice playout buffer algorithms on the Internet , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).