Objective Measurement of Perceived Auditory Quality in Multichannel Audio Compression Coding Systems

Objective quality assessment methods have been used widely for the evaluation of audio coding systems. However, even though many different competing multichannel audio compression coding systems are being developed, most current quality assessment methods only predict results for monaural or stereo signals. A prediction method is introduced that can be used for multichannel audio compression coding systems. The method introduces three variables as measures of the degradations in spatial quality: interaural time difference (ITD) distortion, interaural level difference (ILD) distortion, and interaural cross-correlation coefficient (IACC) distortion. Simultaneously ten model output variables proposed in ITU-R BS. 1387-1 are extracted from binaural signals that are synthesized using head-related transfer functions. The prediction model is trained and verified using results from subjective listening tests of multichannel audio compression coding systems that were performed by participants in the MPEG audio group. This new model, using the three interaural and ten nonspatial statistics, shows encouraging results in the prediction of perceived quality.

[1]  Ernst Eberlein,et al.  Analysis Tool for Realtime Measurements Using Perceptual Criteria , 1992 .

[2]  G. C. Stecker Rate-limited, but accurate, central processing of interaural time differences in modulated high-frequency sounds. Focus on: "neural sensitivity to interaural envelope delays in the inferior colliculus of the guinea pig". , 2005, Journal of neurophysiology.

[3]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[4]  E R Hafter,et al.  Detection of interaural differences of intensity in trains of high-frequency clicks as a function of interclick interval and number. , 1983, The Journal of the Acoustical Society of America.

[5]  Francis Rumsey,et al.  Initial Developments of an Objective Method for the Prediction of Basic Audio Quality for Surround Audio Recordings , 2006 .

[6]  Francis Rumsey,et al.  Systematic Evaluation of Perceived Spatial Quality , 2003 .

[7]  Florian Wickelmaier,et al.  Perceptual Audio Evaluation - Theory, Method and Application , 2006 .

[8]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[9]  Jean-Bernard Rault,et al.  A Perceptual Model Applied to Audio Bit-Rate Reduction , 1995 .

[10]  Tim Brookes,et al.  An Investigation Into Head Movements Made When Evaluating Various Attributes of Sound , 2007 .

[11]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[12]  Jon A. Beracoechea-Álava,et al.  Coding Strategies and Quality Measure for Multichannel Audio , 2004 .

[13]  Thomas Sporer Objective Audio Signal Evaluation-Applied Psychoacoustics for Modeling the Perceived Quality of Digital Audio , 1997 .

[14]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[15]  Philip H Smith,et al.  Coincidence Detection in the Auditory System 50 Years after Jeffress , 1998, Neuron.

[16]  Christof Faller,et al.  Binaural cue coding-Part I: psychoacoustic fundamentals and design principles , 2003, IEEE Trans. Speech Audio Process..

[17]  B. Paillard,et al.  PERCEVAL: Perceptual Evaluation of the Quality of Audio Signals , 1992 .

[18]  Thilo Thiede,et al.  A New Perceptual Quality Measure for Bit-Rate Reduced Audio , 1996 .

[19]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[20]  John G. Beerends,et al.  A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .

[21]  Tomasz Letowski,et al.  Sound Quality Assessment: Concepts and Criteria , 1989 .

[22]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[23]  William C. Treurniet Simulation of Individual Listeners with an Auditory Model , 1996 .

[24]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[25]  Barbara G. Shinn-Cunningham,et al.  Prediction of Perceived Quality in Multi-Channel Audio Compression Coding Systems , 2007 .

[26]  Leslie R Bernstein,et al.  Enhancing sensitivity to interaural delays at high frequencies by using "transposed stimuli". , 2002, The Journal of the Acoustical Society of America.

[27]  B. Shinn-Cunningham,et al.  Neural representation of source direction in reverberant space , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).