Quality assessment of multi-channel audio processing schemes based on a binaural auditory model

A perceptual, binaural audio-quality model is introduced. The model was developed for predicting any kinds of perceived spatial quality differences between two audio signals in multi-channel reproduction and audio processing schemes. It employs a recent binaural auditory model as front-end to provide perceptually relevant binaural features for the reference and test audio signal. Correlations between the binaural features of both signals are combined to an overall spatial quality measure by the use of multivariate adaptive regression splines (MARS). Furthermore, a database was generated to train and evaluate the model. The database contains various multi-channel audio signals, which were subjectively assessed in formal listening tests with 15 trained listeners. The results show different model prediction performances depending on the type of quality degradation. Combination of the proposed spatial quality measure with established monaural quality measures improved the predictive power.

[1]  Keong-Mo Sung,et al.  Perceptual Objective Quality Evaluation Method for High Quality Multichannel Audio Codecs , 2013 .

[2]  Peter Vary,et al.  An extension of the PEAQ measure by a binaural hearing model , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Alexander Lindau,et al.  Deriving content-specific measures of room acoustic perception using a binaural, nonlinear auditory model. , 2013, The Journal of the Acoustical Society of America.

[4]  Volker Hohmann,et al.  Auditory model based direction estimation of concurrent speakers from binaural signals , 2011, Speech Commun..

[5]  Marc Moonen,et al.  Theoretical Analysis of Binaural Multimicrophone Noise Reduction Techniques , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Giso Grimm,et al.  Increase and Subjective Evaluation of Feedback Stability in Hearing Aids by a Binaural Coherence-Based Noise Reduction Scheme , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Francis Rumsey,et al.  QESTRAL (Part 1): Quality Evaluation of Spatial Transmission and Reproduction Using an Artificial Listener , 2008 .

[8]  Thomas Görne Mikrofone in Theorie und Praxis , 2007 .

[9]  Birger Kollmeier,et al.  PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  C. Faller,et al.  Source localization in complex listening situations: selection of binaural cues based on interaural coherence. , 2004, The Journal of the Acoustical Society of America.

[11]  Brian C. J. Moore,et al.  Development and Validation of a Method for Predicting the Perceived Naturalness of Sounds Subjected to Spectral Distortion , 2004 .

[12]  A. Kohlrausch,et al.  Binaural processing model based on contralateral inhibition. I. Model structure. , 2001, The Journal of the Acoustical Society of America.

[13]  B Kollmeier,et al.  Binaural and monaural auditory filter bandwidths and time constants in probe tone detection experiments. , 1998, The Journal of the Acoustical Society of America.

[14]  W. T. Peake,et al.  Sound-pressure measurements in the cochlear vestibule of human-cadaver ears. , 1997, The Journal of the Acoustical Society of America.

[15]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[16]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[17]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.