Feature Extraction for the Prediction of Multichannel

This paper seeks to present an algorithm for the pre- diction of frontal spatial fidelity and surround spatial fidelity of multichannel audio, which are two attributes of the subjective pa- rameter called basic audio quality. A number of features chosen to represent spectral and spatial changes were extracted from a set of recordings and used in a regression model as independent variables for the prediction of spatial fidelities. The calibration of the model was done by ridge regression using a database of scores obtained from a series of formal listening tests. The statistically significant features based on interaural cross correlation and spec- tral features found from an initial model were employed to build a simplified model and these selected features were validated. The results obtained from the validation experiment were highly cor- related with the listening test scores and had a low standard error comparable to that encountered in typical listening tests. The ap- plicability of the developed algorithm is limited to predicting the basic audio quality of low-pass filtered and down-mixed recordings (as obtained in listening tests based on a multistimulus test para- digm with reference and two anchors: a 3.5-kHz low-pass filtered signal and a mono signal).

[1]  Mark Franklin Davis History of Spatial Coding , 2003 .

[2]  S. Link,et al.  Bias in Quantifying Judgments , 1989 .

[3]  Tomasz Letowski,et al.  Sound Quality Assessment: Concepts and Criteria , 1989 .

[4]  Eunmi Oh,et al.  Audio adaptation according to usage environment and perceptual quality metrics , 2005, IEEE Transactions on Multimedia.

[5]  Masayuki Morimoto The Role of Rear Loudspeakers in Spatial Impression , 1997 .

[6]  James W. Beauchamp,et al.  Synthesis by Spectral Amplitude and "Brightness" Matching of Analyzed Musical Instrument Tones , 1981 .

[7]  Thilo Thiede,et al.  A New Perceptual Quality Measure for Bit-Rate Reduced Audio , 1996 .

[8]  John G. Beerends,et al.  A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .

[9]  Leo L. Beranek,et al.  Interaural cross‐correlation, lateral fraction, and low‐ and high‐frequency sound levels as measures of acoustical quality in concert halls , 1995 .

[10]  Andrzej Dobrucki,et al.  Proposed Changes to the Methods of Objective, Perceptual Based Evaluation of Compressed Speech and Audio Signals , 2004 .

[11]  Francis Rumsey,et al.  On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality. , 2005, The Journal of the Acoustical Society of America.

[12]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[13]  B. Paillard,et al.  PERCEVAL: Perceptual Evaluation of the Quality of Audio Signals , 1992 .

[14]  Charles D. Creusere,et al.  Evaluating low bitrate scalable audio quality using advanced version of PEAQ and energy equalization approach , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[15]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[16]  Jon A. Beracoechea-Álava,et al.  Coding Strategies and Quality Measure for Multichannel Audio , 2004 .

[17]  Matti Karjalainen,et al.  A new auditory model for the evaluation of sound quality of audio systems , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Thomas Sporer Objective Audio Signal Evaluation-Applied Psychoacoustics for Modeling the Perceived Quality of Digital Audio , 1997 .

[19]  Karlheinz Brandenburg Evaluation of Quality for Audio Encoding at Low Bit Rates , 1987 .

[20]  Francis Rumsey,et al.  Comparison of Basic Audio Quality and Timbral and Spatial Fidelity Changes Caused by Limitation of Bandwidth and by Down-Mix Algorithms in 5.1 Surround Audio Systems , 2005 .

[21]  Bill Gardner,et al.  HRTF Measurements of a KEMAR Dummy-Head Microphone , 1994 .

[22]  G. A. Soulodre,et al.  Objective measures of listener envelopment , 1995 .

[23]  Andy Field,et al.  Discovering statistics using SPSS, 2nd ed. , 2005 .

[24]  Jammalamadaka Introduction to Linear Regression Analysis (3rd ed.) , 2003 .

[25]  Francis Rumsey,et al.  Frequency dependency of the relationship between perceived auditory source width and the interaural cross-correlation coefficient for time-invariant stimuli. , 2005, The Journal of the Acoustical Society of America.

[26]  Matti Karajalainen,et al.  A binaural auditory model for sound quality measurements and spatial hearing studies , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[27]  Jayme Garcia Arnal Barbedo,et al.  Strategies to Increase the Applicability of Methods for Objective Assessment of Audio Quality , 2004 .

[28]  Jean-Bernard Rault,et al.  A Perceptual Model Applied to Audio Bit-Rate Reduction , 1995 .