A computational model for the estimation of localisation uncertainty

A computational model for prediction of localisation uncertainty of phantom auditory sources is proposed. The interaural level and time difference pairs due to point sources in free field are used as a reference. The mismatch between these “natural” pairs and interaural time and level difference pairs elicited by phantom sources is quantified by means of the 0.5-norm distance, which is justified on psychoacoustic grounds. The model is validated by results of subjective listening tests, achieving a high level of correlation with experimental data.

[1]  Ville Pulkki,et al.  Localization of virtual sources in multichannel audio reproduction , 2005, IEEE Transactions on Speech and Audio Processing.

[2]  Jerome Daniel,et al.  Ambisonics Encoding of Other Audio Formats for Multiple Listening Conditions , 1998 .

[3]  W. G. Gardner,et al.  HRTF measurements of a KEMAR , 1995 .

[4]  W. Gaik,et al.  Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[5]  H. Gaskell The precedence effect , 1983, Hearing Research.

[6]  Zoran Cvetkovic,et al.  Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[8]  Florian Wickelmaier,et al.  Perceptual Audio Evaluation - Theory, Method and Application , 2006 .

[9]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[10]  Søren Bech,et al.  Perceptual Audio Evaluation-Theory, Method and Application: Bech/Perceptual Audio Evaluation-Theory, Method and Application , 2006 .

[11]  C. Faller,et al.  Source localization in complex listening situations: selection of binaural cues based on interaural coherence. , 2004, The Journal of the Acoustical Society of America.

[12]  Elena Deza,et al.  Encyclopedia of Distances , 2014 .

[13]  Francis Rumsey,et al.  Localization Curves for a Regularly-Spaced Octagon Loudspeaker Array , 2009 .

[14]  Jerome Daniel,et al.  Spatial Sound Encoding Including Near Field Effect: Introducing Distance Coding Filters and a Viable, New Ambisonic Format , 2003 .

[15]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[16]  S. van de Par,et al.  The normalized interaural correlation: accounting for NoS pi thresholds obtained with Gaussian and "low-noise" masking noise. , 1999, The Journal of the Acoustical Society of America.

[17]  Malcolm Slaney,et al.  An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank , 1997 .

[18]  R. Duda,et al.  Range dependence of the response of a spherical head model , 1998 .

[19]  Mark A. Poletti,et al.  A Unified Theory of Horizontal Holographic Sound Systems , 2000 .

[20]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[21]  Gary S. Kendall,et al.  The Decorrelation of Audio Signals and Its Impact on Spatial Imagery , 1995 .

[22]  Benjamin Bernfeld,et al.  Attempts for Better Understanding of the Directional Stereophonic Listening Mechanism , 1973 .

[23]  J. Daniel,et al.  Représentation de champs acoustiques, application à la transmission et à la reproduction de scènes sonores complexes dans un contexte multimédia , 2000 .

[24]  RECOMMENDATION ITU-R BS.1534-1 - Method for the subjective assessment of intermediate quality level of coding systems , 2003 .