Perception and Analysis of Selected Auditory Events with Frequency-Dependent Directions

Relatively little is known about complex auditory events caused by multiple simultaneous sources. In order to gain insight into this topic, the perception of wide-band (200-1179-Hz) noise and click train stimuli was examined with subjective tests focusing on perceived spatial distribution. By reproducing different frequency bands of the stimuli from loudspeakers at different azimuth directions, the spatial content of the overall stimulus was varied in 15 test cases. The subjects were required to indicate those loudspeakers that they perceived as radiating sound in each case. The results suggest that the highest and lowest frequencies of the stimuli were more perceptually significant than the middle frequency region. The test cases were never perceived as being more than half the actual width of the source ensemble. The order of the critical-band signals in the loudspeaker setup had a minor effect on the overall width. When a click train stimulus was used instead of continuous noise, the perceived width was reduced significantly. Cross-correlation-based auditory modeling techniques were also examined for their ability to predict the subjective results and were found to be not entirely suitable for the purpose.

[1]  Ville Pulkki,et al.  Localization of virtual sources in multichannel audio reproduction , 2005, IEEE Transactions on Speech and Audio Processing.

[2]  Richard M. Stern,et al.  Lateralization and detection of low‐frequency binaural stimuli: Effects of distribution of internal delay , 1996 .

[3]  A. H. Marshall,et al.  Spatial impression due to early lateral reflections in concert halls: The derivation of a physical measure , 1981 .

[4]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[5]  D R Perrott,et al.  Judgments of sound volume: effects of signal duration, level, and interaural characteristics on the perceived extensity of broadband noise. , 1982, The Journal of the Acoustical Society of America.

[6]  T. Griffiths,et al.  What is an auditory object? , 2004, Nature Reviews Neuroscience.

[7]  M. H. Heycock,et al.  Papers , 1971, BMJ : British Medical Journal.

[8]  Elyse S Sussman,et al.  Integration and segregation in auditory scene analysis. , 2005, The Journal of the Acoustical Society of America.

[9]  Remo Guidieri Res , 1995, RES: Anthropology and Aesthetics.

[10]  T Houtgast,et al.  Spectro-temporal integration in signal detection. , 1990, The Journal of the Acoustical Society of America.

[11]  B. Grothe,et al.  New roles for synaptic inhibition in sound localization , 2003, Nature Reviews Neuroscience.

[12]  Francis Rumsey,et al.  Subjective audio quality trade-offs in consumer multichannel audio-visual delivery systems. Part I: Effects of high frequency limitation. , 2002 .

[13]  Richard M. Stern,et al.  MODELS OF BINAURAL PERCEPTION , 1999 .

[14]  C. Faller,et al.  Source localization in complex listening situations: selection of binaural cues based on interaural coherence. , 2004, The Journal of the Acoustical Society of America.

[15]  M. Gardner Image fusion, broadening, and displacement in sound location. , 1969, The Journal of the Acoustical Society of America.

[16]  Edwin G. Boring,et al.  Auditory Theory with Special Reference to Intensity, Volume, and Localization , 1926 .

[17]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[18]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[19]  R. Meddis,et al.  Implementation details of a computation model of the inner hair‐cell auditory‐nerve synapse , 1990 .

[20]  Michael J. Denham,et al.  A Model of Auditory Streaming , 1995, NIPS.

[21]  Francis Rumsey,et al.  Spatial quality evaluation for reproduced sound: terminology, meaning and a scene-based paradigm , 2002 .