Adjusting The Perceived Distance Of Virtual Speech Sources By Modifying Binaural Room Impulse Responses

Effective control of the perceived location of virtual sound sources is an important aspect of auditory displays. While room acoustics modelling may be used to produce cues related to the sound source and listener location in a space, in many real-time applications it is more feasible to utilize ready-made room impulse responses. This paper looks at how the perception of distance can be affected by modifying the temporal envelopes of room impulse responses. Two measured binaural room impulse responses were modified by amplifying or attenuating different portions of them before convolving them with speech samples. Listeners were asked to judge the relative distances between these virtual speech sources presented over headphones. The results suggest that the perception of distance is more effectively altered by modifying an early-tolate energy ratio, where approximately 50–100 ms of the impulse response is included in the early energy, than by directly modifying the traditional direct-to-reverberant energy ratio.

[1]  Pavel Zahorik,et al.  Direct-to-reverberant energy ratio sensitivity. , 2002, The Journal of the Acoustical Society of America.

[2]  S. S. Stevens On the psychophysical law. , 1957, Psychological review.

[3]  D R Begault,et al.  Headphone Localization of Speech , 1993, Human factors.

[4]  Pavel Zahorik,et al.  Assessing auditory distance perception using virtual acoustics. , 2002, The Journal of the Acoustical Society of America.

[5]  John S. Bradley,et al.  Subjective evaluation of new room acoustic measures , 1995 .

[6]  Douglas Brungart,et al.  Near-Field Virtual Audio Displays , 2002, Presence: Teleoperators & Virtual Environments.

[7]  Wolfgang Ellermeier,et al.  Directional loudness in an anechoic sound field, head-related transfer functions, and binaural summation. , 2006, The Journal of the Acoustical Society of America.

[8]  Nandini Iyer,et al.  On the minimum audible difference in direct-to-reverberant energy ratio. , 2008, The Journal of the Acoustical Society of America.

[9]  Douglas S. Brungart,et al.  Auditory localization of nearby sources in a virtual audio display , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[10]  Peter Vary,et al.  A binaural room impulse response database for the evaluation of dereverberation algorithms , 2009, 2009 16th International Conference on Digital Signal Processing.

[11]  John M. Chowning,et al.  THE SIMULATION OF MOVING SOUND SOURCES , 1970 .

[12]  Mark B. Gardner,et al.  Distance Estimation of 0° or Apparent 0°‐Oriented Speech Signals in Anechoic Space , 1969 .

[13]  D. Brungart,et al.  The effects of production and presentation level on the auditory distance perception of speech. , 2001, The Journal of the Acoustical Society of America.

[14]  D. Mershon,et al.  Intensity and reverberation as factors in the auditory perception of egocentric distance , 1975 .

[15]  A. Bronkhorst,et al.  Auditory distance perception in humans : A summary of past and present research , 2005 .

[16]  Søren H. Nielsen,et al.  Evaluation of Different Loudness Models with Music and Speech Material , 2004 .

[17]  Esteban R Calcagno,et al.  The Role of Vision in Auditory Distance Perception , 2012, Perception.

[18]  Qp. Bronkhorst,et al.  MODELING AUDITORY DISTANCE PERCEPTION IN ROOMS , 2002 .

[19]  Tammo Houtgast,et al.  Auditory distance perception in rooms , 1999, Nature.

[20]  D R Begault,et al.  Preferred Sound Intensity Increase for Sensation of Half Distance , 1991, Perceptual and motor skills.

[21]  J. S. Bradley,et al.  On the importance of early reflections for speech in rooms. , 2003, The Journal of the Acoustical Society of America.