The creation of a binaural spatialization tool

The main focus of the research presented within this thesis is, as the title suggests, binaural spatialization. Binaural technology and, especially, the binaural recording technique are not particularly recent. Nevertheless, the interest in this technology has lately become substantial due to the increase in the calculation power of personal computers, which started to allow the complete and accurate real-time simulation of three-dimensional sound-fields over headphones. The goals of this body of research have been determined in order to provide elements of novelty and of contribution to the state of the art in the field of binaural spatialization. A brief summary of these is found in the following list: • The development and implementation of a binaural spatialization technique with Distance Simulation, based on the individual simulation of the distance cues and Binaural Reverb, in turn based on the weighted mix between the signals convolved with the different HRIR and BRIR sets; • The development and implementation of a characterization process for modifying a BRIR set in order to simulate different environments with different characteristics in terms of frequency response and reverb time; • The creation of a real-time and offline binaural spatialization application, implementing the techniques cited in the previous points, and including a set of multichannel(and Ambisonics)-to-binaural conversion tools. • The performance of a perceptual evaluation stage to verify the effectiveness, realism, and quality of the techniques developed, and • The application and use of the developed tools within both scientific and artistic “case studies”. In the following chapters, sections, and subsections, the research performed between January 2006 and March 2010 will be described, outlining the different stages before, during, and after the development of the software platform, analysing the results of the perceptual evaluations and drawing conclusions that could, in the future, be considered the starting point for new and innovative research projects.

[1]  W A Yost,et al.  Binaural modulation detection interference. , 1997, The Journal of the Acoustical Society of America.

[2]  Jyri Huopaniemi,et al.  VIRTUAL ACOUSTICS AND 3-D SOUND IN MULTIMEDIA SIGNAL PROCESSING , 2000 .

[3]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[4]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[5]  Juha Merimaa,et al.  Spatial Impulse Response Rendering II: Reproduction of Diffuse Sound and Listening Tests , 2006 .

[6]  Pablo F. Hoffmann,et al.  Some Observations on Sensitivity to HRTF Magnitude , 2008 .

[7]  Adriano Farina,et al.  Real-Time Auralization Employing a Not-Linear, Not-Time-Invariant Convolver , 2007 .

[8]  Keith D. Martin Estimating azimuth and elevation from interaural differences , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[9]  Philip A. Nelson,et al.  Influence of Individual Head-Related Transfer Function on the Performance of Virtual Acoustic Imaging Systems , 1998 .

[10]  V. Ralph Algazi,et al.  Motion-Tracked Binaural Sound for Personal Music Players , 2005 .

[11]  Sunil Bharitkar,et al.  Selective signal cancellation for multiple-listener audio applications using eigenfilters , 2003, IEEE Trans. Multim..

[12]  R. Duda,et al.  Approximating the head-related transfer function using simple geometric models of the head and torso. , 2002, The Journal of the Acoustical Society of America.

[13]  Scott H. Foster,et al.  Measuring HRTFs in a reflective environment , 1994 .

[14]  William L. Martens,et al.  Simulating the Cues of Spatial Hearing in Natural Environments , 1984, ICMC.

[15]  Gary S. Kendall Directional Sound Processing in Stereo Reproduction , 1992, ICMC.

[16]  J. C. Middlebrooks,et al.  Human sound localization at near-threshold levels , 2005, Hearing Research.

[17]  Juha Merimaa,et al.  Applications of a 3-D Microphone Array , 2002 .

[18]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[19]  Hareo Hamada,et al.  Experiments on a System for the Synthesis of Virtual Acoustic Sources , 1996 .

[20]  Henrik Møller,et al.  Audibility of direct switching between head-related transfer functions , 2008 .

[21]  Jacob W. Scarpaci,et al.  A real‐time virtual auditory system for spatially dynamic perception research , 2004 .

[22]  Ville Pulkki,et al.  Compensating Displacement of Amplitude-panned Virtual Sources , 2002 .

[23]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[24]  Brian F. G. Katz International Round Robin on Room Acoustical Impulse Response Analysis Software 2004 , 2004 .

[25]  Elizabeth M. Wenzel,et al.  Localization in Virtual Acoustic Displays , 1992, Presence: Teleoperators & Virtual Environments.

[26]  Ville Pulkki,et al.  Localization of virtual sources in multichannel audio reproduction , 2005, IEEE Transactions on Speech and Audio Processing.

[27]  David R. Perrott,et al.  Auditory and Visual Localization: Two Modalities, One World , 1993 .

[28]  William L. Martens,et al.  Perceptual evaluation of filters controlling source direction : Customized and generalized HRTFs for binaural synthesis , 2003 .

[29]  Khoa-Van Nguyen,et al.  Spatial audition in a static virtual environment : the role of auditory-visual interaction , 2009, J. Virtual Real. Broadcast..

[30]  Elizabeth M. Wenzel,et al.  Effect of Increasing System Latency on Localization of Virtual Sounds , 1999 .

[31]  Etienne Corteel Synthesis of Directional Sources Using Wave Field Synthesis, Possibilities, and Limitations , 2007, EURASIP J. Adv. Signal Process..

[32]  John C Middlebrooks,et al.  Vertical-plane sound localization probed with ripple-spectrum noise. , 2003, The Journal of the Acoustical Society of America.

[33]  Ville Pulkki,et al.  Spatial sound generation and perception by amplitude panning techniques , 2001 .

[34]  Matti Karjalainen,et al.  Acoustic Positioning and Head Tracking Based on Binaural Signals , 2004 .

[35]  H. Steven Colburn,et al.  Principal components analysis interpolation of head related transfer functions using locally‐chosen basis functions , 2005 .

[36]  V. Ralph Algazi,et al.  Estimation of a Spherical-Head Model from Anthropometry , 2001 .

[37]  John A. White,et al.  A SYSTEM FOR REAL-TIME VIRTUAL AUDITORY SPACE , 2005 .

[38]  T. Anderson,et al.  Binaural and spatial hearing in real and virtual environments , 1997 .

[39]  C. Kyriakakis Virtual microphones and virtual loudspeakers for multichannel audio , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[40]  David A. Burgess Techniques for low cost spatial audio , 1992, UIST '92.

[41]  Ray Meddis,et al.  The Role of Binaural and Fundamental Frequency Difference cues in the Identification of Concurrently Presented Vowels , 1994 .

[42]  M. Karjalainen,et al.  About room response equalization and dereverberation , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[43]  Bill Gardner,et al.  HRTF Measurements of a KEMAR Dummy-Head Microphone , 1994 .

[44]  Jean-Marc Jot,et al.  Real-time spatial processing of sounds for music, multimedia and interactive human-computer interfaces , 1999, Multimedia Systems.

[45]  Robert Höldrich,et al.  A LIBRARY FOR REALTIME 3D BINAURAL SOUND REPRODUCTION IN PURE DATA (PD) , 2005 .

[46]  J. C. Middlebrooks,et al.  Psychophysical customization of directional transfer functions for virtual sound localization. , 2000, The Journal of the Acoustical Society of America.

[47]  W. Yost Comments on "Lateralization and the binaural masking-level difference" [G. B. Henning, J. Acoust. Soc. Am. 55, 1259-1263 (1974)]. , 1975, The Journal of the Acoustical Society of America.

[48]  Tapio Lokki,et al.  Evaluation of Geometry-based Parametric Auralization , 2002 .

[49]  Athanasios Mouchtaris,et al.  Virtual Microphones for Multichannel Audio Resynthesis , 2003, EURASIP J. Adv. Signal Process..

[50]  Michael Zyda,et al.  NPSNET: a multi-player 3D virtual environment over the Internet , 1995, I3D '95.

[51]  Michael Zyda,et al.  The laboratory for human interaction in the virtual environment , 1996, VRST.

[52]  Jie Huang,et al.  A model-based sound localization system and its application to robot navigation , 1999, Robotics Auton. Syst..

[53]  W. Yost Weber’s fraction for the intensity of pure tones presented binaurally , 1972 .

[54]  Hareo Hamada,et al.  Multi-Channel Sound Reproduction Using a Four-Ear Dummy Head , 1997 .

[55]  Gary S. Kendall,et al.  The Simulation of Three-Dimensional Localization Cues for Headphone Listening , 1981, ICMC.

[56]  Nick Zacharov,et al.  Subjective Evaluation of Virtual Home Theatre Sound Systems for Loudspeakers and Headphones , 2004 .

[57]  Noboru Ohnishi,et al.  Building ears for robots: Sound localization and separation , 1997, Artificial Life and Robotics.

[58]  V. Pulkki,et al.  MULTICHANNEL REPRODUCTION OF LOW FREQUENCIES , 2004 .

[59]  Angelo Farina,et al.  Silence Sweep: A Novel Method for Measuring Electroacoustical Devices , 2009 .

[60]  Robert Höldrich,et al.  3D binaural sound reproduction using a virtual ambisonic approach , 2003, IEEE International Symposium on Virtual Environments, Human-Computer Interfaces and Measurement Systems, 2003. VECIMS '03. 2003.

[61]  Juha Merimaa,et al.  Spatial Impulse Response Rendering I: Analysis and Synthesis , 2005 .

[62]  Jukka Ahonen,et al.  Directional Audio Coding with Stereo Microphone Input , 2009 .

[63]  Ewan A. Macpherson,et al.  A Computer Model of Binaural Localization for Stereo Imaging Measurement , 1989 .

[64]  E. Langendijk,et al.  Sound localization in the presence of one or two distracters. , 2001, The Journal of the Acoustical Society of America.

[65]  William L. Martens,et al.  A Spatial Sound Processor for Loudspeaker and Headphone Reproduction , 1990 .

[66]  Constantine Trahiotis,et al.  Functions of the Binaural System , 2007 .

[67]  Juha Merimaa,et al.  Training of Listeners for Evaluation of Spatial Attributes of Sound , 2004 .

[68]  Henrik Møller,et al.  Audibility of Spectral Differences in Head-Related Transfer Functions , 2006 .

[69]  Jerome Daniel,et al.  Further Investigations of High-Order Ambisonics and Wavefield Synthesis for Holophonic Sound Imaging , 2003 .

[70]  Flemming Christensen,et al.  Directional resolution of head-related transfer functions required in binaural synthesis , 2005 .

[71]  Lauri Savioja,et al.  Modeling Techniques for Virtual Acoustics , 1999 .

[72]  V. Ralph Algazi,et al.  High-Frequency Interpolation for Motion-Tracked Binaural Sound , 2006 .