Binaural Ambisonics: Its optimization and applications for auralization

To better understand acoustic environment and the resulting auditory perception, it is essential to capture, analyze, and reproduce a sound field as a three-dimensional physical phenomenon because spatial aspects of auditory perception play important roles in various situations in our lives. Some approaches have been proposed to achieve the three-dimensional capture and reproduction of acoustic fields. Among them, Higher-Order Ambisonics (HOA) based on spherical harmonics expansion enables the capture and reproduction of a directivity pattern of incoming sound waves. On the basis of HOA, three-dimensional auditory space can be presented to a listener typically via a spherical loudspeaker array. In addition, binaural synthesis emulating the loudspeaker presentation enables HOA reproduction with a set of headphones or several loudspeakers by employing crosstalk cancellation. Thus, we are developing an HOA-based binaural reproduction/auralization system with head tracking. This system is aimed at realizing the reproduction and auralization of a sound field, including one excited by the listener’s own voice. In this paper, we review the topics related to the reproduction and auralization of the sound field and introduce the HOA-based binaural synthesis system we have developed, as well as our works on sweet-spot expansion in HOA decoding and selfvoice reproduction/auralization.

[1]  Franz Zotter,et al.  Spatial and Timbral Fidelities of Binaural Ambisonics Decoders for Main Microphone Array Recordings , 2019 .

[2]  Tobias Lentz Dynamic crosstalk cancellation for binaural synthesis in virtual reality environments , 2006 .

[3]  Shuichi Sakamoto,et al.  Design theory for binaural synthesis: Combining microphone array recordings and head-related transfer function datasets , 2017 .

[4]  Yôiti Suzuki,et al.  A new theory for high definition virtual acoustic display named ADVISE , 2003 .

[5]  Boaz Rafaely,et al.  RENDERING BINAURAL ROOM IMPULSE RESPONSES FROM SPHERICAL MICROPHONE ARRAY RECORDINGS USING TIMBRE CORRECTION , 2014 .

[6]  Makoto Otani,et al.  Accuracy of binaural signal in Higher-Order Ambisonics reproduction with different decoding approaches , 2019 .

[7]  W. G. Gardner,et al.  3-D Audio Using Loudspeakers , 1998 .

[8]  Boaz Rafaely,et al.  Spatial perception of sound fields recorded by spherical microphone arrays with varying spatial resolution. , 2013, The Journal of the Acoustical Society of America.

[9]  Shoken Kaneko,et al.  Development of a 64-Channel Spherical Microphone Array and a 122-Channel Loudspeaker Array System for 3D Sound Field Capturing and Reproduction Technology Research , 2018 .

[10]  Thushara D. Abhayapala,et al.  Reproduction of a plane-wave sound field using an array of loudspeakers , 2001, IEEE Trans. Speech Audio Process..

[11]  Shuichi Sakamoto,et al.  Spatial accuracy of binaural synthesis from rigid spherical microphone array recordings , 2017 .

[12]  Jerome Daniel,et al.  Further Investigations of High-Order Ambisonics and Wavefield Synthesis for Holophonic Sound Imaging , 2003 .

[13]  William L. Martens,et al.  A system for simulating room acoustical environments for one’s own voice , 2012 .

[14]  M. Otani,et al.  Reproduction accuracy of Higher-Order Ambisonics with Max-rE and/or least norm solution in decoding , 2019, Acoustical Science and Technology.

[15]  H. Vogel A better way to construct the sunflower head , 1979 .

[16]  Christoph Pörschmann,et al.  Binaural Reproduction of Plane Waves With Reduced Modal Order , 2014 .

[17]  Makoto Otani,et al.  Fast calculation system specialized for head-related transfer function based on boundary element method. , 2006, The Journal of the Acoustical Society of America.

[18]  Boaz Rafaely,et al.  Open-Sphere Designs for Spherical Microphone Arrays , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Makoto Otani,et al.  Auditory Artifacts due to Switching Head-Related Transfer Functions of a Dynamic Virtual Auditory Display , 2008, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[20]  Boaz Rafaely,et al.  Binaural Reproduction of Finite Difference Simulations Using Spherical Array Processing , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[21]  Boaz Rafaely,et al.  Spectral equalization in binaural signals represented by order-truncated spherical harmonics. , 2017, The Journal of the Acoustical Society of America.

[22]  Larry S. Davis,et al.  High Order Spatial Audio Capture and Its Binaural Head-Tracked Playback Over Headphones with HRTF Cues , 2005 .

[23]  A. J. Berkhout,et al.  A Holographic Approach to Acoustic Control , 1988 .

[24]  Filippo Maria Fazi,et al.  Evaluation of Ambisonics Decoding Methods with Experimental Measurements , 2014 .

[25]  Franz Zotter,et al.  All-Round Ambisonic Panning and Decoding , 2012 .

[26]  S. Ise A principle of sound field control based on the Kirchhoff-Helmholtz integral equation and the theory of inverse systems , 1999 .

[27]  Gavin Kearney,et al.  Diffuse-Field Equalisation of Binaural Ambisonic Rendering , 2018, Applied Sciences.

[28]  Jerome Daniel,et al.  Ambisonics Encoding of Other Audio Formats for Multiple Listening Conditions , 1998 .

[29]  Stefan Weinzierl,et al.  Comparison of Head-Related Transfer Functions Pre-Processing Techniques for Spherical Harmonics Decomposition , 2018 .

[30]  Makoto Otani,et al.  Numerical study on source-distance dependency of head-related transfer functions. , 2009, The Journal of the Acoustical Society of America.

[31]  E. Williams,et al.  Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography , 1999 .

[32]  Nikunj Raghuvanshi,et al.  Improving Binaural Ambisonics Decoding by Spherical Harmonics Domain Tapering and Coloration Compensation , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[33]  Simon Doclo,et al.  Smoothing individual head-related transfer functions in the frequency and spatial domains. , 2014, The Journal of the Acoustical Society of America.

[34]  Shuichi Sakamoto,et al.  Numerical Evaluation of Binaural Synthesis from Rigid Spherical Microphone Array Recordings , 2016 .

[35]  Robert Höldrich,et al.  A 3D Ambisonic Based Binaural Sound Reproduction System , 2003 .

[36]  Makoto Otani,et al.  Sound Image Localization Using Dynamic Transaural Reproduction with Non-contact Head Tracking , 2014, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[37]  Robert Höldrich,et al.  Binaural rendering of Ambisonic signals by head-related impulse response time alignment and a diffuseness constraint. , 2018, The Journal of the Acoustical Society of America.

[38]  M. R. Schroeder,et al.  Digital simulation of sound transmission in reverberant spaces (part 1) , 1969 .